1095510.1007/10955.1572-9613Journal of Statistical PhysicsJ Stat Phys0022-47151572-9613Springer USNew Yorks10955-020-02682-1268210.1007/s10955-020-02682-1ArticleQuantum Statistical Learning via Quantum Wasserstein Natural GradientBeckerSimon1aLiWuchen2grid.5335.00000000121885934DAMTPUniversity of CambridgeWilberforce RoadCB3 0WACambridgeUKgrid.254567.70000 0000 9075 106XDepartment of MathematicsUniversity of South CarolinaColumbiaUSA
In this article, we introduce a new approach towards the statistical learning problem argminρ(θ)∈PθWQ2(ρ⋆,ρ(θ))\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathrm{argmin}_{\rho (\theta ) \in {\mathcal {P}}_{\theta }} W_{Q}^2 (\rho _{\star },\rho (\theta ))$$\end{document} to approximate a target quantum state ρ⋆\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho _{\star }$$\end{document} by a set of parametrized quantum states ρ(θ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho (\theta )$$\end{document} in a quantum L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-Wasserstein metric. We solve this estimation problem by considering Wasserstein natural gradient flows for density operators on finite-dimensional C∗\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$C^*$$\end{document} algebras. For continuous parametric models of density operators, we pull back the quantum Wasserstein metric such that the parameter space becomes a Riemannian manifold with quantum Wasserstein information matrix. Using a quantum analogue of the Benamou–Brenier formula, we derive a natural gradient flow on the parameter space. We also discuss certain continuous-variable quantum states by studying the transport of the associated Wigner probability distributions.
KeywordsQuantum transport information geometryQuantum state estimationQuantum Wasserstein information matrixQuantum Wasserstein natural gradientQuantum Schrödinger bridge problemEngineering and Physical Sciences Research Councilhttp://dx.doi.org/10.13039/501100000266EP/L016516/1BeckerSimonpublisher-imprint-nameSpringervolume-issue-count3issue-article-count20issue-toc-levels0issue-pricelist-year2021issue-copyright-holderSpringer Science+Business Media, LLC, part of Springer Natureissue-copyright-year2021article-contains-esmNoarticle-numbering-styleContentOnlyarticle-registration-date-year2020article-registration-date-month12article-registration-date-day7article-toc-levels0toc-levels0volume-typeRegularjournal-productArchiveJournalnumbering-styleContentOnlyarticle-grants-typeOpenChoicemetadata-grantOpenAccessabstract-grantOpenAccessbodypdf-grantOpenAccessbodyhtml-grantOpenAccessbibliography-grantOpenAccessesm-grantOpenAccessonline-firstfalsepdf-file-referenceBodyRef/PDF/10955_2020_Article_2682.pdfpdf-typeTypesettarget-typeOnlinePDFissue-online-date-year2021issue-online-date-month1issue-online-date-day24issue-print-date-year2021issue-print-date-month1issue-print-date-day24issue-typeRegulararticle-typeOriginalPaperjournal-subject-primaryPhysicsjournal-subject-secondaryStatistical Physics and Dynamical Systemsjournal-subject-secondaryTheoretical, Mathematical and Computational Physicsjournal-subject-secondaryPhysical Chemistryjournal-subject-secondaryQuantum Physicsjournal-subject-collectionPhysics and Astronomyopen-accesstrueIntroduction
The learning problem of quantum states, i.e. positive-definite trace class operators of unit trace, is central in modern quantum theory and commonly called quantum state tomography. The problem of quantum state estimation is ubiquitous in quantum mechanics and has a wide range of applications: This includes the analysis of optical devices [16] as well as the reliable estimation of qubit states in quantum computing [6, 24]. Until this day, there have been many recent computationally efficient approaches towards the quantum state estimation problem based on compressed sensing and machine learning methods such as [20, 40]. For a review of the most common classical approaches towards quantum state estimation, such as Maximal likelihood estimation (MLE), we refer to [36].
However, both in physics and non-commutative geometry, many problems come as a quantum state estimation problem in disguise: Over the past years, finding suitable physical descriptors for molecular structures from data has become a vast and growing area of research, cf. the review article [39] and references therein. Recently, such quantum machine learning approaches have also been based on optimization problems in Wasserstein distances, see for example [11], where a kernel ridge regression-based model relying on the Coulomb matrix is studied. The advantage of using the Wasserstein distance is that it leads to a continuous dependence on the position of the nuclei.
In said article, it has been discovered that it is key to use a suitable parametrization of the Coulomb matrix. This parametrization is ought to be invariant under 3D translations and rotations of the molecule and therefore related to the low-dimensional parametrization problem considered in this and previous articles, cf. [10]. Also, first attempts towards quantum Wasserstein generative adversarial networks have been considered in [14]. The quantum Wasserstein distance and its generalizations considered in [8, 9] have also far-reaching applications beyond quantum mechanics to the field of non-commutative probability theory which includes multivariable time series and vector-valued random variables [34]. Hence, solving the quantum state estimation problem in Wasserstein distance has become an important and widely applicable problem.
The analysis of geometric properties of the space of quantum states is called quantum information geometry and is central in the field of quantum information. The asymptotic theory of quantum state estimation and quantum information geometry has been developed in the second half of the 1980s by Nagaoka [33]. A comprehensive review of the modern field of quantum information geometry and its connection to quantum estimation can be found in [21]. In this article, we develop a new connection between these two fields based on the quantum Wasserstein metric.
It has been discovered, among others, by Otto [35], that various PDEs evolve according to the gradient flow with respect to the L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-Wasserstein metric [23]. Later, Carlen and Maas introduced in a series of articles [12, 13, 15] also quantum Wasserstein metrics for open quantum systems, satisfying a detailed balance condition. In these articles, they showed, that such open quantum systems also evolve according to the L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-Wasserstein gradient flow. Moreover, they also showed that their metric allows for a dynamical formulation extending the classical Benamou–Brenier formula [4] to the quantum setting. Here, we also mention the work by Datta and Rouzé [37, 38] for additional links to a quantum version of Ricci curvature and Fisher information functional. This analysis has been complemented by articles [8, 9] where different types of non-commutative multiplication operators are considered with favorable properties from a computational point of view. Besides, Carlen and Maas showed that for certain open quantum systems the gradient flow of the relative entropy with respect to an invariant state in the quantum Wasserstein metric coincides with the quantum evolution governed by the Lindblad equation. For continuous-variable states, a quantum transport framework with desirable physical features has been proposed in [17]. However, a dynamical formulation of this approach does not seem to exist, yet. Results on the entropy flow for open quantum systems have also been obtained in [32]. Another relevant definition of the Wasserstein distance is due to Golse et al. [19] and has been proposed in the study of uniform mean-field limits of quantum systems in the semiclassical parameter.
Recently, optimal transport gradient flows have been applied to estimation problems in classical probability theory. In particular, the parameter estimation problem of probability measures by using parameterized Wasserstein gradient flows on either Kullback–Leibler (KL) divergence, also referred to as relative entropy, or L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-Wasserstein distance has been addressed by the second author [10, 28, 29]. This leads to a joint study between optimal transport [42] and information geometry [1, 2], namely transport information geometry [26, 30]. Here, the natural gradient induced by optimal transport is first applied for statistical learning problems. Meanwhile, this approach also introduces a new estimation theory based on Wasserstein information matrix [25]. It also develops new scientific computing algorithms by the generative adversarial network to solve classical Fokker–Planck equations, in data-poor situations [27].
In this article, we present a new approach towards quantum state estimation based on L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-quantum Wasserstein gradients. We extend the study of the previous paragraph to quantum systems. We start by studying the problem of minimizing the distance with respect to a quantum Wasserstein metric d, for some fixed target density operator ρ⋆\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho _{\star }$$\end{document} over a parametrized manifold of states Pθ⊂D(H)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {P}}_{\theta } \subset {\mathscr {D}}({\mathcal {H}})$$\end{document}, i.e. we aim to identify argminρ∈Pθd(ρ⋆,ρ).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ \mathrm{argmin}_{\rho \in {\mathcal {P}}_{\theta }} d(\rho _{\star },\rho ).$$\end{document} We address the corresponding estimation problem for particular finite and infinite-dimensional quantum states. In the case of infinite-dimensional states, our approach towards statistical learning is based on the Wigner transform of continuous-variable quantum states. This makes this approach particularly tailored to experimental quantum state estimation in continuous-variable systems, where the Wigner distribution of the quantum state is approximately recovered [41]. A classical choice of the distance between probability measures is the Kullback–Leibler (KL) divergence. In classical probability, the metric induced by the L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document} Hessian of the KL divergence is the Fisher–Rao metric which provides a natural gradient descent method. The analogous concepts of relative entropy for faithful states ρ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho $$\end{document} and σ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\sigma $$\end{document}S(ρ‖σ)=-tr(ρ(log(ρ)-log(σ)))\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} S(\rho \Vert \sigma ) = - \mathrm{tr}(\rho (\log (\rho )-\log (\sigma ))) \end{aligned}$$\end{document}and Fisher information is well-established in quantum information theory, too. For finite-dimensional quantum states, our aim is then to establish low-dimensional parameterized quantum Wasserstein gradient flows based on quantum Wasserstein distances. This means we aim to find a low-dimensional representation of the minimization problem in parameter space by applying quantum Wasserstein dynamics. Our study starts by pulling back the quantum Wasserstein metric to a finite-dimensional parameter manifold, using the quantum transport (Wasserstein) information matrix. This leads to a natural gradient descent method for quantum states.
We also introduce and study a quantum analog of the Schrödinger bridge problem. As we show in this article, this problem can be solved by a quantum Benamou–Brenier’s formula with quantum Fisher information functional regularization.
Summary of novel results
We introduce the quantum transport information matrix and develop the related quantum transport/Wasserstein statistical manifold. This can be viewed as the first step of quantum transport information geometry.
We formulate the quantum transport natural gradient flow based on quantum Wasserstein statistical manifold. We apply this flow for solving the quantum statistical learning problem.
We also formulate the quantum Schrödinger bridge problem by controlling the quantum transport natural gradient flows.
We study the quantum Wasserstein statistical manifold for various finite-dimensional systems such as the quantum fermionic Fokker–Planck dynamics and more general finite-dimensional open quantum systems satisfying the detailed balance condition, as well as for continuous-variable systems with positive Wigner functions such as (mixtures of) Gaussian states.
We illustrate our results on some simple examples and also discuss how they apply to the parameter estimation problem for quantum channels.
Outline of the article In Sect. 2 we provide a brief review of classical optimal transport theory and quantum optimal transport, i.e.
In Sect. 3 we then introduce the quantum Wasserstein natural gradient 3.1, the Schrödinger bridge problem for finite-dimensional quantum systems in Sect. 3.2, and the same two for certain continuous-variable systems, including Gaussian systems, in Sect. 3.3. In Sect. 4 we discuss examples of our theory. This includes the transport problem for two Gaussian states and a fully explicit case of the fermionic Fokker–Planck equation. We finish our collection of examples by illustrating how the quantum transport information matrix can also be used to perform parameter estimation for quantum channels.
Notation We denote by states |n⟩\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\vert n \rangle $$\end{document}, for n∈N0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$n \in {\mathbb {N}}_0$$\end{document}, the canonical eigenbasis of the number operator N=a∗a\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$N= a^*a$$\end{document} where a is the standard annihilation operator. The continuous linear operators on a normed space X are denoted by L(X)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {L}}(X)$$\end{document}, the space of trace-class operators on a Hilbert space H\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {H}}$$\end{document} by TC(H).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathrm{TC}({\mathcal {H}}).$$\end{document} For a set Ω\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Omega $$\end{document} we denote by int(Ω)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathrm{int}(\Omega )$$\end{document} its interior. The set of quantum states (positive-definite operators of unit trace) on a Hilbert space H\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {H}}$$\end{document} is denoted by D(H).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathscr {D}}({\mathcal {H}}).$$\end{document} We denote the Riemannian manifold of faithful states by D+(H).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathscr {D}}_+({\mathcal {H}}).$$\end{document} We recall that ∂D(H)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\partial {\mathscr {D}}({\mathcal {H}})$$\end{document} are states with zero determinant and int(D(H))=D+(H).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathrm{int}({\mathscr {D}}({\mathcal {H}})) ={\mathscr {D}}_+({\mathcal {H}}).$$\end{document} We also write {X,Y}=XY+YX\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\{X, Y\}=XY+YX$$\end{document} for the anti-commutator and [X,Y]=XY-YX\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$[X,Y]=XY-YX$$\end{document} for the commutator. We denote the spectrum of a linear operator T by Spec(T).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${{\,\mathrm{Spec}\,}}(T).$$\end{document}
Review of Classical and Quantum Optimal Transport
Our goal is to study the problem of minimizing the distance with respect to a L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-quantum Wasserstein distance WQ2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ W^2_Q$$\end{document}, for some fixed target density operator ρ⋆\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho _{\star }$$\end{document} over a parametrized manifold of states Pθ⊂D(H)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {P}}_{\theta } \subset {\mathscr {D}}({\mathcal {H}})$$\end{document}, i.e. we aim to identify argminρ∈PθWQ2(ρ⋆,ρ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ \mathrm{argmin}_{\rho \in {\mathcal {P}}_{\theta }} W^2_Q(\rho _{\star },\rho )$$\end{document}.
For this purpose, we start in this section with a review of the classical framework and highlight similarities and differences that appear in the quantum setting. In addition, we will also employ the classical framework for the study of Wigner distributions in the continuous-variable setting.
Classical Optimal Transport
The optimal transport problem dates back to 1781 when Monge asked how to find for two probability measures f0,f1\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$f_0,f_1$$\end{document} on Ω⊂Rn\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Omega \subset {\mathbb {R}}^n$$\end{document}, with finite second moment, an optimal transport plan T:Ω→Ω\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$T:\Omega \rightarrow \Omega $$\end{document} pushing f0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$f_0$$\end{document} to f1\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$f_1$$\end{document} such that the transportation cost is minimized and for all A⊂Ω\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$A \subset \Omega $$\end{document} measurableinfT∫Ω‖x-T(x)‖2f0(x)dx:T∗f0=f1\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \inf _T \int _{\Omega } \Vert x- T(x) \Vert ^2 f_0(x) \ dx : T_{*}f_0 = f_1 \end{aligned}$$\end{document}For two probability measures with densities f0,f1\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$f_0,f_1$$\end{document} on Ω⊂Rn\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Omega \subset {\mathbb {R}}^n$$\end{document} the square of the classical L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-Wasserstein distance is defined asWcl2(f0,f1):=infπ∈Π(f0,f1)∫Ω×Ω|x-y|2dπ(x,y)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} W^2_{\mathrm{cl}}(f_0,f_1):=\inf _{\pi \in \Pi (f_0,f_1)} \int _{\Omega \times \Omega } \vert x-y \vert ^2 d\pi (x,y) \end{aligned}$$\end{document}where Π(f0,f1)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Pi (f_0,f_1)$$\end{document} is the set of all couplings of the two measures f0(x)dx\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$f_0(x) \ dx$$\end{document} and f1(x)dx.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$f_1(x) \ dx.$$\end{document}
Equivalent to (2.1), and particularly relevant for our purposes, is a dynamical formulation, given by the Benamou–Brenier formula, which states that the Wasserstein metric is given byWcl2(f0,f1)=inf∫01∫Ω|vt(x)|2dμt(x)dt\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} W^2_{\mathrm{cl}}(f_0,f_1)=\inf \int _0^1 \int _{\Omega } \vert v_t(x) \vert ^2 \ d\mu _t(x) dt \end{aligned}$$\end{document}where the infimum is taken over all pairs (μt,vt)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$(\mu _t,v_t)$$\end{document} where μt\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mu _t$$\end{document} with μ0=f0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mu _0=f_0 $$\end{document} and μ1=f1\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mu _1 = f_1 $$\end{document} is a curve of measures and vt\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$v_t$$\end{document} a time-dependent vector field satisfying∂tμt+div(vtμt)=0.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \partial _t \mu _t + \mathrm{div}(v_t \mu _t) = 0. \end{aligned}$$\end{document}On a bounded domain Ω\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Omega $$\end{document} the above formulation is replaced by the corresponding Neumann problem.
The dynamical formulation above is closely connected to a Riemannian structure on the Wasserstein space. To fix ideas, we consider the space of strictly positive densitiesD+(Ω)={f∈C∞(Ω,(0,∞)):‖f‖L1=1}.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} {\mathscr {D}}_{+}(\Omega )=\{ f \in C^{\infty }(\Omega , (0,\infty )): \Vert f \Vert _{L^1}=1\}. \end{aligned}$$\end{document}The tangent space of D+\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathscr {D}}_{+}$$\end{document} is then just given byTfD+(Ω)=σ∈C∞(Ω):∫Ωσ(x)dx=0.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} T_{f}{\mathscr {D}}_{+}(\Omega )=\left\{ \sigma \in C^{\infty }(\Omega ): \int _{\Omega } \sigma (x) \ dx =0\right\} . \end{aligned}$$\end{document}For any Φ∈C∞(Ω)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Phi \in C^{\infty }(\Omega )$$\end{document} we can then setVΦ(x):=-div(f(x)∇Φ(x))∈TfD+(Ω).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} V_{\Phi }(x) := - \mathrm{div}(f(x) \nabla \Phi (x)) \in T_{f}{\mathscr {D}}_{+}(\Omega ). \end{aligned}$$\end{document}This map provides an isomorphism, at least if Ω\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Omega $$\end{document} is compact,C∞(Ω)/R→TfD+(Ω),with[Φ]↦VΦ.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} C^{\infty }(\Omega )/ {\mathbb {R}}\rightarrow T_{f}{\mathscr {D}}_{+}(\Omega ), \text { with }[\Phi ] \mapsto V_{\Phi }. \end{aligned}$$\end{document}We can therefore define the L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-Wasserstein metric tensor by introducing:
Fisher Information Regularization and Schrödinger Bridge Problem
After the works of Monge and Kantorovich, Schrödinger proposed in 1931 a similar transport problem which is nowadays referred to as the Schrödinger bridge problem (SBP):
Given two strictly positive densities f0,f1\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$f_0,f_1$$\end{document} on a domain Ω⊂Rn\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Omega \subset {\mathbb {R}}^n$$\end{document}, considerinfm,ρ∫01∫Ωm(t,x)2f(t,x)dxdt,\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \inf _{m,\rho } \int _0^1 \int _{\Omega } \frac{m(t,x)^2}{f(t,x)} \ dx \ dt, \end{aligned}$$\end{document}where the infimum is taken over all m and f satisfying∂tf(t,x)+div(m(t,x))=βΔf(t,x),f(0,x)=f0(x),f(1,x)=f1(x)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \partial _t f(t,x) + \mathrm{div}(m(t,x)) = \beta \Delta f(t,x), \quad f(0,x)=f_0(x), f(1,x)=f_1(x) \end{aligned}$$\end{document}with the boundary condition⟨m(t,x)-∇f(t,x),n(x)⟩=0∀x∈∂Ω\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \langle m(t,x) -\nabla f(t,x), n(x) \rangle =0 \quad \forall x \in \partial \Omega \end{aligned}$$\end{document}where n(x) is the normal vector of the boundary. We emphasize that the difference between the SBP and the L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-Wasserstein metric minimization (2.2) is the presence of the diffusion term βΔ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\beta \Delta $$\end{document} in the PDE (2.6). A discussion of the viscosity limit β↓0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\beta \downarrow 0$$\end{document} and the convergence of the solution to the SBP can be found in [22].
The minimization problem (2.5) with PDE (2.6) is, as has been shown in [7, 18] equivalent to minimizing the functionalinfm,ρ∫01∫Ωm(t,x)2f(t,x)+β2(∇log(f(t,x)))2f(t,x)dxdt+2βD(f1|f0),\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \inf _{m, \rho } \int _0^1 \int _{\Omega } \left( \frac{m(t,x)^2}{f(t,x)} + \beta ^2 (\nabla \log (f(t,x)))^2 f(t,x) \right) \ dx \ dt + 2\beta {\mathcal {D}}(f_1 \vert f_0), \end{aligned}$$\end{document}with a constant term representing the differences of entropies D(f1|f0)=∫Ωf1(x)log(f1(x))-f0(x)log(f0(x))dx\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {D}}(f_1\vert f_0) = \int _{\Omega } f_1(x) \log (f_1(x))-f_0(x) \log (f_0(x))\ dx$$\end{document} and f and m are linked by the transport equation∂tf(t,x)+div(m(t,x))=0,f(0,x)=f0(x),f(1,x)=f1(x).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \partial _t f(t,x) + \mathrm{div}(m(t,x))=0, \quad f(0,x)=f_0(x), f(1,x)=f_1(x). \end{aligned}$$\end{document}The advantage of studying the functional (2.7) over (2.2) is in the additional positivity and strict convexity enforced by the contribution of the Fisher informationI(f):=∫Ω|∇log(f(x))|2f(x)dx\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} {\mathcal {I}}(f):=\int _{\Omega } \vert \nabla \log (f(x)) \vert ^2 f(x) \ dx \end{aligned}$$\end{document}in the objective functional. Numerical aspects of this minimization problem have been thoroughly discussed in [31].
Quantum Optimal Transport
Before introducing quantum analogues of the L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-Wasserstein distance (2.1), we first define a notion of coupling of quantum states:
For two density operators ρin,ρfi∈D(H)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho _{\mathrm{in}},\rho _{\mathrm{fi}} \in {\mathscr {D}}({\mathcal {H}})$$\end{document} the set of all couplings Π(ρin,ρfi)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Pi (\rho _{\mathrm{in}},\rho _{\mathrm{fi}})$$\end{document} is defined as the set of density operator valued maps that smoothly (up to endpoints) connect the two statesΠ(ρin,ρfi):={ρ∈C([0,1],D(H))∩C∞((0,1),D(H));ρ(0)=ρin,ρ(1)=ρfi}.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \Pi (\rho _{\mathrm{in}},\rho _{\mathrm{fi}}) :=\Bigg \{ \rho \in C([0,1], {\mathscr {D}}({\mathcal {H}})) \cap C^{\infty }((0,1), {\mathscr {D}}({\mathcal {H}}));\rho (0) =\rho _{\mathrm{in}}, \rho (1)=\rho _{\mathrm{fi}} \Bigg \}. \end{aligned}$$\end{document}To give the definition of the 2-Wasserstein distance for finite-dimensional quantum systems satisfying the detailed balance equation, we employ the differential calculus introduced in [15, Def. 4.7]. This framework allows us, in particular, to reformulate the evolution of finite-dimensional open quantum systems satisfying the detailed balance condition as a gradient flow of the relative entropy S(ρ||σ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$S(\rho \vert \vert \sigma )$$\end{document} where σ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\sigma $$\end{document} is the invariant state, with respect to the Wasserstein metric. Before discussing this in the context of open quantum systems satisfying the detailed balance condition, we introduce the necessary differential structure:
Differential Calculus for Quantum Systems
Let A\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {A}}$$\end{document} be a finite-dimensional von Neumann algebra with faithful positive tracial linear functional τ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\tau $$\end{document} and D+(A)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathscr {D}}_{+}({\mathcal {A}})$$\end{document} the set of faithful states.
Definition 2.2
A differential structure on A\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {A}}$$\end{document} is defined as follows:
There exists a finite index set J and for each j∈J\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$j \in J$$\end{document} a finite-dimensional von Neumann algebra Bj\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$B_j$$\end{document} with a faithful positive tracial linear functional τj.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\tau _j.$$\end{document}
For each j∈J\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$j \in J$$\end{document} there exists a pair (lj,rj)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$(l_j,r_j)$$\end{document} of unital ∗\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$*$$\end{document}-homomorphisms from A\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {A}}$$\end{document} to Bj\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$B_j$$\end{document} such that τj(lj(A))=τj(rj(A))=τ(A).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \tau _j(l_j(A))= \tau _j(r_j(A))=\tau (A). \end{aligned}$$\end{document}
Logarithmic case The quantum L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-Wasserstein distance, for the above differentiable structure, has been defined in [15, (9.1)], byWQ2(ρin,ρfi):=infρ∈Π(ρin,ρfi)∫01‖ρ′(t)‖ρ(t)2dt.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} W^2_Q(\rho _{\mathrm{in}},\rho _{\mathrm{fi}}) := \inf _{\rho \in \Pi (\rho _{\mathrm{in}},\rho _{\mathrm{fi}})} \left\{ \int _0^1 \Vert \rho '(t) \Vert ^2_{\rho (t)} dt \right\} . \end{aligned}$$\end{document}Here, we use the norm ‖Z‖ρ2=Z,Lρ(Z)L2(τ).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Vert Z \Vert ^2_{\rho }= \left\langle Z, L_{\rho }(Z) \right\rangle _{L^2(\tau )}.$$\end{document} The quantum L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-Wasserstein distance can then be expressed as a variational problem -in analogy to the classical Brenier–Benamou formula (2.2) for the classical L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-Wasserstein distance- byWQ2(ρin,ρfi):=infρ∈Π(ρin,ρfi)∫01‖∇Φ(t)‖ρ(t)2dt\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} W^2_Q(\rho _{\mathrm{in}},\rho _{\mathrm{fi}}) := \inf _{\rho \in \Pi (\rho _{\mathrm{in}}, \rho _{\mathrm{fi}})} \left\{ \int _0^1 \Vert \nabla \Phi (t) \Vert ^2_{\rho (t)} \ dt \right\} \end{aligned}$$\end{document}where Φ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Phi $$\end{document} is coupled to ρ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho $$\end{document} by the following continuity equationρ′(t)+divLρ(t)(∇Φ(t))=0.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \rho '(t) + \mathrm{div}\left( L_{ \rho (t)}(\nabla \Phi (t)) \right) =0. \end{aligned}$$\end{document}The physical interpretation of the Riemannian metric gρ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$g_{\rho }$$\end{document} is that for two faithful states ρ,σ∈D+(H)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho , \sigma \in {\mathscr {D}}_{+}({\mathcal {H}})$$\end{document}, and the quantum relative entropy, defined bySσ(ρ)=τ(ρ(log(ρ)-log(σ))),\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} S_{\sigma }(\rho ) = \tau (\rho (\log (\rho )-\log (\sigma ))), \end{aligned}$$\end{document}[15, Prop. 2.7] shows that for D denoting the derivative, the gradient (gradSσ)(ρ):=(-Δρ)DSσ(ρ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$(\mathrm{grad}S_{\sigma })(\rho ):=(-\Delta _{\rho }) DS_{\sigma }(\rho )$$\end{document}, where DSσ(ρ)=log(ρ)-log(σ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$DS_{\sigma }(\rho )= \log (\rho )-\log (\sigma )$$\end{document}, and we have(L∗ρ)(ρ)=-(gradSσ)(ρ).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} ({\mathscr {L}}^* \rho )(\rho ) = - (\mathrm{grad}S_{\sigma })(\rho ). \end{aligned}$$\end{document}This implies that the gradient flow of the entropy Sσ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$S_{\sigma }$$\end{document} with respect to the metric gρ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$g_{\rho }$$\end{document} is the dynamics of the Liouville-von Neumann equation where σ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\sigma $$\end{document} is the invariant state of the dynamics defined by L∗.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathscr {L}}^*.$$\end{document}
Anti-commutator case When instead of using the the Feynman–Kubo–Mori integral, but rather the anti-commutatorLρac(T):=12{T,ρ}\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} L^{\mathrm{ac}}_{\rho }(T):=\frac{1}{2}\{T,\rho \} \end{aligned}$$\end{document}one is lead to introduce a different L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-Wasserstein distance [8]W~Q2(ρin,ρfi):=infρ∈Π(ρin,ρfi);∫01tr(ρv(t)∗v(t))dt\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \tilde{W}^2_Q(\rho _{\mathrm{in}},\rho _{\mathrm{fi}}) := \inf _{\rho \in \Pi (\rho _{\mathrm{in}},\rho _{\mathrm{fi}});} \left\{ \int _0^1 {{\,\mathrm{tr}\,}}(\rho v(t)^*v(t)) dt \right\} \end{aligned}$$\end{document}with v∗v=∑k=1Nvk∗vk\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$v^*v= \sum _{k=1}^N v_k^* v_k$$\end{document}, where v and ρ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho $$\end{document} are coupled byρ′(t)+divLρac(v)=0:,ρ(0)=ρin,ρ(1)=ρfi.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \rho '(t) +\mathrm{div}L^{\mathrm{ac}}_{\rho }(v)=0:, \quad \rho (0) =\rho _{\mathrm{in}}, \rho (1)=\rho _{\mathrm{fi}}. \end{aligned}$$\end{document}In particular, the operator Lρac(T)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^{\mathrm{ac}}_{\rho }(T)$$\end{document} is invertible for ρ>0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho >0$$\end{document} by standard results on the solvability of Lyapunov equations which imply that the inverse is explicitly given as(Lρac)-1(S)=-∫0∞e-ρsSe-ρsds.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} (L^{\mathrm{ac}}_{\rho })^{-1}(S) = -\int _0^{\infty } e^{-\rho s} S e^{-\rho s} \ ds. \end{aligned}$$\end{document}
Fermionic Fokker–Planck Equation
Due to its analogy to classical probability theory and classical gradient flows, we start by discussing the quantum fermionic Fokker–Planck equation. Instead of just stating it within the abstract differential calculus introduced in the previous section, we will provide full details to fix ideas.
The quantum fermionic Fokker–Planck equation, is the canonical gradient flow associated with the quantum Wasserstein metric and corresponds to the classical Fokker–Planck equation1∂ρ(x,t)∂t=div(ρ(t,x)∇V(x))+βΔρ(t,x),ρ(0,x)=ρ0(x)forx∈Rd.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \frac{\partial \rho (x,t)}{\partial t} = \mathrm{div}(\rho (t,x) \nabla V(x)) + \beta \Delta \rho (t,x), \quad \rho (0,x)=\rho _0(x) \text { for } x \in {\mathbb {R}}^d. \end{aligned}$$\end{document}Under suitable growth conditions on V this equation has a unique invariant measure dμ(x)∝e-βV(x)dx.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$d\mu (x) \propto e^{-\beta V(x)}dx.$$\end{document} Carlen and Maas introduced in [12] a Riemannian metric on density operators which extends the classical L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-Wasserstein metric to the quantum setting and with respect to which the quantum evolution of the fermionic Fokker–Planck equation is a gradient flow. We will explain in section how to use this metric to define a natural gradient flow for parametric models of density operators.
Clifford Algebra
Let C\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathfrak {C}}$$\end{document} be the Clifford algebra on Rn\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbb {R}}^n$$\end{document} generated by n self-adjoint operators Qj\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$Q_j$$\end{document}, j=1,..,n\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$j=1,..,n$$\end{document} satisfying the canonical anti-commutation relations {Qi,Qj}=2δij\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\{Q_i,Q_j\}= 2\delta _{ij}$$\end{document}. The operators Qj\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$Q_j$$\end{document} are also called the fermionic degrees of freedom. Moreover, C\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathfrak {C}}$$\end{document} becomes a 2n\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$2^n$$\end{document}-dimensional Hilbert space H∼L2(τ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {H}} \sim L^2(\tau )$$\end{document} with inner product ⟨A,B⟩L2(τ):=τ(A∗B),\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\langle A,B \rangle _{L^2(\tau )}:=\tau (A^*B),$$\end{document} where we introduce the normalized trace τ(A)=2-ntrC2n(A).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\tau (A)=2^{-n} {{\,\mathrm{tr}\,}}_{{\mathbb {C}}^{2^n}}(A).$$\end{document}
The density operators D(H)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathscr {D}}({\mathcal {H}})$$\end{document} in this setting is the closed convex set of positive operators ρ∈C\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho \in {\mathfrak {C}}$$\end{document} of unit normalized trace.
We can explicitly construct matrices Qj\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$Q_j$$\end{document} solely from Pauli matricesσx=0110,σy:=0-ii0,andσz=100-1.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \sigma _x=\left( \begin{matrix} 0 &{} 1 \\ 1&{} 0 \end{matrix} \right) , \sigma _y:= \left( \begin{matrix} 0 &{} -i \\ i&{} 0 \end{matrix} \right) ,\text { and }\sigma _z =\left( \begin{matrix} 1 &{} 0 \\ 0 &{} -1 \end{matrix} \right) . \end{aligned}$$\end{document}One realization of the fermionic operators Qj\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$Q_j$$\end{document}, is by defining them as Qj:=⊗i=1nXi\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$Q_j:=\otimes _{i=1}^n X_i$$\end{document} whereXi=σzfori<j,σxfori=j,andidC2fori>j.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} X_i= {\left\{ \begin{array}{ll} \sigma _z&{}\text { for }i<j, \\ \sigma _x&{}\text { for }i=j, \text { and }\\ \mathrm{id}_{{\mathbb {C}}^2} &{}\text { for }i>j. \end{array}\right. } \end{aligned}$$\end{document}The grading operator Γ:C→C\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Gamma : {\mathfrak {C}} \rightarrow {\mathfrak {C}}$$\end{document} is the linear operator defined, for α∈{0,1}n\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\alpha \in \{0,1\}^n$$\end{document}, by Γ(Qα):=(-1)|α|Qα\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Gamma (Q^{\alpha }):=(-1)^{\vert \alpha \vert } Q^{\alpha }$$\end{document} where Qα:=∏i=1nQiαi.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$Q^{\alpha }:= \prod _{i=1}^n Q_i^{\alpha _i}.$$\end{document} The index set α∈{0,1}n\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\alpha \in \{0,1\}^n$$\end{document} is called the fermionic multi-index set. The 2n\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$2^n$$\end{document} matrices Qα\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$Q^{\alpha }$$\end{document} for α∈{0,1}n\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\alpha \in \{0,1\}^n$$\end{document} form an orthonormal system spanning C\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathfrak {C}}$$\end{document} which satisfies τ(Qα)=δ0|α|.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\tau (Q^{\alpha })= \delta _{0\vert \alpha \vert }.$$\end{document}
For two density operators ρ1,ρ2∈D(H)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho _1,\rho _2 \in {\mathscr {D}}({\mathcal {H}})$$\end{document} we define the Feynman–Kubo–Mori operator L(ρ1,ρ2):L(C)→TC(C)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L_{(\rho _1,\rho _2)}: {\mathcal {L}}({\mathfrak {C}}) \rightarrow \mathrm{TC}({\mathfrak {C}})$$\end{document} byL(ρ1,ρ2)(T):=∫01ρ11-sTρ2sds\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} L_{(\rho _1,\rho _2)}(T):= \int _0^1 \rho _1^{1-s} T \rho _2^s \ ds \end{aligned}$$\end{document}which is a contraction map into the set of trace-class operators, as Hölder’s inequality shows‖L(ρ1,ρ2)(T)‖1≤∫01‖ρ11-sTρ2s‖1ds≤∫01‖ρ11-s‖(1-s)-1‖T‖∞‖ρ2s‖s-1ds≤‖T‖∞.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \Vert L_{(\rho _1,\rho _2)}(T) \Vert _1 \le \int _0^1 \Vert \rho _1^{1-s} T \rho _2^s \Vert _1 \ ds \le \int _0^1 \Vert \rho _1^{1-s}\Vert _{(1-s)^{-1}} \Vert T \Vert _{\infty } \Vert \rho _2^s\Vert _{s^{-1}} \ ds \le \Vert T \Vert _{\infty }. \end{aligned}$$\end{document}Under the stronger assumption ρ1,ρ2∈D+(H),\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho _1,\rho _2 \in {\mathscr {D}}_+({\mathcal {H}}),$$\end{document} the operator L(ρ1,ρ2)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L_{(\rho _1,\rho _2)}$$\end{document} becomes invertible and its inverse is given by [12, Theo. 3.4]L(ρ1,ρ2)-1(T)=∫0∞(ρ1+t)-1T(ρ2+t)-1dt.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} L_{(\rho _1,\rho _2)}^{-1}(T) = \int _0^{\infty } (\rho _1+t)^{-1} T (\rho _2+t)^{-1} \ dt. \end{aligned}$$\end{document}In particular, we will just write Lρ:=L(Γ(ρ),ρ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L_{\rho }:=L_{(\Gamma (\rho ), \rho )}$$\end{document} in the sequel.
The fermionic Dirichlet form on C\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathfrak {C}}$$\end{document} is defined byF(A,A)=τ((∇A)∗∇A)=∑j=1nτ((∇jA)∗∇jA)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} {\mathcal {F}}(A,A) =\tau ((\nabla A)^* \nabla A)= \sum _{j=1}^n \tau ( (\nabla _j A)^* \nabla _j A) \end{aligned}$$\end{document}with derivatives∇j(A)=12QjA-Γ(A)Qj∈C,forj∈{1,…,n}andA∈C.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \nabla _j(A) = \frac{1}{2} \left( Q_j A - \Gamma (A) Q_j\right) \in {\mathfrak {C}},\text { for }j \in \{1,\ldots ,n \}\text { and }A \in {\mathfrak {C}}. \end{aligned}$$\end{document}The gradient ∇:C→Cn\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\nabla : {\mathfrak {C}} \rightarrow {\mathfrak {C}}^n$$\end{document} is then defined as ∇(A):=∇1(A),...,∇n(A)∈Cn\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\nabla (A) := \left( \nabla _1(A),...,\nabla _n(A)\right) \in {\mathfrak {C}}^n$$\end{document} with nullspace ker(∇)=span(id).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathrm{ker}(\nabla ) = \mathrm{span}(\mathrm{id}).$$\end{document} The L2(τ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2(\tau )$$\end{document}-adjoint of derivatives ∇j\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\nabla _j$$\end{document} is just given by∇j∗(A)=12QjA+Γ(A)Qj.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \nabla _j^*(A) = \frac{1}{2} \left( Q_j A + \Gamma (A) Q_j\right) . \end{aligned}$$\end{document}The divergence operator is defined, for A=(Aj)j∈Cn\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$A=(A_j)_j \in {\mathfrak {C}}^n$$\end{document} by div(A)=-∑j=1n∇j∗(Aj)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathrm{div}(A) =-\sum _{j=1}^n\nabla _j^*(A_j)$$\end{document}. We define the fermionic number operator N\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {N}}$$\end{document} as the self-adjoint operator associated to the Dirichlet form F(B,A)=:⟨B,NA⟩L2(τ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {F}}(B,A) =: \langle B, {\mathcal {N}} A \rangle _{L^2(\tau )}$$\end{document} where NA=-div(∇A)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {N}}A = -\mathrm{div}(\nabla A)$$\end{document} for all A∈C\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$A \in {\mathfrak {C}}$$\end{document} and ker(N)=id.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathrm{ker}({\mathcal {N}})=\mathrm{id}.$$\end{document} The dynamical semigroup generated by -N\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$-{\mathcal {N}}$$\end{document} is the quantum fermionic Fokker–Planck semigroup defined by Pt=e-tN\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$P_t=e^{-t{\mathcal {N}}}$$\end{document} which relaxes exponentially fast to its unique invariant state, the completely mixed state. In particular, N\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {N}}$$\end{document} is the generator of an ergodic Quantum Markov semigroup satisfying the detailed balance condition with respect to the completely mixed state.
This model can be casted in the differential calculus introduced in Sect. 2.4.1 by setting A:=Bj:=Cn\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {A}}:={\mathcal {B}}_j :={\mathfrak {C}}^n$$\end{document}, Vj:=Qj\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$V_j:=Q_j$$\end{document}, ωj:=0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\omega _j:=0$$\end{document}, lj:=Γ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$l_j:=\Gamma $$\end{document} and rj=id\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$r_j=\mathrm{id}$$\end{document} with derivatives as defined in (2.13) and a generator LA=2∑j=1n(QjAQj-A)=-4N.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathscr {L}}A = 2 \sum _{j=1}^n (Q_jA Q_j-A)=-4{\mathcal {N}}.$$\end{document}
Quantum Markov Semigroups with Detailed Balance Condition
In the rest of this section, we illustrate the ideas using the differential calculus in Sect. 2.4.1 in the case of Quantum Markov semigroups (Pt)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$(P_t)$$\end{document} with Lindblad generator L\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathscr {L}}$$\end{document}, in the Heisenberg picture, acting on a finite-dimensional C∗\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$C^*$$\end{document}-algebra A\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {A}}$$\end{document} satisfying the detailed balance condition (DBC). This means, that for all times t>0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$t>0$$\end{document} the operator Pt\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$P_t$$\end{document} is self-adjoint with respect to the inner product ⟨X,Y⟩1,σ:=τ(X∗σY)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\langle X, Y\rangle _{1,\sigma }:=\tau (X^* \sigma Y)$$\end{document} for some state σ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\sigma $$\end{document}. In particular, the DBC implies that σ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\sigma $$\end{document} is the unique state such that Pt∗(σ)=σ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$P_t^*(\sigma )=\sigma $$\end{document} for all times t>0.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$t>0.$$\end{document} Other possible applications of the differential calculus in Sect. 2.4.1 and thus also of the parameter estimation techniques studied in this paper are discussed in [15, Sect. 5] and include popular quantum channels such as the depolarizing channel.
The generators L\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathscr {L}}$$\end{document} of the quantum Markov semigroups in Heisenberg representation are characterized by [15, Theo 2.4]L=∑j∈Je-ωj/2LjandLj(A)=Vj∗[A,Vj]+[Vj∗,A]Vj\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} {\mathscr {L}} = \sum _{j \in J} e^{-\omega _j/2} {\mathscr {L}}_j \text { and } {\mathscr {L}}_j (A) = V_j^* [A,V_j]+ [V_j^*,A]V_j \end{aligned}$$\end{document}with J a finite set and a family of operators (Vj)j∈J\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$(V_j)_{j \in J}$$\end{document} closed under taking adjoints, as well as real numbers ωj\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\omega _j$$\end{document} such that the modulation operator Mσ(A):=Mσ,σ(A):=σAσ-1\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$M_{\sigma }(A):=M_{\sigma , \sigma }(A):=\sigma A \sigma ^{-1}$$\end{document} satisfiesMσ(Vj)=e-ωjVjandωj¯=-ωj.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} M_{\sigma }(V_j) = e^{-\omega _j }V_j \text { and }\omega _{\bar{j}} =-\omega _j. \end{aligned}$$\end{document}We then define A=Bj=L(H)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {A}}= B_j= {\mathcal {L}}({\mathcal {H}})$$\end{document} where H\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {H}}$$\end{document} is a finite-dimensional Hilbert space, write B:=∏jBj\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {B}}:=\prod _j B_j$$\end{document}, and set lj=rj=idA.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$l_j=r_j=\mathrm{id}_{{\mathcal {A}}}.$$\end{document} The partial derivatives are then just given by ∇jA=[Vj,A]\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\nabla _j A = [V_j,A]$$\end{document} and ∇j∗:=∇j¯\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\nabla _j^*:=\nabla _{\bar{j}}$$\end{document} where j¯\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\bar{j}$$\end{document} is such that Vj∗=Vj¯.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$V_j^*=V_{\bar{j}}.$$\end{document} The gradient vector is thus just ∇=(∇1,…,∇|J|)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\nabla = (\nabla _1,\ldots ,\nabla _{\vert J \vert })$$\end{document}. It follows from [15, Prop. 2.5] that the Lindblad generator induces a Dirichlet form with respect to the Kubo–Martin–Schwinger inner product ⟨A,B⟩KMS:=τ(X∗Yσ),\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\langle A,B \rangle _{\mathrm{KMS}}:=\tau (X^* Y \sigma ),$$\end{document} i.e.⟨∇A,∇B⟩=-⟨A,LB⟩LKMS2(σ)for allA,B∈A.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \langle \nabla A, \nabla B \rangle = - \langle A, {\mathscr {L}} B \rangle _{L^2_{\mathrm{KMS}}(\sigma )} \text { for all } A,B \in {\mathcal {A}}. \end{aligned}$$\end{document}We then define the operatorρj^=∫01(eωj/2l(ρ))1-s⊗(e-ωj/2r(ρ))sds∈A⊗A\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \widehat{\rho _j} = \int _0^1 (e^{\omega _j/2}l(\rho ))^{1-s} \otimes (e^{-\omega _j/2}r(\rho ))^{s} \ ds \in {\mathcal {A}} \otimes {\mathcal {A}} \end{aligned}$$\end{document}with inverse operatorρjˇ=∫0∞(t+eωj/2l(ρ))-1⊗(t+e-ωj/2r(ρ))-1dt.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \check{\rho _j} = \int _0^{\infty } (t+e^{\omega _j/2}l(\rho ))^{-1} \otimes (t+e^{-\omega _j/2}r(\rho ))^{-1} \ dt. \end{aligned}$$\end{document}In terms of a contraction operator #\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\#$$\end{document} that is uniquely defined as the linear extension of the map (A⊗B)#C:=ACB\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$(A \otimes B)\#C:=ACB$$\end{document} for A,B,C∈A\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$A,B,C \in {\mathcal {A}}$$\end{document} and Feynman–Kubo–Mori operatorLρ(C):=ρj^#C\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} L_{\rho }(C) := \widehat{\rho _j} \# C \end{aligned}$$\end{document}we may then introduce a positive-definite operator -Δρ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$-\Delta _{\rho }$$\end{document} on L2(A,τ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2({\mathcal {A}}, \tau )$$\end{document}-Δρ(A):=∑j∈J∇j∗(Lρ(∇jA)).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} -\Delta _{\rho }(A) :=\sum _{j \in J} \nabla _j^*(L_{\rho }(\nabla _j A)). \end{aligned}$$\end{document}This way, the L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-quantum Wasserstein metric becomesWQ2(ρin,ρfi):=infρ∈Π(ρin,ρfi)∫01⟨Φ(t),-Δρ(Φ(t))⟩τdt\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} W^2_Q(\rho _{\mathrm{in}},\rho _{\mathrm{fi}}) := \inf _{\rho \in \Pi (\rho _{\mathrm{in}},\rho _{\mathrm{fi}})} \left\{ \int _0^1 \langle \Phi (t),-\Delta _{\rho } (\Phi (t)) \rangle _{\tau } \ dt \right\} \end{aligned}$$\end{document}where Φ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Phi $$\end{document} is coupled to ρ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho $$\end{document} by the following continuity equationρ′(t)=Δρ(t)Φ(t).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \rho '(t) = \Delta _{\rho (t)} \Phi (t). \end{aligned}$$\end{document}
Quantum Natural Gradient and Open Quantum Systems
In the following we shall impose the following condition on generators of finite-dimensional open quantum systems we consider:
Assumption 1
We assume that L\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathscr {L}}$$\end{document} is ergodic, i.e. ker(L)=span{id}\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathrm{ker}({\mathscr {L}})=\mathrm{span}\{ \mathrm{id}\}$$\end{document} satisfying the detailed balance condition with invariant state σ.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\sigma .$$\end{document}
Gradient Flow for Finite-Dimensional OQSs with DBC
By the ergodicity assumption, we are able to pull back the metric from the state space to the parameter space. In particular, the above assumptions are satisfied for the fermionic Fokker–Planck equation with the completely mixed state as the unique invariant state.
The statistical parameter space is as in the classical setting defined as a d-dimensional Riemannian manifold Θ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Theta $$\end{document} with connection Dθ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$D_{\theta }$$\end{document} and metric tensor ⟨ξ,η⟩θ=ξTGθη\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\langle \xi , \eta \rangle _{\theta } = \xi ^T G_{\theta } \eta $$\end{document}. We then take a continuous parametrization Θ∋θ↦ρ(θ)∈D+(A)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Theta \ni \theta \mapsto \rho (\theta ) \in {\mathscr {D}}_{+}({\mathcal {A}})$$\end{document} of density operators.
We then define a norm‖Z‖ρ2=Z,Lρ(Z)L2(τ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \Vert Z \Vert ^2_{\rho }= \left\langle Z, L_{\rho }(Z) \right\rangle _{L^2(\tau )} \end{aligned}$$\end{document}where Lρ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L_{\rho }$$\end{document} has been defined in (2.12) for the fermionic Fokker–Planck equation and in (2.15) for general open quantum systems satisfying the DBC. In addition, we allow for Lρ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L_{\rho }$$\end{document} the anti-commutation operator defined in (2.10).
The associated metric tensor on D+(H)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathscr {D}}_+({\mathcal {H}})$$\end{document} is given bygρ:(TρD+(H))2→R,gρ(X,X):=∇ΦX,Lρ(∇ΦX)L2(τ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} g_{\rho }: (T_{\rho }{\mathscr {D}}_+({\mathcal {H}}))^2 \rightarrow {\mathbb {R}}, \quad g_{\rho }(X,X):= \left\langle \nabla \Phi _X, L_{\rho }(\nabla \Phi _X) \right\rangle _{L^2(\tau )} \end{aligned}$$\end{document}where TρD+(H)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$T_{\rho }{\mathscr {D}}_+({\mathcal {H}})$$\end{document} is the tangent space at ρ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho $$\end{document} and ∇ΦX\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\nabla \Phi _X$$\end{document} is the unique gradient field cf. [12, Theo 3.17] and [15, Lem. 7.5] satisfyingX=-divLρ(∇ΦX).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} X = - \mathrm{div} \left( L_{\rho }(\nabla \Phi _X) \right) . \end{aligned}$$\end{document}In case of Lρ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L_{\rho }$$\end{document} being the anti-commutator, the gradient field ∇ΦX\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\nabla \Phi _X$$\end{document} can be found by solving the Lyapunov equation [8, (21)]∇(divgrad|span(id)⊥)-1X=Lρ(∇ΦX)∈L(Hn).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \nabla ( \mathrm{div}\mathrm{grad}\vert _{\mathrm{span}(\mathrm{id})^{\perp }} )^{-1} X = L_{\rho }(\nabla \Phi _X) \in {\mathcal {L}}({\mathcal {H}}^n). \end{aligned}$$\end{document}In particular, there exists a unique gradient ∇Φξ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\nabla \Phi _{\xi }$$\end{document} such that⟨Dθρ(θ),ξ⟩θ=-div(Lρ(θ)(∇Φξ)).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \langle D_{\theta }\rho (\theta ),\xi \rangle _{\theta } =- \mathrm{div}(L_{\rho (\theta )} (\nabla \Phi _{\xi })). \end{aligned}$$\end{document}Hence, we conclude that for ξ,η∈TθΘ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\xi ,\eta \in T_{\theta }\Theta $$\end{document} there are score functionsΦξ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Phi _{\xi }$$\end{document} and Φη\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Phi _{\eta }$$\end{document}2 such that we can define the pullback metric on the parameter spacegθ(ξ,η):=gρ(θ)(⟨Dθρ(θ),ξ⟩θ,⟨Dθρ(θ),η⟩θ)=∇Φξ,Lρ(θ)(∇Φη)L2(τ)=-Φξ,div(Lρ(θ)(∇Φη))L2(τ)=Φξ,⟨Dθρ(θ),η⟩θL2(τ).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} g_{\theta }(\xi ,\eta )&:=g_{\rho (\theta )}(\langle D_{\theta }\rho (\theta ), \xi \rangle _{\theta },\langle D_{\theta }\rho (\theta ),\eta \rangle _{\theta })\nonumber \\&= \left\langle \nabla \Phi _{\xi }, L_{\rho (\theta )} (\nabla \Phi _{\eta }) \right\rangle _{L^2(\tau )}\nonumber \\&=-\left\langle \Phi _{\xi }, \mathrm{div}(L_{\rho (\theta )} (\nabla \Phi _{\eta })) \right\rangle _{L^2(\tau )} \nonumber \\&=\left\langle \Phi _{\xi }, \langle D_{\theta } \rho (\theta ),\eta \rangle _{\theta } \right\rangle _{L^2(\tau )}. \end{aligned}$$\end{document}We then define the operator -Δθf:=-div(Lρ(θ)(∇f)).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$-\Delta _{\theta }f:=-\mathrm{div} (L_{\rho (\theta )}(\nabla f)).$$\end{document} This operator is self-adjoint with respect to ⟨∙,∙⟩L2(τ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\langle \bullet , \bullet \rangle _{L^2(\tau )}$$\end{document} and positive-definite with only span{id}\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathrm{span}\{\mathrm{id}\}$$\end{document} in its nullspace by ergodicity. Using that ⟨Dθρ(θ),η⟩θ∈ker(Δθ)⊥\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\langle D_{\theta } \rho (\theta ),{\eta } \rangle _{\theta } \in \mathrm{ker}(\Delta _{\theta })^{\perp }$$\end{document}, this implies thatgθ(ξ,η):=τ⟨Dθρ(θ),ξ⟩θ(-Δθ)|span{id}⊥-1⟨Dθρ(θ),η⟩θ.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} g_{\theta }({\xi },{\eta }) := \tau \left( \langle D_{\theta } \rho (\theta ), {\xi } \rangle _{\theta }(- \Delta _{\theta })\vert _{\mathrm{span} \{\mathrm{id}\}^{\perp }}^{-1} \langle D_{\theta } \rho (\theta ),{\eta } \rangle _{\theta } \right) . \end{aligned}$$\end{document}We can rewrite this line as a bilinear form by using the matrix Gθ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$G_{\theta }$$\end{document}, introduced above, associated with the metric on the parameter space. We can thus define the positive definite Wasserstein information matrixGW(θ)=τei^TGθTDθρ(θ)(-Δθ)|span{id}-1Dθρ(θ)Gθej^i,j∈Rd×d.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} G_W(\theta )=\tau \left( \widehat{e_i}^T G_{\theta }^T D_{\theta } \rho (\theta )(-\Delta _{\theta })\vert _{\mathrm{span}\{\mathrm{id}\}}^{-1} D_{\theta } \rho (\theta )G_{\theta } \widehat{e_j} \right) _{i,j} \in {\mathbb {R}}^{d \times d}. \end{aligned}$$\end{document}Thus, it follows that the metric tensor of the pullback metric on the statistical manifold is of the simple formgθ(ξ,η)=⟨ξ,GW(θ)η⟩\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} g_{\theta }(\xi ,\eta )= \langle \xi , G_W(\theta ) \eta \rangle \end{aligned}$$\end{document}and as in Sect. 2.2 the natural Wasserstein gradient becomes for an objective function R(θ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$R(\theta )$$\end{document} defined byθ˙(t)=-∇gR(θ(t))\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \dot{\theta }(t) = -\nabla _{g} R(\theta (t)) \end{aligned}$$\end{document}with ∇g\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\nabla _g$$\end{document} uniquely defined bygθ(∇gR(θ),ξ)=⟨DθR(θ),ξ⟩θ∀ξ∈TθΘ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} g_{\theta }(\nabla _g R(\theta ), \xi ) = \langle D_{\theta } R(\theta ), \xi \rangle _{\theta } \quad \forall \xi \in T_{\theta }\Theta \end{aligned}$$\end{document}such that ∇gR(θ)=GW(θ)-1GθDθR(θ).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\nabla _g R(\theta ) = G_W(\theta )^{-1} G_{\theta } D_{\theta } R(\theta ).$$\end{document} This is illustrated in Sect. 4.2.2 for R being the von Neumann entropy.
The gradient descent method in parameter space naturally corresponds to a gradient descent method on the parametrized manifold of states:
Proposition 3.1
Consider an immersion Θ∋θ↦ρ(θ)∈D(H)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Theta \ni \theta \mapsto \rho (\theta ) \in {\mathscr {D}}({\mathcal {H}})$$\end{document} and an objective function R\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {R}}$$\end{document} on the set of states. We can then define an objective function R(θ)=R(ρ(θ))\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$R(\theta )={\mathcal {R}} (\rho (\theta ))$$\end{document} and the gradient evolutionθ˙(t)=-∇gR(θ),\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \dot{\theta }(t) = - \nabla _g R(\theta ), \end{aligned}$$\end{document}induces the gradient evolutionρ′(t)=-gradR(ρ(t))\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \rho '(t) = - \mathrm{grad} {\mathcal {R}}(\rho (t)) \end{aligned}$$\end{document}on the parametrized manifold of states where ρ(t)=ρ(θ(t))\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho (t)=\rho (\theta (t))$$\end{document} and grad(R(ρ(t0)))=⟨Dθρ(θ),∇gR(θt0)⟩θ.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathrm{grad}({\mathcal {R}} (\rho (t_0)))=\langle D_{\theta }\rho (\theta ),\nabla _g R (\theta _{t_0}) \rangle _{\theta }.$$\end{document}
Proof
We always have that ddtρ(θ(t))=(ρθ)∗θ˙(t)=-(ρθ)∗∇gR(θ(t)).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\frac{d}{dt}\rho (\theta (t))=(\rho _{\theta })_{*} \dot{\theta }(t)= - (\rho _{\theta })_{*}\nabla _g R(\theta (t)).$$\end{document} Thus it suffices to show that (ρθ)∗∇gR(θ(t))=gradR(ρ(t)).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ (\rho _{\theta })_{*}\nabla _g R(\theta (t)) =\mathrm{grad} {\mathcal {R}}(\rho (t)).$$\end{document}
Using (3.1) and (3.4), we thus find that the geodesics on the parameter manifold (Θ,gθ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$(\Theta , g_{\theta })$$\end{document} minimize again the square geodesic distanceWQ2(ρ(∙,θ0),ρ(∙,θ1))=infθ∈C1(0,1)∩C[0,1]θ(0)=θ0,θ(1)=θ1∫01⟨θ˙(t),GW(θ(t))θ˙(t)⟩dt.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} W^2_Q(\rho (\bullet ,\theta ^0),\rho (\bullet ,\theta ^1)) = \inf _{\begin{array}{c} \theta \in C^1(0,1)\cap C[0,1]\\ \theta (0)=\theta ^0 , \theta (1)=\theta ^1 \end{array}}\left\{ \int _0^1 \langle \dot{\theta }(t), G_W(\theta (t)) \dot{\theta }(t)\rangle \ dt \right\} . \end{aligned}$$\end{document}The geodesics to the above Wasserstein distance are given as solutions to the following Hamiltonian systemθ˙-GW(θ)-1P=0andP˙+12PT∂θGW(θ)-1P=0.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned}&\dot{\theta }-G_W(\theta )^{-1} P = 0 \text { and }\nonumber \\&\dot{P}+ \frac{1}{2} P^T \partial _{\theta } G_W(\theta )^{-1} P=0. \end{aligned}$$\end{document}Indeed, for the Lagrangian L(θ(t),θ˙(t))=⟨θ˙(t),GW(θ(t))θ˙(t)⟩,\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {L}}(\theta (t), \dot{\theta }(t)) =\langle \dot{\theta }(t),G_W(\theta (t)) \dot{\theta }(t) \rangle ,$$\end{document} the associated momentum variable is P(t)=GW(θ(t))θ˙(t)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$P(t)=G_W(\theta (t))\dot{\theta }(t)$$\end{document} with Hamilton function H(P(t),θ(t))=12⟨P(t),GW(θ(t))P(t)⟩\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$H(P(t),\theta (t)) = \frac{1}{2} \langle P(t), G_W(\theta (t)) P(t)\rangle $$\end{document}. The system (3.9) are then precisely Hamilton’s equations.
On the other hand, the geodesic equations in D+(H)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathscr {D}}_+({\mathcal {H}})$$\end{document} with respect to the quantum Wasserstein metric for the fermionic Fokker–Planck equation are given by [12, Theo. 5.3]ρ′(t)+div(Lρ(t)∇Φ(t))=0Φ′(t)+12ρ(t)♭(∇Φ(t),∇Φ(t))=0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned}&\rho '(t) + \mathrm{div}(L_{\rho (t)} \nabla \Phi (t) ) = 0 \nonumber \\&\Phi '(t) + \frac{1}{2} \rho (t) \flat (\nabla \Phi (t), \nabla \Phi (t)) = 0 \end{aligned}$$\end{document}where we define for ρ∈D+(H)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho \in {\mathscr {D}}_+({\mathcal {H}})$$\end{document} and X,Y∈Cn\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$X,Y \in {\mathfrak {C}}^n$$\end{document} the mapρ♭(X,Y)=∫01∫01∫0α2ρα-β(1-s)+sρX∗Γ(ρ)1-αYρβ(1-s)+sρdβdαds.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \rho \flat (X,Y) = \int _0^1 \int _0^1 \int _0^{\alpha } \frac{2\rho ^{\alpha -\beta }}{(1-s)+s\rho } X^* \Gamma (\rho )^{1-\alpha } Y \frac{\rho ^{\beta }}{(1-s)+s\rho } \ d\beta \ d\alpha \ ds. \end{aligned}$$\end{document}The advantage of (3.9) over (3.10) lies in the low-dimensionality of the parameter space which turns (3.9) into an equation in a much lower dimensional space than the system in (3.10), in general.
Schrödinger Bridge Problem for Finite-Dimensional OQSs with DBC
We may now introduce a generalization of the quantum Brenier–Benamou formula in (2.9), to study a quantum version of the Schrödinger bridge problem, by adding a Fisher information regularizer to the dynamics. For this derivation, we shall restrict us to the scenario that the operator Lρ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L_{\rho }$$\end{document} is the Feynman–Kubo–Mori operator as in this case, one obtains direct links to quantum entropies and quantum dynamics.
The computational advantage of the Fisher information regularization are two-fold. Firstly, it induces additional convexity to the minimization problem. Secondly, it additionally forces the density operator to remain strictly positive.
Definition 3.2
The quantum Schrödinger bridge problem (SBP) is the minimization problem for two quantum states ρin,ρfi∈D+(H)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho _{\mathrm{in}},\rho _{\mathrm{fi}} \in {\mathscr {D}}_+({\mathcal {H}})$$\end{document}S(ρin,ρfi):=infρ∈Π(ρin,ρfi)infm∫01‖m‖ρ(t)-12dt.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} {\mathcal {S}}(\rho _{\mathrm{in}},\rho _{\mathrm{fi}}):=\inf _{\rho \in \Pi (\rho _{\mathrm{in}}, \rho _{\mathrm{fi}}) } \inf _m \int _0^1 \Vert m \Vert ^2_{\rho (t)^{-1}} \ dt. \end{aligned}$$\end{document}where we use the inner product⟨X,Y⟩ρ(t)-1:=⟨X,Lρ-1(Y)⟩L2(τ).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \langle X, Y \rangle _{\rho (t)^{-1}}:= \langle X, L_{\rho }^{-1}(Y) \rangle _{L^2(\tau )}. \end{aligned}$$\end{document}Here m is connected to ρ(t)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho (t)$$\end{document} by an inhomogeneous heat equationρ′(t)+div(m(t))=βTρ(t)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \rho '(t) + \mathrm{div}(m(t)) = \beta T \rho (t) \end{aligned}$$\end{document}for some fixed parameter β≥0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\beta \ge 0$$\end{document} where T=L∗\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$T={\mathscr {L}}^*$$\end{document} for OQS satisfying the DBC and T=-N\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$T=-{\mathcal {N}}$$\end{document} in the case of the fermionic Fokker–Planck equation.
As for (3.12), in the case β=0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\beta =0$$\end{document}, the SBP reduces to the minimization of the quantum L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-Wasserstein metric in (2.9). We now introduce the Fisher information matrixI(ρ):=‖∇(log(ρ)-log(σ))‖ρ2.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$I(\rho ):= \Vert \nabla (\log (\rho )-\log (\sigma )) \Vert ^2_{\rho }.$$\end{document} We can then express the optimal transport distance problem (3.12) as an equivalent dynamical problem with Fisher information regularization:
Theorem 1
The Schrödinger bridge problem (3.12) is equivalent to the following optimization problemS(ρin,ρfi):=infρ∈Π(ρin,ρfi)infM∫01‖M(t)‖ρ(t)-12+β2I(ρ(t))dt+2β(Sσ(ρfi)-Sσ(ρin))\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} {\mathcal {S}} (\rho _{\mathrm{in}},\rho _{\mathrm{fi}}):=\inf _{\rho \in \Pi (\rho _{\mathrm{in}},\rho _{\mathrm{fi}}) } \inf _M \int _0^1 \Vert M(t) \Vert ^2_{\rho (t)^{-1}} + \beta ^2 I(\rho (t)) \ dt + 2\beta (S_{\sigma }(\rho _{\mathrm{fi}})-S_{\sigma }(\rho _{\mathrm{in}})) \end{aligned}$$\end{document}where M satisfies the transport equationρ′(t)+divM(t)=0.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \rho '(t) + \mathrm{div} M(t) = 0. \end{aligned}$$\end{document}
Proof
We start by definingM(t):=m(t)-βLρ(t)(∇(log(ρ(t))-log(σ))),\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} M(t):=m(t)-\beta L_{\rho (t)}(\nabla ( \log (\rho (t))-\log (\sigma ))) , \end{aligned}$$\end{document}which turns the inhomogeneous heat equation into a simple transport equationρ′(t)+divM(t)=0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \rho '(t) + \mathrm{div} M(t) = 0 \end{aligned}$$\end{document}asL∗=div(Lρ(t)(∇(log(ρ(t))-log(σ)))),\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} {\mathscr {L}}^* =\mathrm{div}(L_{\rho (t)}(\nabla ( \log (\rho (t))-\log (\sigma )))), \end{aligned}$$\end{document}cf. the proof of [15, Prop. 2.7]. In case of the fermionic Fokker–Planck equation we also record that the quantum analog of the classical identity ∇f(x)=f(x)∇logf(x)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\nabla f(x) = f(x) \nabla \log f(x)$$\end{document} in the quantum setting becomes the identity [12, Lemma 3.1]∇iρ=Lρ(∇ilog(ρ)).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \nabla _i \rho = L_{\rho }(\nabla _i \log (\rho )). \end{aligned}$$\end{document}Thus, we have that‖m(t)‖ρ(t)-12=M(t)+βLρ(t)(∇(log(ρ(t))-log(σ)))ρ(t)-12=M(t)ρ(t)-12+2β⟨M(t),∇(log(ρ(t))-log(σ))⟩L2(τ)+β2‖∇(log(ρ(t))-log(σ))‖ρ(t)2.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \Vert m(t) \Vert ^2_{\rho (t)^{-1}}&= \left\Vert M(t)+\beta L_{\rho (t)} (\nabla ( \log (\rho (t))-\log (\sigma ))) \right\Vert ^2_{\rho (t)^{-1}} \nonumber \\&= \left\Vert M(t) \right\Vert ^2_{\rho (t)^{-1}} + 2 \beta \langle M(t), \nabla (\log (\rho (t))-\log (\sigma )) \rangle _{L^2(\tau )} \nonumber \\&\quad + \beta ^2 \Vert \nabla (\log (\rho (t))-\log (\sigma ))\Vert ^2_{\rho (t)}. \end{aligned}$$\end{document}The middle term in (3.15) is constant, and satisfies in terms of the relative von Neumann entropy Sσ(ρ)=τ(ρ(log(ρ)-log(σ)))\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$S_{\sigma }(\rho ) =\tau (\rho (\log (\rho )-\log (\sigma )))$$\end{document}∫01⟨M(t),∇(log(ρ(t))-log(σ))⟩L2(τ)dt=-∫01τ(div(M(t))(log(ρ(t))-log(σ)))dt=τ∫01ρ′(t)(log(ρ(t))-log(σ))dt=Sσ(ρfi)-Sσ(ρin)=const\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \int _0^1 \langle M(t), \nabla (\log (\rho (t))-\log (\sigma )) \rangle _{L^2(\tau )} \ dt&= -\int _0^1 \tau (\mathrm{div}(M(t)) (\log (\rho (t))-\log (\sigma )) ) \ dt\\&=\tau \left( \int _0^1 \rho '(t) (\log (\rho (t))-\log (\sigma )) \ dt \right) \\&=S_{\sigma }(\rho _{\mathrm{fi}})-S_{\sigma }(\rho _{\mathrm{in}})= \mathrm{const} \end{aligned}$$\end{document}where we integrated by parts to obtain the last line. □\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\square $$\end{document}
Continuous-Variable Systems
As in the theory of classical probability theory, there exists a close analogue of quantum Gaussian statesG(Hm)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {G}}({\mathcal {H}}_m)$$\end{document} on Hm:=L2(Rm)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {H}}_m:=L^2({\mathbb {R}}^m)$$\end{document} defined as follows (cf. [5] and references therein for more details):
Gaussian states are states ρ∈D(Hm)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho \in {\mathscr {D}}({\mathcal {H}}_m)$$\end{document} such that their characteristic function χρ:Cm→C\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\chi _{\rho }: {\mathbb {C}}^m \rightarrow {\mathbb {C}}$$\end{document}χρ(z):=tr(ρD(z))\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \chi _{\rho }(z):= {{\,\mathrm{tr}\,}}(\rho D(z)) \end{aligned}$$\end{document}is the characteristic function of a Gaussian random variable over Cm\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbb {C}}^m$$\end{document}, i.e. χ(ξ)=exp-14⟨ξ,γξ⟩+i⟨d,ξ⟩\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\chi (\xi )=\mathrm{exp}\left( -\frac{1}{4} \langle \xi ,\gamma \xi \rangle +i \langle d, \xi \rangle \right) $$\end{document} where γ>0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\gamma >0$$\end{document} is a positive definite matrix satisfying γ+iν≥0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\gamma +i\nu \ge 0$$\end{document}, for ν:=01-10⊕i=1m\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\nu :=\begin{pmatrix} 0 &{} 1 \\ -1 &{} 0 \end{pmatrix}^{\oplus _{i=1}^m}$$\end{document}, and d∈R2m.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$d \in {\mathbb {R}}^{2m}.$$\end{document} Here, D(z) is the displacement operatorD(z)=exp∑j=1m(zjaj∗-z¯jaj).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} D(z) = \mathrm{exp}\left( \sum _{j=1}^m (z_j a_j^* - \bar{z}_j a_j)\right) . \end{aligned}$$\end{document}Conversely, the density operator ρ∈D(Hm)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho \in {\mathscr {D}}({\mathcal {H}}_m)$$\end{document} can be recovered from its characteristic function byρ=∫Cmχρ(z)D(-z)dzπm.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \rho = \int _{{\mathbb {C}}^m} \chi _{\rho }(z)D(-z) \frac{dz}{\pi ^m}. \end{aligned}$$\end{document}We can associate a canonical random variable to any Gaussian state in terms of their Wigner functionPρ(z):=∫Cmχρ(w)ezTw∗-z†wdwπ2m≥0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} P_{\rho }(z):=\int _{{\mathbb {C}}^m} \chi _{\rho }(w) e^{z^T w^* -z^{\dagger } w } \frac{dw}{\pi ^{2m}}\ge 0 \end{aligned}$$\end{document}which is of unit L1\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^1$$\end{document} norm and a Gaussian distribution on R2m\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbb {R}}^{2m}$$\end{document} as well.
A particularly simple and relevant example of a Gaussian state are thermal states with mean photon number N∈[0,∞)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$N \in [0,\infty )$$\end{document}ρN:=1N+1∑n=0∞NN+1n|n⟩⟨n|\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \rho _N:= \frac{1}{N+1}\sum _{n=0}^{\infty } \left( \frac{N}{N+1} \right) ^n \vert n \rangle \langle n \vert \end{aligned}$$\end{document}as their characteristic functions and Wigner distributionsχρN(z):=e-(2N+1)|z|2/2andPρN(z):=2π(2N+1)e-22N+1|z|2.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \chi _{\rho _N}(z):=e^{-(2N+1) \vert z \vert ^2/2} \text { and }P_{\rho _N}(z) :=\frac{2}{\pi (2N+1)} e^{-\frac{2}{2N+1} \vert z \vert ^2}. \end{aligned}$$\end{document}are centered and uncorrelated.
Thermal states have the special property that they are the maximum entropy states for a fixed average energyρN=argmaxρ;tr(ρa∗a)≤N-tr(ρlog(ρ)).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \rho _N = \mathrm{argmax}_{\rho ; {{\,\mathrm{tr}\,}}(\rho a^*a)\le N} -{{\,\mathrm{tr}\,}}(\rho \log (\rho )). \end{aligned}$$\end{document}We finally mention that although Wigner distributions functions are positive as operators on L2(R2m)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2({\mathbb {R}}^{2m})$$\end{document}, they are not pointwise positive in general and therefore also not always genuine probability distributions (cf.the Wigner distribution function associated to |1⟩⟨1|\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\vert 1 \rangle \langle 1 \vert $$\end{document}).
In addition, the Wigner distribution function of a state ρ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho $$\end{document} satisfies the energy identity∫R2n|z|2Pρ(z)dz=∫R2n|z|2ρ(z)dz=tr(ρx2)+tr(ρp2)=tr((2a∗a+1)ρ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \int _{{\mathbb {R}}^{2n} } \vert z\vert ^2 P_{\rho }(z) \ dz = \int _{{\mathbb {R}}^{2n}} \vert z \vert ^2 \rho (z) \ dz = {{\,\mathrm{tr}\,}}(\rho x^2) + {{\,\mathrm{tr}\,}}(\rho p^2) = {{\,\mathrm{tr}\,}}((2a^*a+1)\rho ) \end{aligned}$$\end{document}where x and p are the position and momentum operator.
Thus, the classical L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-Wasserstein distance, corresponds in this formalism to an energy penalization and we define the optimal transport functional with phase-space variable square penalizationinfm.ρ∫01∫R2n|m(t,z)|2ρ(t,z)dzdt\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \inf _{m. \rho } \int _0^1 \int _{{\mathbb {R}}^{2n}} \frac{\vert m(t,z) \vert ^2}{\rho (t,z)} \ dz \ dt \end{aligned}$$\end{document}where ρ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho $$\end{document} satisfies the Fokker–Planck equation∂tρ(t,z)+div(m(t,z))=βΔρ(t,z),ρ(0,z)=ρ0(z),ρ(1,z)=ρ1(z)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \partial _t \rho (t,z) + \mathrm{div}(m(t,z)) = \beta \Delta \rho (t,z) , \quad \rho (0,z)=\rho _0(z), \ \rho (1,z)=\rho _1(z) \end{aligned}$$\end{document}with parameter β≥0,\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\beta \ge 0,$$\end{document} where β=0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\beta =0$$\end{document} corresponds to the optimal transport in L2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2$$\end{document}-Wasserstein distance [10] and β>0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\beta >0$$\end{document} to the Schrödinger bridge problem [31].
Proposition 3.3
(Separability) Let ρθ(i)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho ^{(i)}_{\theta }$$\end{document} be a family of Gaussian states on Hilbert spaces L2(R2n(i))\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L^2({\mathbb {R}}^{2n(i)})$$\end{document}, and ρθ:=⨂i=1Nρθ(i)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho _{\theta }:=\bigotimes _{i=1}^N \rho ^{(i)}_{\theta }$$\end{document}, then the Wasserstein information matrix satisfiesGW(ρ)=∑i=1NGW(ρ(i)).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} G_W(\rho )= \sum _{i=1}^N G_W(\rho ^{(i)}). \end{aligned}$$\end{document}
Proof
It follows directly from (3.16) that the characteristic function of a tensor product is the product of the individual characteristic functions. Using the Fourier transform and (3.17), this immediately translates into the Wigner functions being a product of Wigner functions (3.17). The result then follows from [25, Prop. 5]. □\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\square $$\end{document}
Examples
In this section, we demonstrate the quantum transport information matrix and its related gradient and Hamiltonian flows in some well-known probability models.
Examples for the Quantum Wigner DistributionGaussian Mixture Model
For Gaussian states ρi\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\rho _i$$\end{document} we consider the Gaussian Wigner probability distributions Pρi\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$P_{\rho _i}$$\end{document} associated to them.
Let Xi∼N(μi,Σi)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$X_i \sim {\mathcal {N}}(\mu _i, \Sigma _i)$$\end{document} be normal random variables, then it follows thatE(Xi)=mi,Var(Xi)=E(XiXi∗)-E(Xi)E(Xi)∗=Σ.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} {\mathbb {E}}(X_i) = m_i, \text {Var}(X_i) = {\mathbb {E}}(X_iX_i^* ) -{\mathbb {E}}(X_i){\mathbb {E}}(X_i)^* = \Sigma . \end{aligned}$$\end{document}Let X=∑i=1NλiXi\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$X= \sum _{i=1}^N \lambda _i X_i$$\end{document} be a Gaussian mixture with λi≥0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\lambda _i \ge 0$$\end{document} summing up to one, then clearly μX:=E(X)=∑i=1Nλiμi\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mu _X:={\mathbb {E}}(X)=\sum _{i=1}^N \lambda _i \mu _i$$\end{document} and also for the second moment mXi:=E(XiXi∗)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$m_{X_i}:={\mathbb {E}}(X_iX_i^*)$$\end{document} we findmX:=EX(xx∗)=∑i=1NλiμXi.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} m_{X}:=E_X(xx^*)= \sum _{i=1}^N \lambda _i \mu _{X_i}. \end{aligned}$$\end{document}Thus, the covariance matrix is given byVar(X)=∑i=1NλiΣi+∑i=1Nλiμiμi∗-E(X)E(X)∗\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \text {Var}(X) = \sum _{i=1}^N \lambda _i \Sigma _i + \sum _{i=1}^N \lambda _i \mu _i \mu _i^* -{\mathbb {E}}(X) {\mathbb {E}}(X)^* \end{aligned}$$\end{document}where ∑i=1Nλiμiμi∗-E(X)E(X)∗≥0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\sum _{i=1}^N \lambda _i \mu _i \mu _i^* -{\mathbb {E}}(X) {\mathbb {E}}(X)^* \ge 0$$\end{document} by Jensen’s inequality. Thus, since the variance of a mixture is increasing, the condition Σi+iν≥0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Sigma _i + i \nu \ge 0$$\end{document} is satisfied for the extremal states and clearly the state associated with the mixture X isρ=∑i=1Nλiρi.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \rho = \sum _{i=1}^N \lambda _i \rho _i. \end{aligned}$$\end{document}To parametrize multivariate Gaussian distributions N(μ,Σ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {N}}(\mu , \Sigma )$$\end{document} that are Wigner functions of Gaussian states, it is natural to consider the parameter space θ=(μ,Σ)∈Θ:=R2m×{γ∈R2m×2m;γ>0andγ+iν>0}.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\theta =(\mu , \Sigma )\in \Theta := {\mathbb {R}}^{2m} \times \{\gamma \in {\mathbb {R}}^{2m \times 2m};\gamma>0 \text { and } \gamma +i \nu > 0\}.$$\end{document} The Wasserstein metric tensor for the multivariate Gaussian model isgθ(ξ,η)=⟨μξ,μη⟩+tr(SξΣSη)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} g_{\theta }(\xi ,\eta ) = \langle \mu _{\xi }, \mu _{\eta } \rangle + {{\,\mathrm{tr}\,}}(S_{\xi } \Sigma S_{\eta }) \end{aligned}$$\end{document}for ξ=(μξ,Σξ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\xi = (\mu _{\xi }, \Sigma _{\xi })$$\end{document} and η=(μη,Ση)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\eta =(\mu _{\eta }, \Sigma _{\eta })$$\end{document} and Sξ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$S_{\xi }$$\end{document} and Sη\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$S_{\eta }$$\end{document} solving the Lyapunov equationsΣξ={Sξ,Σ}andΣη={Sη,Σ}.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \Sigma _{\xi }=\{S_{\xi }, \Sigma \} \text { and } \Sigma _{\eta } =\{S_{\eta }, \Sigma \}. \end{aligned}$$\end{document}In fact, for Q=Q∗\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$Q =Q^*$$\end{document}, we can define the map LΣ(Q):=∫0∞e-ΣtQe-Σtdt,\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L_{\Sigma }(Q):=\int _0^{\infty } e^{-\Sigma t} Q e^{-\Sigma t} \ dt,$$\end{document} solving Lyapunov equation Q={LΣ(Q),Σ}\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$Q=\{L_{\Sigma }(Q), \Sigma \}$$\end{document}, then LΣ(Ση)=Sη\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L_{\Sigma }(\Sigma _{\eta })= S_{\eta }$$\end{document} and LΣ(Σξ)=Sξ.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L_{\Sigma }(\Sigma _{\xi })= S_{\xi }.$$\end{document}
The Wigner function W(Σ0,μ0)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$W(\Sigma _0, \mu _0)$$\end{document}
The Wigner function W(Σ1,μ1)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$W(\Sigma _1, \mu _1)$$\end{document}
Optimal quantum transport map from quantum state with Wigner function W(θ0)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$W(\theta ^0)$$\end{document} to quantum state with Wigner function W(θ1)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$W(\theta ^1)$$\end{document}
This minimization problem can be easily solved using a simple Monte-Carlo algorithm minimizing (2.5) that only accepts transitions to states that satisfy the two constraintsΣi≥0andΣi+iν≥0.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \Sigma _i\ge 0 \text { and } \Sigma _i + i \nu \ge 0. \end{aligned}$$\end{document}The numerical solution to the quantum transport problem of the two parametrized Gaussian states is illustrate in Fig. 3.
Examples Involving the Quantum Fermionic Fokker–Planck EquationExample 2
(Fermionic Fokker–Planck equation; Analytic solution) We consider the fermionic Fokker–Planck equation as introduced in Sect. 2.5 for simplest case n=1,\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$n=1,$$\end{document} i.e. C\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathfrak {C}}$$\end{document} can be identified with the two-dimensional Hilbert space span{idC2×2,σx}\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathrm{span}\{ \mathrm{id}_{{\mathbb {C}}^{2 \times 2}},\sigma _x\}$$\end{document} in which case we can solve the problem analytically.
The grading operator is defined byΓ(id)=idandΓ(σx)=-σx.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \Gamma (\mathrm{id})=\mathrm{id}\text { and }\Gamma (\sigma _x)=-\sigma _x. \end{aligned}$$\end{document}The faithful states in C\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathfrak {C}}$$\end{document} are then parametrized by(-1,1)∋θ↦ρ(θ):=id+θσx.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} (-1,1) \ni \theta \mapsto \rho (\theta ):= \mathrm{id}+\theta \sigma _x. \end{aligned}$$\end{document}We can diagonalize this density operator using the unitary map U=2-1/2(σx-σz).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$U=2^{-1/2}(\sigma _x- \sigma _z).$$\end{document} This way, Uρ(θ)U=diag(1+θ,1-θ).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$U \rho (\theta ) U =\mathrm{diag}(1+\theta ,1-\theta ).$$\end{document} The derivative is given by∇(αid+βσx)=βid.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \nabla (\alpha \mathrm{id}+\beta \sigma _x) = \beta \mathrm{id}. \end{aligned}$$\end{document}The operator Lρ(θ),Γ(ρ(θ))(id)=∫01(ρ(θ))1-s(ρ(-θ))sds\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L_{\rho (\theta ),\Gamma (\rho (\theta ))} (\mathrm{id}) =\int _0^1 (\rho (\theta ))^{1-s}(\rho (-\theta ))^{s} \ ds$$\end{document} becomes therefore after conjugating with UULρ(θ),Γ(ρ(θ))(id)U=∫01(1-θ)1-s(1+θ)sdsidC2×2=θartanh(θ)idC2×2.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} UL_{\rho (\theta ),\Gamma (\rho (\theta ))} (\mathrm{id})U=\int _0^1 (1-\theta )^{1-s} (1+\theta )^{s} \ ds \mathrm{id}_{{\mathbb {C}}^{2 \times 2}} = \frac{\theta }{\mathrm{artanh}(\theta )}\mathrm{id}_{{\mathbb {C}}^{2 \times 2}}. \end{aligned}$$\end{document}This implies that -Δρ(θ)|span(σx)=θartanh(θ)id.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$-\Delta _{\rho (\theta )} \vert _{\mathrm{span} (\sigma _x)}=\frac{\theta }{\mathrm{artanh}(\theta )} \mathrm{id}.$$\end{document} Using that Dθρ(θ)=σx\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$D_{\theta }\rho (\theta )=\sigma _x$$\end{document} and that Gθ=id\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$G_{\theta } =\mathrm{id}$$\end{document}, we find from (3.3) thatGW(θ)=artanh(θ)θ.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} G_W(\theta )= \frac{\mathrm{artanh}(\theta )}{\theta }. \end{aligned}$$\end{document}As before, our objective is to find geodesics on the parameter manifold (Θ,gθ)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$(\Theta , g_{\theta })$$\end{document} that minimize the square geodesic distanceWQ2(ρ(∙,θ0),ρ(∙,θ1))=infθ∈C1(0,1)∩C[0,1]θ(0)=θ0,θ(1)=θ1∫01L(θ(t),θ˙(t))dt\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} W^2_Q(\rho (\bullet ,\theta ^0),\rho (\bullet ,\theta ^1)) =\inf _{\begin{array}{c} \theta \in C^1(0,1)\cap C[0,1]\\ \theta (0) =\theta ^0 , \theta (1)=\theta ^1 \end{array}}\left\{ \int _0^1 {\mathcal {L}}(\theta (t), \dot{\theta }(t)) \ dt \right\} \end{aligned}$$\end{document}where L(θ(t),θ˙(t)):=θ˙(t)2GW(θ(t))\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {L}}(\theta (t),\dot{\theta }(t)):=\dot{\theta }(t)^2 G_W(\theta (t))$$\end{document} is the Lagrangian. The associated Euler–Lagrange equation∂1L(θ(t),θ˙(t))-ddt∂2L(θ(t),θ˙(t))=0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \partial _1 {\mathcal {L}}(\theta (t),\dot{\theta }(t)) - \frac{d}{dt} \partial _2 {\mathcal {L}}(\theta (t),\dot{\theta }(t))=0 \end{aligned}$$\end{document}becomesθ˙(t)2GW′(θ(t))-2ddt(θ˙(t)GW(θ(t)))=-θ˙(t)2GW′(θ(t))-2θ¨(t)GW(θ(t))=0.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned}&\dot{\theta }(t)^2 G_W'(\theta (t))- 2 \frac{d}{dt} (\dot{\theta }(t)G_W(\theta (t)))\nonumber \\&\quad =-\dot{\theta }(t)^2 G_W'(\theta (t))-2\ddot{\theta }(t) G_W(\theta (t))=0. \end{aligned}$$\end{document}Using that GW(θ)>0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$G_W(\theta )>0$$\end{document}, we find the identities for ±θ˙(t)>0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\pm \dot{\theta }(t) > 0$$\end{document}ddtlog(GW(θ(t)))=GW′(θ(t))θ˙(t)GW(θ(t))andddtlog(±θ˙(t))=θ¨(t)θ˙(t).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \frac{d}{dt}\log (G_W(\theta (t)))=\frac{G_W'(\theta (t))\dot{\theta }(t)}{G_W(\theta (t))} \text { and } \frac{d}{dt} \log (\pm \dot{\theta }(t)) =\frac{\ddot{\theta }(t)}{\dot{\theta }(t)}. \end{aligned}$$\end{document}Assuming that θ1>θ0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\theta ^1>\theta ^0$$\end{document} in the sequel and thus dropping ± for simplicity, the Euler–Lagrange equation is then equivalent, for some constant k∈R\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$k \in {\mathbb {R}}$$\end{document}, to the ODE-log(GW(θ(t)))+k=log(±θ˙(t)).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} -\log (G_W(\theta (t)))+k= \log (\pm \dot{\theta }(t)). \end{aligned}$$\end{document}Introducing then the function ζ(x):=Li2(x)-Li2(-x)2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\zeta (x):=\frac{\mathrm{Li}_2(x) -\mathrm{Li}_2(-x) }{2}$$\end{document} in terms of the dilogarithm, Li2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathrm{Li}_2$$\end{document}, we can then specify the constant ek\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$e^k$$\end{document} byek:=∫01θ˙(t)θ(t)artanh(θ(t))dt=∫θ0θ1artanh(x)xdx=ζ(θ1)-ζ(θ0).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} e^k:=\int _0^1 \frac{\dot{\theta }(t)}{\theta (t)} \mathrm{artanh}(\theta (t)) \ dt = \int _{\theta ^0}^{\theta ^1} \frac{\mathrm{artanh}(x)}{x} \ dx =\zeta (\theta ^1) -\zeta (\theta ^0). \end{aligned}$$\end{document}In particular, this allows us to explicitly state the solution to the optimal transport problemθ(t)=ζ-1tζ(θ1)+(1-t)ζ(θ0).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \theta (t)= \zeta ^{-1} \left( t \zeta (\theta ^1)+(1-t) \zeta (\theta ^0) \right) . \end{aligned}$$\end{document}We illustrate this by choosing three different pairs of parameters (Fig. 4)θ10=-12,θ11=12,θ20=-910,θ21=910,θ30=-9991000,θ31=9991000.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \theta ^0_1&=-\tfrac{1}{2}, \ \theta ^1_1=\tfrac{1}{2}, \nonumber \\ \theta ^0_2&=-\tfrac{9}{10}, \ \theta ^1_2=\tfrac{9}{10}, \nonumber \\ \theta ^0_3&=-\tfrac{999}{1000}, \ \theta ^1_3=\tfrac{999}{1000}. \end{aligned}$$\end{document}
Illustration of solution (4.7) for three different boundary parameters (4.8)
Anti-commutator Case
We can repeat the previous analysis by considering instead of the Feynman–Kubo–Mori multiplication operator Lρ(θ),Γ(ρ(θ))\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$L_{\rho (\theta ),\Gamma (\rho (\theta ))}$$\end{document} the anti-commutator (2.10) which satisfiesLρ(θ)ac(id)=ρ(θ).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} L^{\mathrm{ac}}_{\rho (\theta )}(\mathrm{id})=\rho (\theta ). \end{aligned}$$\end{document}Thus, using that ∇∗(ρ(θ))=1\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\nabla ^*(\rho (\theta ))=1$$\end{document}, we find that -Δρ(θ)|span(σx)=1\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$-\Delta _{\rho (\theta )} \vert _{\mathrm{span}(\sigma _x)} =1$$\end{document} and therefore alsoGW(θ)≡1.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} G_W(\theta )\equiv 1. \end{aligned}$$\end{document}In particular, since the Lagrangian is just L(θ˙(t))=θ˙(t)2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathcal {L}}(\dot{\theta }(t)) = \dot{\theta }(t)^2$$\end{document}, the geodesics in parameter space are just straight lines as the Euler–Lagrange equation θ¨(t)=0\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\ddot{\theta }(t)=0$$\end{document} immediately shows.
Wasserstein Natural Gradient
We shall now also illustrate the Wasserstein natural gradient for the quantum Fokker–Planck equation as in Example 2 by minimizing the von Neumann entropy as objective functionR(θ)=τ(ρ(θ)log(ρ(θ))).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} R(\theta ) = \tau (\rho (\theta ) \log (\rho (\theta ))). \end{aligned}$$\end{document}From the matrix logarithmlog(ρ(θ))=12log(1-θ2)log1+θ1-θlog1+θ1-θlog(1-θ2)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \log (\rho (\theta )) =\tfrac{1}{2} \begin{pmatrix} \log (1-\theta ^2) &{} \log \left( \frac{1+\theta }{1-\theta } \right) \\ \log \left( \frac{1+\theta }{1-\theta } \right) &{} \log (1-\theta ^2) \end{pmatrix} \end{aligned}$$\end{document}we then immediately see thatR(θ)=12log(1-θ2)+θlog1+θ1-θ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} R(\theta )= \frac{1}{2} \left( \log (1-\theta ^2) + \theta \log \left( \frac{1+\theta }{1-\theta } \right) \right) \end{aligned}$$\end{document}and hence DθR(θ)=artanh(θ).\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$D_{\theta } R(\theta )= \mathrm{artanh}(\theta ).$$\end{document} Therefore, the Wasserstein gradient (3.5) becomes ∇gR(θ)=GW(θ)-1DθR(θ)=-θ.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\nabla _g R(\theta ) = G_W(\theta )^{-1} D_{\theta } R(\theta ) = -\theta .$$\end{document} The gradient descent equation therefore becomes in parameter spaceθ′(t)=-θ(t)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\begin{aligned} \theta '(t) = -\theta (t) \end{aligned}$$\end{document}which implies that we will converge exponentially fast to the unique minimizer the completely mixed state that corresponds to θ=0.\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\theta =0.$$\end{document}
Channel Parameter Estimation-Pushforward of Quantum States
The idea of parameter estimation of probability densities constructed from the pushforward of possibly nonlinear activation functions, relevant for neural networks, has been investigated by the second author in [25].
In quantum theory the framework is somewhat different, since quantum operations on a physical system are described by linear (super)-operators, so-called quantum channels rather than non-linear one-dimensional functions. A quantum channel is a completely positive and trace preserving (CPTP) map. Thus, it is natural to consider the situation where a state is parametrized by the output of a quantum channel Φθ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Phi _{\theta }$$\end{document} depending on some parameter θ\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\theta $$\end{document} which is the quantum analogue of the pushforward of probability measures by parametrized functions.
We shall illustrate how such problems can be studied in our framework by considering the quantum depolarizing channel (4.13) with the quantum fermionic Fokker–Planck equation, introduced in Sect. 2.5, for n=2\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$n=2$$\end{document}.
In this paper, we pull back the quantum Wasserstein-2 metric into a parameterized quantum statistical models. This allows us to develop a quantum Wasserstein/transport information matrix. Using this matrix, we develop the quantum transport natural gradient methods and apply them to the quantum statistical learning problems. Besides, we also consider the optimal control problem of quantum transport natural gradient flows, which leads to the derivation of quantum Schrödinger bridge problem. Several analytical examples, such as the transport of Gaussian states on the statistical manifold in Example 1, the transport of states for the gradient induced by quantum fermionic Fokker–Planck equation in Sect. 4.2 on the statistical manifold, and the parameter estimation problem for channels in Sect. 4.3, are provided.
Our results initialize the joint study among quantum information geometry and quantum optimal transport. We pull back the quantum system dynamics into a finite-dimensional parameter space generated by statistical and machine learning models. We call this area quantum transport information geometry. Here the interaction study between quantum Fisher and quantum Wasserstein information matrices becomes essential. We expect that this joint study would be useful in developing transport estimation theory of quantum information theory, and designing AI-driven quantum computing algorithms for quantum systems. In the future, we will continue this line of study following transport information geometry [26, 30].
Acknowledgements
This work was supported by the EPSRC Grant EP/L016516/1 for the University of Cambridge CDT, the CCA and generous funding from IPAM in Los Angeles (SB). Wuchen Li was supported by University of South Carolina, start up funding.
ReferencesAlickiROn the detailed balance condition for non-Hamiltonian systems19761022492581976RpMP...10..249A57322610.1016/0034-4877(76)90046-XAmariS20161IncorporatedSpringer Publishing Company10.1007/978-4-431-55978-8AyNJostJLêHVSchwachhöferL2017ChamSpringer10.1007/978-3-319-56478-4Becker, S., Datta, N., Lami, L., Rouzé, C.: Convergence rates for the quantum central limit theorem (2019). arXiv:1912.06129BenamouJ-DBrenierYA computational fluid mechanics solution to the Monge–Kantorovich mass transfer problem2000843375393173816310.1007/s002110050002Blume-KohoutROptimal, reliable estimation of quantum states20101240430342010NJPh...12d3034B10.1088/1367-2630/12/4/043034CarlenEMaasJAn analog of the 2-Wasserstein metric in non-commutative probability under which the fermionic Fokker–Planck equation is gradient flow for the entropy20143318879262014CMaPh.331..887C324805310.1007/s00220-014-2124-8CarlenEMaasJGradient flow and entropy inequalities for quantum Markov semigroups with detailed balance20183318879262014CMaPh.331..887C10.1007/s00220-014-2124-8CarlenEMaasJNon-commutative calculus, optimal transport and functional inequalities in dissipative quantum systems20202020178319378405524410.1007/s10955-019-02434-wÇaylak, O., Anatole von Lilienfeld, O., Baumeier, B.: Wasserstein metric for improved QML with adjacency matrix representations (2020). arXiv:2001.11005Chakrabarti, S., Huang, Y., Tongyang, L., Feizi, S., Wu, X.: Quantum Wasserstein GANs. In: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)ChenYLiWOptimal transport natural gradient for statistical manifolds with continuous sample space20203132411900310.1007/s41884-020-00028-0ChenYGeorgiouTPavonMOn the relation between optimal transport and Schrödinger bridges: a stochastic control viewpoint20161692671691348982510.1007/s10957-015-0803-zChen, Y., Georgiou, T.T., Tannenbaum, A.: Wasserstein geometry of quantum states and optimal transport of matrix-valued measures. In: Emerging Applications of Control and Systems Theory, pp. 139–150. Springer, Berlin (2018)ChenYongxinGeorgiouTryphon TTannenbaumAllenMatrix optimal mass transport: a quantum mechanical approach201863826122619384598910.1109/TAC.2017.2767707D’ArianoGMLaurentisMParisMGAPorzioASolimenoSQuantum tomography as a tool for the characterization of optical devices200243S12710.1088/1464-4266/4/3/366De Palma, G., Trevisan, D.: Quantum optimal transport with quantum channels (2019). arXiv:1911.00803EvansLCGangboWDifferential equations methods for the Monge–Kantorovich mass transfer problem199913765316614641490920.49004GolseFMouhotCThierryPaulOn the mean field and classical limits of quantum mechanics201634311652052016CMaPh.343..165G347566410.1007/s00220-015-2485-722GrossDLiuYKFlammiaSBeckerSEisertJQuantum state tomography via compressed sensing2010105151504012010PhRvL.105o0401G10.1103/PhysRevLett.105.150401HayashiM2017BerlinSpringer10.1007/978-3-662-49725-8LaffertyJThe density manifold and configuration space quantization1988305269974192477610.1090/S0002-9947-1988-0924776-9Léonard, C.: A survey of the Schrödinger problem and some of its connections with optimal transport. arXiv:1308.0215 (2013)Li, W.C.: Transport information geometry I: Riemannian calculus on probability simplex (2018). arXiv:1803.06360Li, W.C.: Hessian metric via transport information geometry (2020). arXiv:2003.10526Li, W.C., Montufar, G.: Natural gradient via optimal transport. Information Geometry (2018)Li, W.C., Montufar, G.: Ricci curvature for parameter statistics via optimal transport, Information Geometry (2020)Li, W.C., Zhao, J.: Wasserstein information matrix (2020). arXiv:1910.11248Li, W.C., Yin, P., Osher, S.: Computations of optimal transport distance with Fisher information regularization (2018). arXiv:1704.04605Li, W.C., Liu, S., Zha, H., Zhou, H.: Parametric Fokker–Planck equation, GSI (2019)LvovskyAIRaymerMGContinuous-variable optical quantum state tomography20098112993322009RvMP...81..299L10.1103/RevModPhys.81.299MittnenzweigMMielkeAAn entropic gradient structure for Lindblad equations and couplings of quantum systems to macroscopic models201716722052332017JSP...167..205M362662810.1007/s10955-017-1756-4NagaokaHBelavkinVPHirotaOHudsonRLDifferential geometrical aspects of quantum state estimation and relative entropy1995New YorkPlenum44945210.1007/978-1-4899-1391-3_44NingLGeorgiouTTannenbaumAOn matrix-valued Monge–Kantorovich optimal mass transport2015602373382331016410.1109/TAC.2014.2350171OttoFThe geometry of dissipative evolution equations: the porous medium equation2001262001101174184242910.1081/PDE-100002243ParisMRehacekJ2004BerlinSpringer10.1007/b98673RouzéCDattaNConcentration of quantum states from quantum functional and transportation cost inequalities20196010122022019JMP....60a2202R390354410.1063/1.5023210RouzéDattaNRelating relative entropy, optimal transport and Fisher information: a quantum HWI inequality202021211521502020AnHP...21.2115D411748810.1007/s00023-020-00891-8SchmidtJMarquesMRGBottiSRecent advances and applications of machine learning in solid-state materials science201958310.1038/s41524-019-0221-0TorlaiGMazzolaGCarrasquillaJNeural-network quantum state tomography20181444745010.1038/s41567-018-0048-5VillaniC2009BerlinSpringer10.1007/978-3-540-71050-9VogelKRiskenHDetermination of quasiprobability distributions in terms of probability distributions for the rotated quadrature phase1989405284728491989PhRvA..40.2847V10.1103/PhysRevA.40.2847
Especially in statistical physics, the name Fokker–Planck equation is usually reserved for another equation acting on phase-space variables and the equation considered here is called the (Kramers)–Smoluchowski equation.
Score functions are only defined up to elements in ker(∇)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathrm{ker}(\nabla )$$\end{document}.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.