# Machine Learning (COMP5328): Bounds for the logistic (sigmoid) function

In this section, we use the results on conjugate duality to derive upper and lower bounds to the logistic function, $\boldsymbol{\sigma}(x)=\frac{1}{1+e^{-x}}$.

## Exponential upper bound

The sigmoid function is neither convex nor concave. However, $f(x)=\log \boldsymbol{\sigma}(x)=-\log \left(1+e^{-x}\right)$ is concave, since its second derivative, $f''(x)=-\boldsymbol{\sigma}(x)(1-\boldsymbol{\sigma}(x))$, is negative everywhere. Now, any concave function $f(x)$ can be represented by
$$f(x)=\min _\eta \eta x-f^{\dagger}(\eta)$$
where
$$f^{\dagger}(\eta)=\min _x \eta x-f(x)$$
One can show that if $f(x)=\log \boldsymbol{\sigma}(x)$, then
$$f^{\dagger}(\eta)=-\eta \ln \eta-(1-\eta) \ln (1-\eta)$$
which is the binary entropy function. Hence
$$\begin{aligned} \log \boldsymbol{\sigma}(x) & \leq \eta x-f^{\dagger}(\eta) \\ \boldsymbol{\sigma}(x) & \leq \exp \left(\eta x-f^{\dagger}(\eta)\right) \end{aligned}$$
This exponential upper bound on $\boldsymbol{\sigma}(x)$ is illustrated in Figure 6.13(a).
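
The bound can be checked numerically. The following is a minimal sketch, not part of the original text: the function names and the grids of $x$ and $\eta$ values are illustrative assumptions. It also checks that the bound is tight at $\eta = 1-\boldsymbol{\sigma}(x)$, which follows from setting the derivative of $\eta x-f^{\dagger}(\eta)$ with respect to $\eta$ to zero.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def binary_entropy(eta):
    # f_dagger(eta) = -eta ln(eta) - (1 - eta) ln(1 - eta), valid for 0 < eta < 1
    return -eta * math.log(eta) - (1.0 - eta) * math.log(1.0 - eta)


def sigmoid_upper_bound(x, eta):
    # exp(eta * x - f_dagger(eta)), the exponential upper bound on sigmoid(x)
    return math.exp(eta * x - binary_entropy(eta))


# Illustrative grids (assumed, not from the text)
xs = [i / 10.0 for i in range(-60, 61)]   # x in [-6, 6]
etas = [j / 20.0 for j in range(1, 20)]   # eta in [0.05, 0.95]

# Smallest value of (bound - sigmoid) over the grid; should not be negative
worst_gap = min(sigmoid_upper_bound(x, eta) - sigmoid(x)
                for x in xs for eta in etas)

# Largest deviation when eta is set to its optimal value 1 - sigmoid(x)
tight_gap = max(abs(sigmoid_upper_bound(x, 1.0 - sigmoid(x)) - sigmoid(x))
                for x in xs)

print("smallest bound minus sigmoid:", worst_gap)
print("largest gap at eta = 1 - sigmoid(x):", tight_gap)
```

Each fixed $\eta$ gives a different exponential curve lying above the sigmoid, and each curve touches the sigmoid at the single point where $\boldsymbol{\sigma}(x)=1-\eta$.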

## Quadratic lower bound

It is also useful to compute a lower bound on $\boldsymbol{\sigma}(x)$. If we make this a quadratic lower bound, it will “play nicely” with Gaussian priors, which simplifies the analysis of several models. This approach was first suggested in [JJ96].
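
To see why a quadratic bound is convenient, consider the following sketch. The symbols $a$, $b$, $c$, $\mu$ and $s^2$ are illustrative assumptions and do not appear in the original text. Suppose the bound has the form $\boldsymbol{\sigma}(x) \geq \exp \left(a x^2+b x+c\right)$ with $a \leq 0$, and the prior is $\mathcal{N}\left(x \mid \mu, s^2\right)$. Then
$$\exp \left(a x^2+b x+c\right) \mathcal{N}\left(x \mid \mu, s^2\right) \propto \exp \left(-\frac{1}{2}\left(\frac{1}{s^2}-2 a\right) x^2+\left(\frac{\mu}{s^2}+b\right) x\right)$$
which is an unnormalized Gaussian in $x$ with precision $\frac{1}{s^2}-2 a$ and mean $\left(\frac{\mu}{s^2}+b\right) /\left(\frac{1}{s^2}-2 a\right)$. The product of the bound and the prior therefore stays in closed form.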
First we write
$$\begin{aligned} \log \boldsymbol{\sigma}(x) &=-\log \left(1+e^{-x}\right)=-\log \left(e^{-x / 2}\left(e^{x / 2}+e^{-x / 2}\right)\right) \\ &=x / 2-\log \left(e^{x / 2}+e^{-x / 2}\right) \end{aligned}$$
The function $f(x)=-\log \left(e^{x / 2}+e^{-x / 2}\right)$ is a convex function of $y=x^2$, as can be verified by showing $\frac{d^2 f}{d y^2}>0$. Hence we can create a lower bound on $f$ that is linear in $x^2$, using the conjugate function
$$f^{\dagger}(\eta)=\max _{x^2} \eta x^2-f\left(\sqrt{x^2}\right)$$
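Setting the derivative of $\eta x^2-f(x)$ with respect to $x^2$ to zero, and using $\frac{d x}{d x^2}=\frac{1}{2 x}$ and $\frac{d}{d x} f(x)=-\frac{1}{2} \tanh \left(\frac{x}{2}\right)$, we have
$$0=\eta-\frac{d x}{d x^2} \frac{d}{d x} f(x)=\eta+\frac{1}{4 x} \tanh \left(\frac{x}{2}\right)$$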
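
The stationarity condition and the resulting bound can be checked numerically. The sketch below is not part of the original text: the function names, the contact points `xis` and the grid are illustrative assumptions. It uses only the convexity of $f$ in $x^2$, which implies that the tangent line at $x^2=\xi^2$, with slope $\eta=-\frac{1}{4 \xi} \tanh \left(\frac{\xi}{2}\right)$, lies below $f$. Adding back the $x / 2$ term then gives a quadratic lower bound on $\log \boldsymbol{\sigma}(x)$.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def f(x):
    # f(x) = -log(e^{x/2} + e^{-x/2}), so that log sigmoid(x) = x/2 + f(x)
    return -math.log(math.exp(x / 2.0) + math.exp(-x / 2.0))


def eta_at(xi):
    # Solves 0 = eta + tanh(xi / 2) / (4 xi) for eta (requires xi != 0)
    return -math.tanh(xi / 2.0) / (4.0 * xi)


def slope_in_y(xi, h=1e-6):
    # Central finite difference of f with respect to y = x^2, taken at y = xi^2
    y = xi * xi
    return (f(math.sqrt(y + h)) - f(math.sqrt(y - h))) / (2.0 * h)


def log_sigmoid_lower_bound(x, xi):
    # x/2 plus the tangent line to f, in the variable x^2, touching at xi^2
    return x / 2.0 + f(xi) + eta_at(xi) * (x * x - xi * xi)


# Illustrative values (assumed, not from the text)
xs = [i / 10.0 for i in range(-60, 61)]   # x in [-6, 6]
xis = [0.5, 1.0, 2.5, 4.0]                # contact points

# eta from the stationarity condition should match the numerical slope
slope_error = max(abs(slope_in_y(xi) - eta_at(xi)) for xi in xis)

# Smallest value of (log sigmoid - bound) over the grid; should not be negative
worst_gap = min(math.log(sigmoid(x)) - log_sigmoid_lower_bound(x, xi)
                for x in xs for xi in xis)

# The bound should touch log sigmoid at x = xi
tight_gap = max(abs(math.log(sigmoid(xi)) - log_sigmoid_lower_bound(xi, xi))
                for xi in xis)

print("slope error:", slope_error)
print("smallest log sigmoid minus bound:", worst_gap)
print("largest gap at x = xi:", tight_gap)
```

Because the bound is quadratic in $x$ inside the exponent, exponentiating it gives a Gaussian-shaped function of $x$, which is what makes it combine easily with Gaussian priors.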


