# Machine Learning | COMP4702 Individual and Ensemble

## Individual and Ensemble

Ensemble learning, also known as multiple classifier system and committee-based learning, trains and combines multiple learners to solve a learning problem.

As shown in Figure 8.1, the typical workflow of ensemble learning is to train a set of individual learners first and then combine them via some strategy. Each individual learner is usually trained by an existing learning algorithm, such as the C4.5 decision tree algorithm or the BP neural network algorithm. An ensemble is said to be homogeneous if all individual learners are of the same type: a “decision tree ensemble” contains only decision trees, while a “neural network ensemble” contains only neural networks. For homogeneous ensembles, the individual learners are called base learners, and the corresponding learning algorithms are called base learning algorithms. In contrast, a heterogeneous ensemble contains different types of individual learners produced by different learning algorithms, so there is no single base learner or base learning algorithm. For heterogeneous ensembles, the individual learners are usually called component learners or simply individual learners.

By combining multiple learners, an ensemble often generalizes much better than an individual learner, and this is especially true for weak learners (learners that perform only slightly better than random guessing). Therefore, theoretical studies on ensemble learning often focus on weak learners, and base learners are sometimes called weak learners. In practice, however, although an ensemble of weak learners can theoretically achieve good performance, people still prefer strong learners, for reasons such as reducing the number of individual learners needed and reusing existing knowledge about the strong learners.
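To illustrate why combining weak learners can help, the following minimal sketch computes the exact accuracy of a majority vote over binary classifiers. It rests on two idealized assumptions that the text above does not make: every learner is correct with the same probability, and the learners' errors are independent. The function name `majority_vote_accuracy` is hypothetical.

```python
from math import comb


def majority_vote_accuracy(num_learners: int, accuracy: float) -> float:
    """Probability that a majority vote over binary classifiers is correct.

    Assumption (idealized): each learner is correct with probability
    `accuracy`, and the learners' errors are mutually independent.
    """
    if num_learners % 2 == 0:
        raise ValueError("use an odd number of learners to avoid ties")
    majority = num_learners // 2 + 1
    # Sum the binomial probabilities of having at least `majority` correct votes.
    return sum(
        comb(num_learners, k) * accuracy**k * (1 - accuracy) ** (num_learners - k)
        for k in range(majority, num_learners + 1)
    )


if __name__ == "__main__":
    for t in (1, 3, 25, 101):
        print(t, majority_vote_accuracy(t, 0.6))
```

Under these assumptions, three learners that are each 60% accurate already reach 64.8% when combined, and the figure keeps rising as more learners are added. Real individual learners are trained on the same problem and are rarely independent, so this should be read as an upper-end illustration rather than a guarantee.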

## Boosting

Boosting is a family of algorithms that convert weak learners into strong learners. A Boosting algorithm first trains a base learner and then adjusts the distribution of the training samples according to the result of that base learner, so that incorrectly classified samples receive more attention from subsequent base learners. The second base learner is then trained on the adjusted training samples, and its result is used to adjust the training sample distribution again. This process repeats until the number of base learners reaches a predefined value $T$; finally, the base learners are weighted and combined.

The most well-known Boosting algorithm is AdaBoost (Freund and Schapire 1997), as shown in Algorithm 8.1, where $y_i \in\{-1,+1\}$ and $f$ is the ground-truth function.
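Since Algorithm 8.1 is not reproduced in this section, the following is a minimal sketch of the commonly taught discrete AdaBoost procedure, not necessarily identical to it line by line. It assumes labels in $\{-1,+1\}$, uses decision stumps as the base learner (a choice made here for brevity), and all function names (`stump_predict`, `train_stump`, `adaboost`, `predict`) are hypothetical.

```python
import math


def stump_predict(row, feature, threshold, polarity):
    """Decision stump: predict `polarity` above the threshold, `-polarity` otherwise."""
    return polarity if row[feature] > threshold else -polarity


def train_stump(X, y, w):
    """Exhaustively pick the stump with the lowest weighted error under weights w."""
    best = None
    for feature in range(len(X[0])):
        values = sorted({row[feature] for row in X})
        # Candidate thresholds: one below the minimum, plus midpoints between neighbors.
        thresholds = [values[0] - 1.0] + [(a + b) / 2 for a, b in zip(values, values[1:])]
        for threshold in thresholds:
            for polarity in (1, -1):
                error = sum(
                    wi for row, yi, wi in zip(X, y, w)
                    if stump_predict(row, feature, threshold, polarity) != yi
                )
                if best is None or error < best[0]:
                    best = (error, feature, threshold, polarity)
    return best


def adaboost(X, y, T):
    """Train at most T weighted stumps; returns a list of (alpha, feature, threshold, polarity)."""
    m = len(X)
    w = [1.0 / m] * m  # start from the uniform sample distribution
    ensemble = []
    for _ in range(T):
        error, feature, threshold, polarity = train_stump(X, y, w)
        if error >= 0.5:  # base learner is no better than random guessing: stop
            break
        error = max(error, 1e-12)  # guard against division by zero for a perfect stump
        alpha = 0.5 * math.log((1 - error) / error)  # weight of this base learner
        ensemble.append((alpha, feature, threshold, polarity))
        # Increase the weight of misclassified samples, decrease the rest, then renormalize.
        w = [
            wi * math.exp(-alpha * yi * stump_predict(row, feature, threshold, polarity))
            for row, yi, wi in zip(X, y, w)
        ]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble


def predict(ensemble, row):
    """Sign of the weighted combination of the base learners' outputs."""
    score = sum(a * stump_predict(row, j, thr, pol) for a, j, thr, pol in ensemble)
    return 1 if score >= 0 else -1


if __name__ == "__main__":
    # Toy 1-D data that no single stump can classify perfectly.
    X = [[0.0], [1.0], [2.0], [3.0], [4.0], [5.0]]
    y = [1, 1, -1, -1, 1, 1]
    model = adaboost(X, y, T=3)
    print([predict(model, row) for row in X])
```

On this toy data a single stump necessarily misclassifies some samples, while three boosted stumps fit the training set, which mirrors the reweighting process described above. The exhaustive stump search is only practical for small datasets.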

There are multiple ways to derive the AdaBoost algorithm, but one that is easy to understand is based on the additive model, that is, using the linear combination of base learners
$$H(\boldsymbol{x})=\sum_{t=1}^T \alpha_t h_t(\boldsymbol{x})$$
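to minimize the exponential loss function (Friedman et al. 2000)
$$\ell_{\exp}(H \mid \mathcal{D})=\mathbb{E}_{\boldsymbol{x} \sim \mathcal{D}}\left[e^{-f(\boldsymbol{x}) H(\boldsymbol{x})}\right],$$
where $\mathcal{D}$ denotes the data distribution.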
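As a worked step under this view, the weight of a single base learner can be obtained in closed form. The notation here is assumed for illustration: $\mathcal{D}_t$ is the sample distribution at round $t$, and $\epsilon_t=P_{\boldsymbol{x} \sim \mathcal{D}_t}\left(h_t(\boldsymbol{x}) \neq f(\boldsymbol{x})\right)$ is the weighted error of $h_t$ under it. Since $f(\boldsymbol{x}) h_t(\boldsymbol{x})$ equals $+1$ on correctly classified samples and $-1$ otherwise, the exponential loss of the term $\alpha_t h_t$ splits into two parts:
$$\mathbb{E}_{\boldsymbol{x} \sim \mathcal{D}_t}\left[e^{-f(\boldsymbol{x}) \alpha_t h_t(\boldsymbol{x})}\right]=e^{-\alpha_t}\left(1-\epsilon_t\right)+e^{\alpha_t} \epsilon_t .$$
Setting the derivative with respect to $\alpha_t$ to zero gives $-e^{-\alpha_t}\left(1-\epsilon_t\right)+e^{\alpha_t} \epsilon_t=0$, and hence
$$\alpha_t=\frac{1}{2} \ln \left(\frac{1-\epsilon_t}{\epsilon_t}\right).$$
A base learner with $\epsilon_t<0.5$ therefore receives a positive weight, and the weight grows as its error shrinks.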


