自標籤 (Self-Labeled) 介紹

時間 2021-04-01 標籤算法 ide 學習 spa orm ci rem input it io

本文將對自標籤（self-labeled）做簡要介紹，主要包括定義和分類。其中定義給出中英文對照。文章參考自[1]。算法

定義

首先是對這類方法的定義，以下圖所示ide

Semi-supervised learning (SSL):

結合監督學習和無監督學習來給模式識別提供額外信息。
An extension of unsupervised and supervised learning by including additional information typical of the other learning paradigm.學習

SSL 分爲如下兩類：spa

Semi-supervised classification (SS-Cla):
關注於半監督分類問題orm
Semi-supervised clustering (SS-Clu):
關注於半監督聚類問題ci

Self-Labeled 方法關注(SS-Cla)，即分類問題。rem

Self-Labeled Method:

自標籤方法通常指經過標註無標籤樣原本擴充數據集(EL)。
These techniques aim to obtain one (or several) enlarged labeled set(s) (EL), based on their most confident predictions, to classify unlabeled data.input

Self-training:
利用帶標註樣本訓練一個分類器，給無標籤樣本標註。而後使用置信度高的無標籤標註樣本擴充數據集EL來retrain模型。
A classifier is trained with an initial small number of labeled examples, aiming to classify unlabeled points. Then it is retrained with its own most confident predictions, enlarging its labeled training set. This model does not make any specific assumptions for the input data, but it accepts that its own predictions tend to be correct.it
Co-training:
訓練多個分類器，各個分類器互相用各自的置信度高的樣本學習。
It trains one classifier in each specific view, and then the classifiers teach each other the most confidently predicted examples. Multi-view learning for SSC is usually understood to be a generalization of co-training.io

自標籤 (Self-Labeled) 介紹

定義

Semi-supervised learning (SSL):

Semi-supervised classification (SS-Cla):

Semi-supervised clustering (SS-Clu):

Self-Labeled Method:

Self-training:

Co-training:

分類

根據 Addition mechanism:

Incremental：

Batch：

Amending：

根據 Single-learning versus multi-learning:

single-learning：預測由單一分類算法/分類器給出。

multi-learning：預測由分類器給出。

根據 Single-view versus multi-view:

multi-view

single-view

根據 Confidence measures:

Simple

Agreement and combination

根據 Self-teaching versus mutual-teaching:

mutual-teaching：每種分類器互相提供各自的EL。

Self-teaching：每種分類器使用各自的EL。

根據 Stopping criteria:

選擇全集

選擇部分

假設不變