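In machine learning, transfer learning is a relatively new term. Few groups in China work on it at present; the only ones I know of are Professor Qiang Yang of the Hong Kong University of Science and Technology and the machine learning group at Shanghai Jiao Tong University. In recent years they have produced a great many results and published more than ten papers at top AI conferences, which I admire enormously. In the graduate research ahead of me, I hope to follow in their footsteps and slowly feel my way forward!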

Qiang Yang

Sinno Jialin Pan



Transfer Learning


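In the traditional machine learning framework, the task is to learn a classification model from sufficient training data, and then use the learned model to classify and predict test documents. However, machine learning algorithms face a key problem in current Web mining research: in newly emerging domains, large amounts of training data are very hard to obtain. Web applications develop very quickly, and new domains keep appearing, from traditional news to web pages, images, blogs, podcasts and so on. Traditional machine learning needs a large amount of labeled training data for every domain, which costs a great deal of manpower and resources. Without such labeled data, much learning-related research and many applications cannot proceed.

Second, traditional machine learning assumes that the training data and the test data follow the same distribution. In many cases this identical-distribution assumption does not hold, for example when the training data become outdated. We then have to label a large amount of new training data to meet our training needs, but labeling new data is very expensive. Looked at from another angle, if we already have plenty of training data drawn from a different distribution, discarding it entirely is also very wasteful. How to make reasonable use of such data is the main problem that transfer learning addresses. Transfer learning can transfer knowledge from existing data to help future learning: its goal is to apply knowledge learned in one environment to a learning task in a new environment. For this reason, unlike traditional machine learning, transfer learning does not make the identical-distribution assumption.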



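The basic idea of instance-based transfer learning is that, although the auxiliary training data differ to some degree from the source training data (here, the labeled data that follow the same distribution as the test data), some part of the auxiliary training data should still be suitable for training an effective classification model that fits the test data. Our goal is therefore to find the instances in the auxiliary training data that suit the test data, and to transfer those instances into the learning on the source training data. For instance-based transfer learning, we generalized traditional boosting and proposed a boosting algorithm with transfer ability, TrAdaBoost [9], which makes the greatest possible use of the auxiliary training data to help classification on the target. Our key idea is to use boosting to filter out the auxiliary data that are least like the source training data. Boosting serves here as a mechanism that adjusts weights automatically: the weights of important auxiliary training data increase, and the weights of unimportant auxiliary training data decrease. After the weights are adjusted, the weighted auxiliary training data act as additional training data and are used together with the source training data to improve the reliability of the classification model.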
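The reweighting mechanism described above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: it follows the update rule usually given for TrAdaBoost (misclassified auxiliary examples are down-weighted by a fixed factor, misclassified source examples are up-weighted as in AdaBoost), and the decision-stump base learner, the function names, and the toy data are all assumptions made for the example.

```python
import numpy as np

def fit_stump(X, y, w):
    """Fit a weighted decision stump; returns (feature, threshold, polarity)."""
    best, best_err = None, np.inf
    for j in range(X.shape[1]):
        for thr in np.unique(X[:, j]):
            for pol in (1, -1):
                pred = (pol * (X[:, j] - thr) >= 0).astype(int)
                err = np.sum(w * (pred != y))
                if err < best_err:
                    best, best_err = (j, thr, pol), err
    return best

def predict_stump(stump, X):
    j, thr, pol = stump
    return (pol * (X[:, j] - thr) >= 0).astype(int)

def tradaboost(X_aux, y_aux, X_src, y_src, n_rounds=10):
    """Return the weak learners, their betas, and the final auxiliary weights."""
    n = len(X_aux)                       # auxiliary examples come first
    X = np.vstack([X_aux, X_src])
    y = np.concatenate([y_aux, y_src])
    w = np.ones(len(X))
    # Fixed shrink factor (< 1 when n > 1) applied to misclassified auxiliary examples.
    beta_aux = 1.0 / (1.0 + np.sqrt(2.0 * np.log(n) / n_rounds))
    stumps, betas = [], []
    for _ in range(n_rounds):
        w = w / w.sum()                  # normalise weights to a distribution
        stump = fit_stump(X, y, w)
        miss = np.abs(predict_stump(stump, X) - y)   # 1 where misclassified
        # The error is measured on the source (same-distribution) data only.
        eps = np.sum(w[n:] * miss[n:]) / np.sum(w[n:])
        eps = np.clip(eps, 1e-10, 0.499)             # keep beta_t finite and < 1
        beta_t = eps / (1.0 - eps)
        w[:n] *= beta_aux ** miss[:n]                   # shrink bad auxiliary data
        w[n:] *= beta_t ** (-miss[n:].astype(float))    # boost hard source data
        stumps.append(stump)
        betas.append(beta_t)
    return stumps, betas, w[:n]

def predict(stumps, betas, X):
    """Weighted vote over the second half of the rounds."""
    start = int(np.ceil(len(stumps) / 2)) - 1
    score, cutoff = np.zeros(len(X)), 0.0
    for stump, beta_t in zip(stumps[start:], betas[start:]):
        alpha = np.log(1.0 / beta_t)
        score += alpha * predict_stump(stump, X)
        cutoff += 0.5 * alpha
    return (score >= cutoff).astype(int)

# Toy data (hypothetical): the true rule is "label 1 when x > 0".
X_src = np.linspace(-1.0, 1.0, 20).reshape(-1, 1)
y_src = (X_src[:, 0] > 0).astype(int)
X_good = np.linspace(-0.95, 0.95, 20).reshape(-1, 1)   # auxiliary, consistent labels
X_bad = np.linspace(-0.9, 0.9, 20).reshape(-1, 1)      # auxiliary, flipped labels
X_aux = np.vstack([X_good, X_bad])
y_aux = np.concatenate([(X_good[:, 0] > 0).astype(int),
                        (X_bad[:, 0] <= 0).astype(int)])

stumps, betas, w_aux = tradaboost(X_aux, y_aux, X_src, y_src)
labels = predict(stumps, betas, np.array([[-0.5], [0.5]]))
```

On this toy data, the auxiliary examples whose labels contradict the source data should end up with a much smaller average weight than the consistent ones, which is the filtering effect the paragraph describes. A real application would replace the stump with a stronger base learner that accepts sample weights.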




2.1 Feature-based supervised transfer learning


2.2 Feature-based unsupervised transfer learning: self-taught clustering

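The self-taught clustering algorithm we proposed [3] belongs to feature-based unsupervised transfer learning. The problem considered here is this: in practice even labeled auxiliary data may be hard to obtain, so how can a large amount of unlabeled auxiliary data be used for transfer learning? The basic idea of self-taught clustering is to cluster the source data and the auxiliary data at the same time in order to obtain a common feature representation. Because this new representation is built on a large amount of auxiliary data, it is better than one derived from the source data alone, and it therefore helps the clustering.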
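One way to write this idea down is as an information-theoretic co-clustering objective. The notation below is assumed for illustration and does not come from the text above: X is the source data, Y the auxiliary data, Z the shared feature space, a tilde marks the clustered version of a variable, I(·,·) is mutual information, and λ is a trade-off weight.

```latex
\mathcal{J} \;=\; I(X, Z) - I(\tilde{X}, \tilde{Z})
  \;+\; \lambda \left[\, I(Y, Z) - I(\tilde{Y}, \tilde{Z}) \,\right]
```

Minimizing J keeps the loss in mutual information small for both co-clusterings at once. The two terms share the same feature clustering \tilde{Z}, and that shared clustering is the channel through which the auxiliary data shape the feature representation used to cluster the source data.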


3 Transfer learning across heterogeneous spaces: translated learning



List of Conferences and Workshops Where Transfer Learning Papers Appear


This webpage will be updated regularly.

Main Conferences

Machine Learning and Artificial Intelligence Conferences
Transfer Learning for Structured Data (TLSD-09)

Workshop, in conjunction with , Dec 7-12, 2009, Vancouver, B.C., Canada




Recently, transfer learning (TL) has gained much popularity as an approach to reduce the training-data calibration effort as well as improve the generalization performance of learning tasks. Unlike traditional learning, transfer learning methods make the best use of data from one or more source tasks in order to learn a target task. Many previous works on transfer learning have focused on transferring knowledge across domains where the data are assumed to be i.i.d. In many real-world applications, such as identifying entities in social networks or classifying Web pages, data are often intrinsically non-i.i.d., which presents a major challenge to transfer learning. In this workshop, we call for papers on the topic of transfer learning for structured data. Structured data are those that have certain intrinsic structures, such as network topology, and they present several challenges to knowledge transfer. A first challenge is how to judge the relatedness between tasks and avoid negative transfer. Since the data are non-i.i.d., standard methods for measuring the distance between data distributions, such as KL divergence, Maximum Mean Discrepancy (MMD) and A-distance, may not be applicable. A second challenge is that the target and source data may be heterogeneous. For example, the source domain may be a bioinformatics network while the target domain is a network of web pages. In this case, deep transfer or heterogeneous transfer approaches are required.
Heterogeneous transfer learning for structured data is a new area of research, which concerns transferring knowledge between different tasks where the data are non-i.i.d. and may even be heterogeneous. This area has emerged as one of the most promising areas in machine learning. In this workshop, we wish to boost the research activities of knowledge transfer across structured data in the machine learning community. We welcome theoretical and applied contributions that make efforts (1) to expose novel knowledge transfer methodologies and frameworks for transfer mining across structured data; and (2) to investigate effective (automated or human-machine cooperative) principles and techniques for acquiring, representing, modeling and engaging transfer learning on structured data in real-world applications.
This workshop on transfer learning for structured data will bring together active researchers in artificial intelligence, machine learning and data mining to develop methods or systems and to explore ways of solving real-world problems associated with learning on structured data. The workshop invites researchers interested in transfer learning, statistical relational learning and structured data mining to contribute their recent work on the topics of interest.
