Active Learning with Generalized Queries 报告人: Charles Ling, PhD, Professor 时间:5月18日(周一)下午3点 Abstract: t their labels to reduce the number of labeled examples needed for building an accurate classifier. However, previous works of active learning can only ask specific queries with all attribute values, many of which may be irrelevant. A more natural and powerful way is to ask ``generalized queries'' with only rel evant attributes, such as ``are people over 50 with knee pain likely to have o steoarthritis?'' (with only two attributes: age and type of pain while omittin g many other irrelevant ones, such as fever, blood type, etc.). The power of a sking such generalized queries is that one generalized query may be equivalent to many specific ones. However, overly general queries may receive uncertain labels from the oracle, and this makes learning difficult. s. We demonstrate experimentally that our new method asks significantly fewer queries compared with the previous works of active learning. Our method can be readily deployed in real-world data mining tasks where obtaining labeled exam ples is costly. Bio: both of his MSc and PhD from Computer and Information Science at Univ of Penn sylvania (Ivy League) within four years. estern Ontario, Canada. He is currently a Professor. deling, and child education. ences. , and IEEE Senior Member. lopment in CRM, Bioinformatics, and the Internet. |
