导航

Beyond Search

最好走的路越走越难,最难走的路越走越容易

« Google Search Quality 官方说明Google 评价 blog 的指标 »

Are Machine-Learned Models Prone to Catastrophic Errors?

最近忙,paper 看得多,blog 看得少,险些错过一些非常有意思的文章。上一次提到的 "Introduction to Google Search Quality" 算一篇,这次要说的是另外一篇 "Are Machine-Learned Models Prone to Catastrophic Errors?"。 不过这两个 blog 都被我们伟大的 GFW 拌掉了。

Peter Norvig 这样的大师的意见,我们需要仔细体会。我整理一下我感兴趣的。
  1. tow phase of google search algorithms
    • An offline phase, which is time-consuming and query-independent.
    • An on-line phrase, in response to a user query in a few milliseconds.
  2. Tons of training data ... from the armies of "raters" employed by Google
  3. The big surprise is that Google still uses the manually-crafted formula for its search results, despite the fact that, their best machine-learned model is now as good as, and sometimes better than, the hand-tuned formula on the results quality metrics that Google uses.
  4. two reasons
    • the human experts who created the algorithm believe they can do better than a machine-learned model
    • Google's search team worries that machine-learned models may be susceptible to catastrophic errors on unforeseen query types, which is different from the training data.
  5. Nassim Taleb divides Black Swan phenomena into two classes
    • Mediocristan
    • Extremistan
  6. The current generation of machine learning algorithms can work well in Mediocristan but not in Extremistan.

So the thing is, how to figure out whether new machine learning algorithms can be devised that work well in Extremistan, or prove that it cannot be done?

发表评论:

◎欢迎参与讨论,请在这里发表您的看法、交流您的观点。

Powered By Z-Blog 1.7 Laputa Build 70216 .Templete from Google黑板报

Copyright 2007-2008 Guwendong.cn. All Rights Reserved.

我的豆瓣九点

Feed@FeedSky

网站目录

Search

  •  

文章归档