YuanJie's profile关于生活的方方面面PhotosBlogListsMore Tools Help

Blog


    May 24

    a visit to baiti

    白堤全长1千米,东起断桥,经锦带桥而止于平湖秋月。白堤横亘湖上,把西湖划分为外湖和里湖,并将孤山和北山连接在一起。白堤在唐代原名白沙堤,宋代又叫孤山路。明代堤上广植桃柳,景色绚烂,故又称十锦塘。平静坦荡、景色秀美的白堤,堤上内层是婀娜多姿的垂柳,外层是绚丽多彩的碧桃,远望如一条彩色的锦带。逢春日,暖风熏面,景致绝佳。
     
    照片见相册。
    May 23

    Talking about www.wolframalpha.com

    最近 , wolframalpha上线了,号称是要和GOOGLE 比拼的一个 SEARCH ENGINE, 卖点在于"Computational Knowledge"。
     
    我试用了一下,感觉其特点主要在2方面:
    1. 对QUERY的结构化表示 , 比如shanghai beijing distance 会变成 distance: from shanghai to beijing
     这点和SEMANTIC WEB的理念很像, 而这个功能感觉非常像 JACKSON他们以前ISWC做的 KEYWORD QUERY TO SEMANTIC QUERY (不知道我有没有理解错?)
     
    2. 做了非常多的SUBENGINE, 比如 化学分子式的显示 , 地图的显示 , 乐谱,计算器等等, 搜索的结果是各个SUBENGINE RESULT的 融合。
     
    不过,该SEARCH ENGINE目前对NATURAL LANGUAGE QUERY支持的不好, 而且我个人感觉其背后的INDEX也不是全网的INDEX?
     
     
     
    May 19

    MSN QnA Beta Shutdown

    Recent news reported that the MSN QnA beta will be shutdown recent days. Actually I touch this site at 2006. At that time, the site was just built and few user use this site. The site has a feature that the question can just have tags but the question can not be categorized.  In my opinion, the lost of category information introduce difficulty for  user browsing existing questions. Maybe it is good for websites, coz website has the property of diversity, more or less it will have some different features, while a question is more focus. Users can hardly focus on one specific field in the mode of MSN QnA. Also, the MSN QnA did not have a lot of advertisement which made it harder to compete against Yahoo! Answers.
     
    so what can be improved in current qa services?
     
    In SIGIR09, there existed 2 papers i found related to cqa. They still focused on similar question finding and anwer ranking, which was an old topic as I thought. Although these topics did have some effect for improving current cqa site, they can not exploit the real business value of the qa service.
     
    so in my point:
     
    in general view, one important thing is to extract the real knowledge from these sites, excluding many spam informations(the spam defined here might be just chatting threads like in forum). The "knowledge" defined here is the "General" , "Abstract" Knowledge which can be extracted and be put into wikipedia not just "query-specific" knowledge.
     
    in user view, the large value of these site is to  mine user's information according to their question and answers, including users' interests, jobs, or other information. These data may be a great resource for company business.
     
    What's your point?