¡°²éѯ¹Ø¼ü×ÖÀí½â¡±Èý²¿Çú
µÚÒ»£º²éѯ¹Ø¼ü×Ö·ÖÀà
²éѯ¹Ø¼ü×ÖÀí½â£¨Query Understanding£©¡£Ò²¾ÍÊÇ˵£¬ÎÒÃÇÏ£Íûͨ¹ý²éѯ¹Ø¼ü×ÖÀ´Á˽âÓû§ÖÖÖÖÐÐΪ±³ºóµÄÄ¿µÄ¡£²éѯ¹Ø¼ü×Ö²úÉúµÄÌØÕ÷£¨Feature£©ÍùÍùÊǺÜÇ¿µÄÖ¸µ¼ÒòËØ£¬Ò²ÊǸöÐÔ»¯ËÑË÷½á¹û·Ç³£ÖØÒªµÄԴȪ¡£Òò´Ë£¬ÉîÈëÁË½â²¢ÕÆÎÕ²éѯ¹Ø¼ü×ÖÀí½â·½ÃæµÄ¼¼Êõ¾Í±äµÃºÜÓбØÒª¡£
²éѯ¹Ø¼ü×ÖÀí½â×î»ù±¾µÄÒ»¸ö²½Öè¾ÍÊǸø²éѯ¹Ø¼ü×Ö·ÖÀà
£¨Classification£©£¬¿´ÕâЩ²éѯ¹Ø¼ü×ÖÓÐʲôÓû§Òâͼ£¨Intent£©¡£½ñÌìÎÒ¾ÍÀ´ÁÄÒ»ÁIJéѯ¹Ø¼ü×Ö·ÖÀàµÄһЩ»ù±¾¸ÅÄîºÍ¼¼Êõ£¬ÈÃÄã¶ÔÕâ·½ÃæµÄ¿ª·¢ºÍÑо¿ÓÐÒ»¸ö»ù±¾ÈÏʶ¡£
²éѯ¹Ø¼ü×Ö·ÖÀàµÄÀúÊ·
´ÓÉÌÒµËÑË÷ÒýÇæ¿ªÊ¼ÃæÊÀµÄµÚÒ»ÌìÆð£¬ÈËÃǾͷ¢ÏÖ£¬¿ÉÒÔ´Ó²éѯ¹Ø¼ü×ÖÖеõ½ºÜ¶àÓû§µÄÐÅÏ¢£¬ÌرðÊÇÀí½âÓû§µÄÒâͼ¡£ÔçÔÚ 1997 Ä꣬ÉÌÒµËÑË÷ÒýÇæ Excite ¾Í¿ªÊ¼Á˰ÙÍò¼¶±ð²éѯ¹Ø¼ü×ÖµÄÑо¿¹¤×÷¡£È»¶ø£¬ÕæÕý¶Ô²éѯ¹Ø¼ü×Ö·ÖÀà½øÐÐϵͳ²ûÊöµÄÊǰ²µÂÁÒ¡¤²¼Â޵£¨Andrei Broder£©µÄÂÛÎÄ¡¶ÍøÒ³ËÑË÷·ÖÀà¡·£¨A Taxonomy of Web Search£©¡£
°²µÂÁÒºÜÓÐÃûÍ·£¬ÔÚ˹̹¸£´óѧ¹¥¶Á²©Ê¿ÆÚ¼äʦ´ÓͼÁé½±µÃÖ÷¸ßµÂÄÉ£¨Donald Knuth£©£¬È»ºóÔÚÔø¾ÃûÔëһʱµÄµÚÒ»´úËÑË÷ÒýÇæ¹«Ë¾ AltaVista£¨ºó±»ÑÅ»¢ÊÕ¹º£©µ£ÈÎÊ×ϯ¿ÆÑ§¼Ò£¬Ö®ºó¼ÓÈëλÓÚŦԼµÄ IBM Ñо¿Ôº×齨ÆóÒµ¼¶ËÑ
Ë÷ƽ̨£¬2012 Äêºó¼ÓÈë Google£¬µ£Èνܳö¿ÆÑ§¼Ò£¨Distinguished Scientist£©¡£Ëû»¹ÊÇ ACM£¨Association of Computing Machinery£¬¼ÆËã»úлᣩºÍ IEEE£¨Institute of Electrical and Electronics Engineers£¬µçÆøµç×Ó¹¤³Ìʦѧ»á£©µÄË«ÁÏԺʿ¡£
°²µÂÁÒµÄÕâÆªÂÛÎÄ¿ÉÒÔ˵Êǵ춨Á˲éѯ¹Ø¼ü×Ö·ÖÀàµÄ¼áʵ»ù´¡¡£ÕâÖ®ºóÑо¿ÈËÔ±µÄºÜ¶à¹¤×÷¶¼ÊÇÎ§ÈÆ×ÅÈçºÎ×Ô¶¯»¯·ÖÀà¡¢ÈçºÎ¶¨Òå¸ü¼Ó¾«Ï¸µÄÓû§ÒâͼÀ´Õ¹¿ªµÄ¡£
²éѯ¹Ø¼ü×Ö·ÖÀàÏê½â
ÎҾʹӰ²µÂÁÒÕâÆª·Ç³£ÓÐÃûµÄÎÄÕÂ˵Æð¡£ÔÚÍøÂçËÑË÷£¨Web Search£©³ÉΪ±È½ÏÖ÷Á÷µÄ×Éѯ²éѯÊÖ¶Î֮ǰ£¬´«Í³µÄÐÅÏ¢¼ìË÷ÈÏΪ£¬²éѯµÄÖ÷ҪĿµÄÊÇÍê³ÉÒ»¸ö³éÏóµÄ¡°ÐÅÏ¢ÐèÇó¡±£¨Information Needs£©¡£ÔÚ´«Í³ÐÅÏ¢¼ìË÷µÄÊÀ½çÀ×îÖ÷ÒªµÄÓ¦ÓÃÓ¦¸ÃÊÇͼÊé¹Ý¼ìË÷»òÕßÕþ¸®Ñ§Ð£µÈÆóÊÂÒµµ¥Î»µÄ¼ìË÷¡£Òò´Ë£¬ÔÚÕâÑùµÄ³¡¾°Ï£¬¼Ù¶¨Ã¿Ò»¸ö²éѯÖ÷ÒªÊÇÂú×ãij¸ö¡°ÐÅÏ¢ÐèÇó¡±¾ÍÏԵúÜÓеÀÀíÁË¡£
È»¶ø£¬ÔçÔÚ 2002 Ä꣬°²µÂÁÒ¾ÍÈÏΪÕâÑùµÄ´«Í³¼Ù¶¨ÒѾ²»ÊʺÏÍøÂçʱ´úÁË¡£Ëû¿ªÊ¼°Ñ²éѯ¹Ø¼ü×ÖËù´ú±íµÄÄ¿µÄ»®·ÖΪÈý¸ö´óÀࣺ
µ¼º½Ä¿µÄ£¨Navigational£©£» ÐÅϢĿµÄ£¨Informational£©£» ½»Ò×Ä¿µÄ£¨Transactional£©¡£
´ËºóÊ®¶àÄêÀ²éѯ¹Ø¼ü×ÖµÄÕâÈý´ó·ÖÀà¶¼ÊÇÕâ¸ö·½ÏòÑо¿ºÍʵ¼ùµÄ»ùʯ¡£ÎÒÃÇÏÈÀ´¿´Õâ¸ö·ÖÀàµÄÄÚº¡£
µÚÒ»À࣬ÒÔµ¼º½ÎªÒâͼµÄ²éѯ¹Ø¼ü×Ö£¬ÕâÀà²éѯ¹Ø¼ü×ÖµÄÄ¿±êÊǴﵽij¸öÍøÕ¾¡£ÕâÓпÉÄÜÊÇÓû§ÒÔǰ·ÃÎʹýÕâ¸öÍøÕ¾£¬»òÕßÊÇÓû§¼ÙÉèÓÐÕâôһ¸ö¹ØÓÚËùÌá½»²éѯ¹Ø¼ü×ÖµÄÍøÕ¾¡£ÕâÒ»Àà²éѯ¹Ø¼ü×Ö°üÀ¨¹«Ë¾µÄÃû×Ö£¨È硰΢Èí¡±£©¡¢È˵ÄÃû×Ö£¨Èç¡°°Â°ÍÂí¡±£©»òÕßij¸ö·þÎñµÄÃû×Ö£¨Èç¡°Áª°î¿ìµÝ¡±£©µÈ¡£
´ËÀà²éѯ¹Ø¼ü×ÖµÄÒ»¸öÖØÒªÌØµã¾ÍÊÇ£¬ÔÚ´ó¶àÊýÇé¿öÏ£¬ÕâЩ²éѯ¹Ø¼ü×Ö¶¼¶ÔӦΨһµÄ»òÕߺÜÉٵġ°±ê×¼´ð°¸¡±ÍøÕ¾¡£±ÈÈ磬ËÑË÷¡°Î¢Èí¹«Ë¾¡±£¬Ï£ÍûÄܹ»ÕÒµ½µÄ¾ÍÊÇ΢Èí¹«Ë¾µÄ¹Ù·½ÍøÕ¾¡£ÁíÒ»·½ÃæÊÇ˵£¬Ä³Ð©¡°ÐÅÏ¢¼¯³É¡±ÍøÕ¾Ò²ÊÇ¿ÉÒÔ½ÓÊܵġ°´ð°¸¡±¡£±ÈÈ磬²éѯ¡°°Â°ÍÂí¡±£¬ËÑË÷·µ»ØµÄ½á¹ûÊÇÒ»¸öÁоÙÁËËùÓÐÃÀ¹ú×ÜͳµÄÍøÕ¾¡£
µÚ¶þÀ࣬ÒÔÐÅϢΪÒâͼµÄ²éѯ¹Ø¼ü×Ö£¬ÕâÀà²éѯ¹Ø¼ü×ÖµÄÄ¿±êÊÇËѼ¯ÐÅÏ¢¡£ÕâÒ»ÀàµÄ²éѯºÍ´«Í³µÄÐÅÏ¢¼ìË÷·Ç³£½Ó½ü¡£ÖµµÃÌá¼°µÄÊÇ£¬´ÓºóÃæµÄÑо¿½áÂÛÀ´¿´£¬ÕâÒ»Àà²éѯ¹Ø¼ü×ÖËù°üº¬µÄÄ¿±ê²»½ö½öÊÇѰÕÒµ½Ä³ÀàȨÍþÐÔÖÊ£¨Authority£©µÄÍøÒ³£¬»¹°üÀ¨ÁоÙȨÍþÐÅÏ¢µÄË׳ơ°½áµã¡±£¨Hub£©µÄÍøÕ¾¡£
µÚÈýÀ࣬ÒÔ½»Ò×ΪÒâͼµÄ²éѯ¹Ø¼ü×Ö£¬ÕâÀà²éѯ¹Ø¼ü×ÖµÄÄ¿±êÊǵ½´ïÒ»¸öÖмäÕ¾µã´Ó¶ø½øÒ»²½Íê³É¡°½»Òס±£¨Transaction£©¡£ÕâÒ»Àà²éѯ¹Ø¼ü×ÖµÄÖ÷Òª¶ÔÏó¾ÍÊÇ¡°¹ºÎ¡£ÏÖÔÚÎÒÃǶԡ°µç×ÓÉÌÎñ¡±µÄ̬¶È¿ÉÒÔ˵ÊǷdz£×ÔÈ»ÁË£¬µ«ÊÇÊ®¶àÄêǰ£¬ÔÚ´«Í³ÐÅÏ¢¼ìË÷½çͳÖεÄËÑË÷Ñо¿ÁìÓò£¬Ìá³ö¡°½»Òס±ÀàÐ͵IJéѯ¹Ø¼ü×Ö¿ÉÒÔ˵ÊǺÜÓÐÐÂÒâµÄ¡£
µ±È»£¬ÕâÑùµÄ·ÖÀàÈç¹û½ö½öÊǸÅÄîÉϵÄÇø·ÖÄǾÍûÓÐÌ«´óµÄÒâÒå¡£°²µÂÁÒÀûÓÃËÑË÷ÒýÇæ AltaVista ½øÐÐÁËÒ»´Îµ÷²éÑо¿£¬Õâ´Îµ÷²éÓдóÔ¼ 3 ǧ¶àµÄÓû§·´À¡¡£Ïëµ½ÕâÊÇÔÚ 2001 ÄêµÄµ÷²é£¬¿ÉÒÔ˵ÒѾÊÇ´ó¹æÄ£µÄÑо¿ÁË¡£
Õâ´Îµ÷ÑеĽá¹ûÊÇÕâÑùµÄ£ºÔÚÓû§Ìá½»µÄÐÅÏ¢ÖУ¬µ¼º½ÀàÐ͵IJéѯ¹Ø¼ü×ÖÕ¼ 26%£¬½»Ò×ÀàÐ͵IJéѯ¹Ø¼ü×ÖÕ¼µ½ÁË 24%£¬¶øÊ£ÏµĽ«½ü 50% ÊÇÐÅÏ¢ÀàÐ͵IJéѯ¹Ø¼ü×Ö£¬Óû§µÄÈÕÖ¾£¨Log£©·ÖÎö½øÒ»²½Ö¤ÊµÁËÕâÒ»Êý¾Ý¡£
Äã¿ÉÒÔ¿´µ½£¬ÕâÖְѲéѯ¹Ø¼ü×Ö½øÐзÖÀàµÄÑо¿ÊǶÔÓû§ÐÐΪ½øÐн¨Ä£µÄ±ØÒª²½Öè¡£ÓÚÊÇ£¬ºÜ¿ì¾ÍÓв»ÉÙÑо¿ÈËÔ±Ðáµ½Á˲éѯ¹Ø¼ü×Ö·ÖÀàµÄ¼ÛÖµ¡£È»¶ø£¬ÍêÈ«ÒÀ¿¿Óû§Ö±½Ó·´À¡À´»ñÈ¡ÕâÀàÐÅÏ¢Ôò±äµÃÔ½·¢À§ÄÑ¡£
ÕâÀïÖ÷ÒªÓÐÈý¸öÔÒò¡£µÚÒ»£¬²»¿ÉÄܼÄÏ£ÍûÓÚÓû§»ã±¨×Ô¼ºËùÓйؼü×ÖµÄÒâͼ£»µÚ¶þ£¬Ãæ¶ÔÒÚÍòÓû§ÊäÈëµÄ²éѯ¹Ø¼ü×Ö£¬ÊÖ¹¤±ê×¢Ò²ÊDz»¿ÉÄܵģ»×îºó£¬°²µÂÁÒµÄÈýÀà·ÖÀ໹ÊÇÌ«´ÖáîÁË£¬ÔÚʵ¼ÊÓ¦ÓÃÖÐÏ£ÍûµÃµ½¸ü¼Óϸ¿ÅÁ£¶ÈµÄÓû§Òâͼ¡£
°Ñ²éѯ¹Ø¼ü×Ö·ÖÀàÎÊÌâת»»³ÉΪ±ê×¼µÄ»úÆ÷ѧϰÈÎÎñÆäʵºÜÖ±¹Û¡£È·ÇеØËµ£¬ÕâÀïÐèÒª×öµÄÊǰѲéѯ¹Ø¼ü×Ö·ÖÀàת»»³ÉΪ¼à¶½Ñ§Ï°ÈÎÎñ¡£ÕâÀÿһ¸ö²éѯ¹Ø¼ü×Ö£¬¾ÍÊÇÒ»¸öÊý¾ÝÑù±¾£¬¶øÏìÓ¦±äÁ¿£¬ÔòÊǶÔÓ¦µÄÀà±ð¡£¾ßÌåÇé¿öÈ¡¾öÓÚÎÒÃǵÄÈÎÎñÊǽö½ö°Ñ²éѯ¹Ø¼ü×Ö·ÖΪ¼¸¸öÀà±ð£¬²¢ÇÒÈÏΪÕâЩÀà±ðÖ®¼äÊÇ»¥Ïà¶ÀÁ¢µÄ£¬»¹ÊÇÈÏΪÕâЩÀà±ðÊÇ¿ÉÒÔͬʱ´æÔڵġ£
ÔÚ×î¼òµ¥µÄ¼ÙÉèÏ£¬²éѯ¹Ø¼ü×Ö·ÖÀà¾ÍÊÇÒ»¸öÆÕͨµÄ¶àÀà·ÖÀàÎÊÌ⣬¿ÉÒÔʹÓÃÆÕÊʵĶàÀà·ÖÀàÆ÷£¬±ÈÈçÖ§³ÖÏòÁ¿»ú£¨SVM£©¡¢Ëæ»úÉÁÖ£¨Random Forest£©ÒÔ¼°Éñ¾ÍøÂ磨Neural Networks£©µÈÀ´½â¾öÕâÀàÎÊÌâ¡£
¶ÔÓÚ¾ø´ó¶àÊý¼à¶½Ñ§Ï°ÈÎÎñ¶øÑÔ£¬×îÖØÒªµÄÒ»¸ö×é³É²¿·Ö¾ÍÊÇÑ¡È¡ÌØÕ÷¡£ËæºóºÜ¶àÄêµÄÑо¿¿ª·¢¹¤×÷ÖУ¬ÓÐÒ»²¿·Ö¾Í¼¯ÖÐÔÚ³¢ÊÔʹÓò»Í¬µÄÌØÕ÷£¬È»ºóÀ´¿´¶ÔÌá¸ß·ÖÀàµÄ¾«¶ÈÊÇ·ñÓÐЧ¹û¡£
¹ýÈ¥µÄÑо¿·´¸´Ö¤Ã÷£¬ÒÔϼ¸ÀàÌØÕ÷·Ç³£ÓÐЧ¡£
µÚÒ»ÀàÌØÕ÷¾ÍÊDzéѯ¹Ø¼ü×Ö±¾ÉíµÄÐÅÏ¢¡£±ÈÈ磬²éѯ¹Ø¼ü×ÖÖÐÒѾ°üÀ¨ÁËÒÑÖªµÄÈËÃû»òÕß¹«Ë¾Ãû£¬ÕâÖÖʱºò£¬·ÖÀà½á¹û¾Í²»Ì«¿ÉÄÜÊǽ»Ò×ÒâͼµÄÀà±ð¡£Ò²¾ÍÊÇ˵£¬²éѯ¹Ø¼ü×Ö£¬ÌرðÊÇijЩ´Ê»òÕß´Ê×éºÍÀà±ðÓÐijÖÖ¹ØÁªÐÅÏ¢£¬¶øÕâÖÖ¹ØÁªºÜ´ó³Ì¶ÈÉÏÄܱ»Ö±½Ó·´Ó³³öÀ´¡£
µÚ¶þÀàÌØÕ÷ÊÇËÑË÷ÒýÇæ·µ»ØµÄ²éѯ¹Ø¼ü×ÖÏà¹ØµÄÒ³Ãæ±¾ÉíµÄÐÅÏ¢¡£Äã¿ÉÒÔÏëÏóһϣ¬¼ÙÈçËÑË÷¡°°Â°ÍÂí¡±Õâ¸ö¹Ø¼ü×Ö£¬·µ»ØµÄÒ³Ãæ¶¼ÊÇά»ù°Ù¿ÆµÄÒ³ÃæÒÔ¼°°Â°ÍÂí»ù½ð»áµÄÒ³Ãæ£¬ÄÇôÕâÐ©Ò³ÃæÉÏÃæµÄÄÚÈÝ¿ÉÄܺÜÄѰüº¬ÈκÎÉÌÒµµÄ¹ºÂòÐÅÏ¢¡£¶ø¶ÔÓÚ¡°¼ÑÄÜÏà»ú¡±Õâ¸ö²éѯ¹Ø¼ü×Ö¶øÑÔ£¬·µ»ØµÄÒ³ÃæºÜ¿ÉÄܶ¼Êǵç×ÓÉÌÎñÍøÕ¾µÄÉÌÆ·ÐÅÏ¢£¬´Ó¶øÄܹ»¸ü¼Ó׼ȷµØÅжϡ°¼ÑÄÜÏà»ú¡±µÄ·ÖÀà¡£
µÚÈýÀàÌØÕ÷ÔòÊÇÓû§µÄÐÐΪÐÅÏ¢£¬ÄǾÍÊÇÓû§ÔÚÊäÈë²éѯ¹Ø¼ü×ÖÒÔºó»áµã»÷Ê²Ã´ÍøÕ¾£¬»áÔÚÄÄÐ©ÍøÕ¾Í£Áô¡£Ò»°ãÀ´Ëµ£¬ÄÄÐ©ÍøÕ¾µã»÷Âʸߡ¢Í£Áôʱ¼ä³¤£¬¾Í±íÃ÷ÕâÐ©ÍøÕ¾ÔÚ·µ»Ø½á¹ûÖпÉÄܸüÏà¹Ø¡£ÓÚÊÇ£¬²ÉÓÃÕâÐ©ÍøÕ¾À´×÷Ϊ²éѯ¹Ø¼ü×ÖËù´ú±íµÄÄÚÈÝ£¬¾Í¿ÉÄܸü¼Ó¿¿Æ×¡£
ÔÚʵ¼ÊµÄÓ¦ÓÃÖУ¬²éѯ¹Ø¼ü×ֵķÖÀàÍùÍù»¹ÊÇÓкܴóÄѶȵġ£ÒòΪÔÚÆÕͨµÄÏÖ´úËÑË÷ÒýÇæÉÏ£¬Ã¿Ìì¿ÉÄÜÓÐÈý·ÖÖ®Ò»¡¢ÉõÖÁ¸ü¶àµÄ¹Ø¼ü×ÖÊÇ֮ǰûÓгöÏÖ¹ýµÄ¡£Òò´Ë£¬ÈçºÎ´¦Àí´ÓÀ´Ã»ÓгöÏÖ¹ýµÄ¹Ø¼ü×Ö¡¢ÈçºÎ´¦Àí³¤Î²ÖÐµÄµÍÆµ¹Ø¼ü×Ö£¬¾Í³ÉÁËÈÃËÑË÷½á¹ûµÄ¾«¶ÈÔÙÉÏÒ»¸ǫ̈½×µÄÖØÒªÒòËØ¡£
µÚ¶þ£º²éѯ¹Ø¼ü×Ö½âÎö
¸ü¼Ó¾«Ï¸µÄ²éѯ¹Ø¼ü×ÖÀí½âÄ£¿é£º²éѯ¹Ø¼ü×Ö½âÎö£¨Parsing£©¡£
Ïà¹ØÍÆ¼ö£º