ÁúÔ´ÆÚ¿¯Íø http://www.qikan.com.cn
Îı¾ÌØÕ÷ÌáÈ¡·½·¨Ñо¿×ÛÊö
×÷ÕߣºÐì¹Ú»ª ÕÔ¾°Ðã ÑîºìÑÇ Áõˬ À´Ô´£º¡¶Èí¼þµ¼¿¯¡·2018ÄêµÚ05ÆÚ
Õª Òª£ºÌØÕ÷ÌáÈ¡ÊÇÎı¾ÍÚ¾ò¡¢ÐÅÏ¢¼ìË÷¡¢×ÔÈ»ÓïÑÔ´¦Àí£¨NLP£©¡¢Îı¾Çé¸Ð·ÖÎö¡¢ÍøÂçÓßÇé·ÖÎöµÈÁìÓòµÄÑо¿ÈÈµã¡£ÌØÕ÷ÌáÈ¡×÷ΪÎı¾ÍÚ¾òϵͳµÄÖ÷ÒªÒòËØ£¬Îı¾ÌØÕ÷ÌáÈ¡ÐÔÄÜÊÇÎı¾·ÖÀà½á¹ûµÄÖØÒªÐÔ¶ÈÁ¿¡£´ÓÁ½·½Ãæ¶ÔÌØÕ÷Ñ¡ÔñËã·¨½øÐÐ×ܽᣬ·ÖÎö¹úÄÚÍâ¶Ô³£ÓÃÌØÕ÷ÌáÈ¡Ëã·¨µÄ¸Ä½øºÍ´´Ð£¬×îºóÕë¶ÔÓ°ÏìÌØÕ÷ÌáÈ¡µÄÒòËØ£¬Ö¸³öÔÚʵ¼ÊÓ¦ÓÃÖÐÓ¦¿¼ÂǵÄÎÊÌâ¡£ ¹Ø¼ü´Ê£ºÌØÕ÷ÌáÈ¡£»¾àÀë²â¶È£»ÐÅÏ¢²â¶È DOI£º10.11907/rjdk.172617 ÖÐͼ·ÖÀàºÅ£ºTP-0
ÎÄÏ×±êʶÂ룺A ÎÄÕ±àºÅ£º1672-7800£¨2018£©005-0013-06
Abstract£ºFeature extraction is the research focus of text mining£¬ information retrieval£¬ Natural Language Processing £¨NLP£©£¬ text sentiment analysis£¬ network public opinion
analysis£¬ etc. Feature extraction is the main factor of text mining system£¬ and the performance of text feature extraction is the important measurement of text categorization results. This paper
summarizes two kinds of feature selection algorithms£¬ and analyzes the improvement and innovation of common feature extraction algorithms at home and abroad. Finally£¬ it points out issues which should be taken into account in practical application influenced by feature extraction. Key Words£ºfeature extraction£» distance measure£» information measure 0 ÒýÑÔ
Ëæ×Å»¥ÁªÍøµÄ·¢Õ¹£¬ÒÔ¼°¼ÆËã»úºÍÐÅÏ¢¼¼ÊõµÄ²»¶Ï¸üл»´ú£¬ÍøÂçÉÏ´æ´¢µÄÐÅÏ¢Ô½À´Ô½·á¸»¡£Îı¾×÷ΪÐÅÏ¢µÄÓÐЧ±íÏÖÐÎʽ£¬ÊýÁ¿Ò²Ôö³¤Ñ¸ËÙ¡£½üÄêÀ´£¬Ëæ×ÅÔÆ¼ÆËãºÍ´óÊý¾ÝµÄÐËÆð£¬Ê¹µÃº£Á¿µÄÎı¾ÐÅÏ¢µÃµ½ÓÐЧµÄ×éÖ¯ºÍ¹ÜÀí¡£ÈçºÎ¸ßЧ¡¢×¼È·µØ»ñÈ¡ÓÐЧÐÅÏ¢³ÉΪÎı¾ÍÚ¾ò¡¢ÐÅÏ¢¼ìË÷¡¢ÍøÂçÓßÇé·ÖÎöµÈ¹¤×÷µÄÖ÷ҪĿµÄ¡£
ÍøÂçÎı¾ÐÅÏ¢ÓбðÓÚ´«Í³Îı¾ÐÅÏ¢£¬¾ßÓжàÑùÐÔ¡¢¸´ÔÓÐÔ¡¢ÈßÓàÐÔ¡¢²»¹æ·¶ÐÔµÈÌØµã¡£Òò´Ë£¬¶ÔÎı¾¸ßά¶ÈµÄ¸´ÔÓÌØÕ÷¿Õ¼ä½øÐÐÌØÕ÷½µÎ¬³ÉΪÎı¾·ÖÀàµÄÖ÷Òª¹Ø¼üµã¡£ÌØÕ÷ÌáÈ¡[1]µÄÄ¿µÄÊǶԳõʼ¸ßÎ¬ÌØÕ÷½øÐÐÓÐЧ½µÎ¬£¬´Ó¸ßÎ¬ÌØÕ÷¿Õ¼äÖÐÑ¡Ôñ³öÒ»¸ö×îÓÅÌØÕ÷×Ó¼¯¡£¸ù¾Ý×îÓÅÌØÕ÷×Ó¼¯µÄ²úÉú¹ý³Ì»®·Ö¹éÄÉÌØÕ÷ÌáÈ¡·½·¨£¬¿É½«ÌØÕ÷ÌáÈ¡·½·¨·ÖΪÁ½´óÀࣺFilter¹ýÂËʽºÍWrapper·âװʽ[2]¡£
Ïà¹ØÍÆ¼ö£º