第一范文网 - 专业文章范例文档资料分享平台

Southampton and The Open University. Preface(17)

来源:用户分享 时间:2021-06-02 本文由揽清幽 分享 下载这篇文档 手机版
说明:文章内容仅供预览,部分内容可能不全,需要完整文档或者需要复制内容,请下载word后使用。下载word有问题请添加微信号:xxxxxx或QQ:xxxxxx 处理(尽可能给您提供完整文档),感谢您的支持与谅解。

morepotentialknowledgewhichmustthenbevalidated.

EvidenceBuildingandValidation-comesfromtheseriesofweakap-

proacheseachofwhicharecombinedtogivearatingwhichcanbeusedto

validateknowledge.Firstlyadditionalredundantsourcesofthesamedataare

identi edviaasimplesearch,e.g.google ndingrelevantURLs.Thisredun-

dancybyitselfisnolongerconsideredvalidevidenceasthereliabilityofsources

themselvesmustalsobetakenintoaccount,e.g.asourcewithlotsofpoten-

tialacademicsisconsideredabettersourcethanasourcedetailingjustasingle

academic.

Theratingofsourcesin uencesArmadillo’sperceivedvalidityofthepoten-

ingthepreviouslymentionedSimMetriclibrary,section

3,theextractedentitiesarecrossexaminedforsimplesimilarities:ifsomething

issuitablydissimilarfromtherest,forexampleafailedcapture40timeslonger

thannormal,itisconsideredmorepossiblyanerroranditsvalidityratingis

decreasedaswellastheratingofthesourcefromwhichitwasfound.Atpresent

thecrosssimilaritytestsareacombinationofSimMetric’ssimilaritymetrics,

althoughmorecomplexapproachescouldbeemployed.Thecontextarounda

capturecanalsodetaillikelihood,e.g.thesimilarityofPartofSpeechtagssur-

roundinginstances,variousothersimilaritytechniquestoprovidefurtherweak

evidencecouldbeused,forexampleavectorspacemodelofthesource(e.g.web-

page)similarity,acomparisonofsimilaritiesinalinkanalysisofdatasourcesor

ananalysisofthesourcedocumentsDOMstructureanditssimilaritytoother

sources.

Thiscombinationofmultipleweaktechniquescanprovideimprovedcon-

denceintheextractedknowledgeArmadillo nds,(againalthoughnotyet

implementeditisenvisionedtoprioritisetechniquesthroughInformationGain

andmemoryortimecosts).

Extractionofpotentialknowledge-armadilloasinpreviouspapers[5]inte-

gratestheAmilcare[2](LP)2algorithmtoextractpotentialnewknowledge,but

canbeextendedtoencompassdi erentMLalgorithms,forexampleT-Rex[7].

Usingratingsfromtheevidencebuildingapproach(Section4.3)theInformation

Extractionlearnsitscontextualrulesonthemosthighlyratedsourcesusingthe

mostlikelyinstancesasseeddata.

搜索“diyifanwen.net”或“第一范文网”即可找到本站免费阅读全部范文。收藏本站方便下次阅读,第一范文网,提供最新人文社科Southampton and The Open University. Preface(17)全文阅读和word下载服务。

Southampton and The Open University. Preface(17).doc 将本文的Word文档下载到电脑,方便复制、编辑、收藏和打印
本文链接:https://www.diyifanwen.net/wenku/1192501.html(转载请注明文章来源)
热门推荐
Copyright © 2018-2022 第一范文网 版权所有 免责声明 | 联系我们
声明 :本网站尊重并保护知识产权,根据《信息网络传播权保护条例》,如果我们转载的作品侵犯了您的权利,请在一个月内通知我们,我们会及时删除。
客服QQ:xxxxxx 邮箱:xxxxxx@qq.com
渝ICP备2023013149号
Top