基于TF-IDF的食品风险分析模型的构建与应用
作者:
作者单位:

(宁波市产品食品质量检验研究院(宁波市纤维检验所) 浙江宁波 315048)

作者简介:

通讯作者:

中图分类号:

基金项目:

国家市场监管总局科技计划项目(2019MK080,2020MK117);浙江省基础公益研究计划项目(LGC20C200013);宁波市自然科学基金项目(202003N4196,2019A610438,2019A610437);宁波市泛3315创新团队(2018B-18-C);宁波市高新精英创新团队(甬高科[2018]63号)


The Building and Applying of Food Risk Analysis Model Based on TF-IDF
Author:
Affiliation:

(Ningbo Academy of Product and Food Quality Inspection (Ningbo Fibre Inspection Institute)Ningbo 315048, Zhejiang)

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    食品检测数据作为食品风险分析的重要工具,针对同类食品所做检测项目不同而导致最终的数据矩阵部分缺失,且已有的食品检测数据大部分为未检出等问题,通过引入词频-逆文档频率(term frequency-inverse document frequency,TF-IDF)的权重确定办法,构建一种新型的食品风险分析模型。本文以2019-2020年为时间段,收集某市食用农产品的蔬菜样本抽检信息作为分析数据,通过模型计算得到蔬菜中各样品的风险指数。结果显示:2019-2020年间检测的蔬菜产品中,风险指数高的为韭菜和芹菜,超标指数为毒死蜱,在监管中需加强关注,而其余蔬菜大多呈现低风险情况。本分析模型相较于其它传统分析方法,能给出具体的风险指数,在评价上具有直观性,且当数据样本越大,评价效果越好。同时,本模型基于信息理论来设置权重,消除了主观因素在评价中的影响,在应对多样化食品数据时更具有实用性。模型的建立在大数据的时代背景下,对于深入研究食品安全风险及其评价方法新路径提供一个新思路。

    Abstract:

    Food testing data is an important tool for food risk analysis. The final data matrix is missing due to different testing items for similar foods, and most of the existing food testing data is undetected. Through the introduction of TF-IDF (The term frequency-inverse document frequency) weight determination method has constructed a new type of food risk analysis model. This paper uses the sampling information of vegetable samples of edible agricultural products in a city from 2019 to 2020 as the research data, and calculates the risk index of each sample in the vegetable through the model. The results show that among the vegetable products tested from 2019 to 2020, the high-risk index is leeks and celery, and the over-standard index is chlorpyrifos, which needs to be paid more attention in supervision, while most of the remaining vegetables are low-risk. Compared with other traditional analysis methods, this analysis model can give a specific risk index, is intuitive in evaluation, and shows better evaluation performance in big data analysis. At the same time, this model sets weights in an objective and universal mode, which eliminates the influence of subjective factors in the evaluation and further enhances the practicability in diversified data analysis. The model is established in the context of the era of big data, and provides a new way of thinking for further in-depth research and exploration of new paths for food safety risks and evaluation methods.

    参考文献
    相似文献
    引证文献
引用本文

姚振民,邢家溧,承海,郑睿行,毛玲燕,徐晓蓉,张书芬,沈坚.基于TF-IDF的食品风险分析模型的构建与应用[J].中国食品学报,2022,22(12):324-331

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2021-12-23
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2023-01-09
  • 出版日期:
版权所有 :《中国食品学报》杂志社     京ICP备09084417号-4
地址 :北京市海淀区阜成路北三街8号9层      邮政编码 :100048
电话 :010-65223596 65265375      电子邮箱 :chinaspxb@vip.163.com
技术支持:北京勤云科技发展有限公司

漂浮通知