Information and Communications Technology and Policy

Information and Communications Technology and Policy

Information and Communications Technology and Policy ›› 2019, Vol. 45 ›› Issue (7): 44-50.

Previous Articles     Next Articles

Research on a semi-automatic labeling method for machine learning data sets

  

  • Online:2019-07-15 Published:2020-11-26

Abstract: Based on the teacher- student model, a semi-automatic annotation method for datasets was proposed, which solved the problem of large workload of dataset manual annotation, different data quality and high professional threshold in supervised learning. In the cloud experiment, the annotation method was used to realize the semi-automatic labeling of the clock synchronization pattern classification data. On the other hand, the automatic evaluation of the difficulty of the data set was realized, which can be used to guide the optimization and evaluation of the machine learning model.

Key words: machine learning, data annotation, teacher-student model