Machine learning approach to auto-tagging online content for content marketing efficiency: A comparative analysis between methods and content type
Rights© 2019 Elsevier. Reproduced in accordance with the publisher's self-archiving policy. This manuscript version is made available under the CC-BY-NC-ND 4.0 license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
MetadataShow full item record
AbstractAs complex data becomes the norm, greater understanding of machine learning (ML) applications is needed for content marketers. Unstructured data, scattered across platforms in multiple forms, impedes performance and user experience. Automated classification offers a solution to this. We compare three state-of-the-art ML techniques for multilabel classification - Random Forest, K-Nearest Neighbor, and Neural Network - to automatically tag and classify online news articles. Neural Network performs the best, yielding an F1 Score of 70% and provides satisfactory cross-platform applicability on the same organisation's YouTube content. The developed model can automatically label 99.6% of the unlabelled website and 96.1% of the unlabelled YouTube content. Thus, we contribute to marketing literature via comparative evaluation of ML models for multilabel content classification, and cross-channel validation for a different type of content. Results suggest that organisations may optimise ML to auto-tag content across various platforms, opening avenues for aggregated analyses of content performance.
CitationSalminen J, Yoganathan V, Corporan J et al (2019) Machine learning approach to auto-tagging online content for content marketing efficiency: A comparative analysis between methods and content type. Journal of Business Research. 101: 203-217.
Link to publisher’s versionhttps://doi.org/10.1016/j.jbusres.2019.04.018
Showing items related by title, author, creator and subject.
Electronic word of mouth in social media: the common characteristics of retweeted and favourited marketer-generated content posted on TwitterAlboqami, H.; Al-Karaghouli, W.; Baeshen, Y.; Erkan, I.; Evans, C.; Ghoneim, Ahmad (2015)Marketers desire to utilise electronic word of mouth (eWOM) marketing on social media sites. However, not all online content generated by marketers has the same effect on consumers; some of them are effective while others are not. This paper aims to examine different characteristics of marketer-generated content (MGC) that of which one lead users to eWOM. Twitter was chosen as one of the leading social media sites and a content analysis approach was employed to identify the common characteristics of retweeted and favourited tweets. 2,780 tweets from six companies (Booking, Hostelworld, Hotels, Lastminute, Laterooms and Priceline) operating in the tourism sector are analysed. Results indicate that the posts which contain pictures, hyperlinks, product or service information, direct answers to customers and brand centrality are more likely to be retweeted and favourited by users. The findings present the main eWOM drivers for MGC in social media.
Critical values for Lawshe's content validity ratio: revisiting the original methods of calculationAyre, Colin A.; Scally, Andy J. (2014-01)The content validity ratio originally proposed by Lawshe is widely used to quantify content validity and yet methods used to calculate the original critical values were never reported. Methods for original calculation of critical values are suggested along with tables of exact binomial probabilities.
Video extraction for fast content access to MPEG compressed videosJiang, Jianmin; Weng, Y. (2009-06-09)As existing video processing technology is primarily developed in the pixel domain yet digital video is stored in compressed format, any application of those techniques to compressed videos would require decompression. For discrete cosine transform (DCT)-based MPEG compressed videos, the computing cost of standard row-by-row and column-by-column inverse DCT (IDCT) transforms for a block of 8 8 elements requires 4096 multiplications and 4032 additions, although practical implementation only requires 1024 multiplications and 896 additions. In this paper, we propose a new algorithm to extract videos directly from MPEG compressed domain (DCT domain) without full IDCT, which is described in three extraction schemes: 1) video extraction in 2 2 blocks with four coefficients; 2) video extraction in 4 4 blocks with four DCT coefficients; and 3) video extraction in 4 4 blocks with nine DCT coefficients. The computing cost incurred only requires 8 additions and no multiplication for the first scheme, 2 multiplication and 28 additions for the second scheme, and 47 additions (no multiplication) for the third scheme. Extensive experiments were carried out, and the results reveal that: 1) the extracted video maintains competitive quality in terms of visual perception and inspection and 2) the extracted videos preserve the content well in comparison with those fully decompressed ones in terms of histogram measurement. As a result, the proposed algorithm will provide useful tools in bridging the gap between pixel domain and compressed domain to facilitate content analysis with low latency and high efficiency such as those applications in surveillance videos, interactive multimedia, and image processing.