The use of a semantic neural network in the tasks of analyzing the quality of the layout process of a book edition. Наукові записки. Інститут поліграфії та медійних технологій. НУ «Львівська політехніка»

Author(s)	Collection number	Pages	Download abstract	Download full text
Плахтина З. І., Selmenska Z. M.	№ 2 (69)	91-101

Summary
References

This research presents an innovative approach to developing intelligent content moderation systems for electronic publications through the integration of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) architecture. The study addresses critical challenges in automated content moderation, focusing on the detection of misinformation, manipulative content, and potentially harmful materials. The proposed system combines the contextual understanding capabilities of LLM with RAG’s ability to access and utilize current information, creating a more accurate and adaptable moderation solution.

The research examines the practical aspects of implementing such systems in modern electronic publications and analyzes the results of real-world testing. The methodology includes a comprehensive evaluation of system performance across various content types, demonstrating significant improvements in moderation accuracy and efficiency. Special attention is paid to the system’s self-learning capabilities and its ability to adapt to new types of content and threats.

The paper also explores the economic efficiency of implementing automated moderation systems, presenting data on operational cost reduction and improvement in publication workflow. The results show substantial reduction in manual moderation requirements while maintaining high accuracy standards, particularly in detecting complex violation cases such as hidden advertising and sophisticated forms of misinformation. The findings contribute to the ongoing development of content management technologies and offer practical solutions for modern digital publishing challenges.

The system described in this paper represents a significant advancement in content moderation technology, offering both theoretical insights and practical applications for the digital publishing industry. Its implementation demonstrates the potential for improving content quality and safety in the modern information space while maintaining operational efficiency.

Keywords: content moderation, automated moderation systems, machine learning algorithms, large language models (LLM), RAG architecture, hybrid moderation systems, semantic text analysis, fact-checking.

doi: 10.32403/1998-6912-2024-2-69-82-90

1. Gorwa-Ciesielska, M., & Marwick, A. E. (2021). Online content moderation: A review of research on social media platforms. New Media & Society, 23(10), 2821–2839.
2. Young, S., Hazarika, D., Poria, S., & Cambria, E. (2018). Recent advances in natural language processing via large pre-trained language models. arXiv preprint arXiv:1810.04805.
3. Reich, J., & Palacios, D. (2020). «Machine Learning for Content Moderation: A Systematic Literature Review». ACM Computing Surveys, 53(5), 1-37.
4. Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2019). «BERT: Pretraining of Deep Bidirectional Transformers for Language Understanding». Proceedings of NAACL-HLT 2019, 4171-4186.
5. Brown, T. B., Mann, B., Ryder, N., et al. (2020). «Language Models are Few-Shot Learners». arXiv preprint arXiv:2005.14165.
6. Weaviate. (2023). «Building RAG-based Applications with Weaviate». Weaviate Documentation. https://weaviate.io/developers/weaviate/current/retrieval-with-rag.html
7. Lewis, P., Perez, E., Piktus, A., et al. (2020). «Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks». Advances in Neural Information Processing Systems, 33, 9459-9474.
8. Chowdhery, A., Narang, S., Devlin, J., et al. (2022). «PaLM: Scaling Language Modeling with Pathways». arXiv preprint arXiv:2204.02311.
9. Gillespie, T. (2020). «Content moderation, AI, and the question of scale». Big Data & Society, 7(2), 2053951720943234.
10. Gorwa, R., Binns, R., & Katzenbach, C. (2020). «Algorithmic content moderation: Technical and political challenges in the automation of platform governance». Big Data & Society, 7(1), 2053951719897945.
11. Cushing, E. (2022). «The Economic Impact of Content Moderation». MIT Technology Review. https://www.technologyreview.com/2022/04/15/content-moderation-economics/.