Comparative analysis of raster and vector graphics. Наукові записки. Інститут поліграфії та медійних технологій. НУ «Львівська політехніка»

Author(s)	Collection number	Pages	Download abstract	Download full text
Oliyarnyk T. I., Kudriashova A. V.	№ 2 (71)	26-32

Summary
References

The article discusses a simple but dangerous problem. CRM data sometimes contains a small admixture of records that look plausible, pass the usual checks, but quietly shift the model solutions. Since event collection occurs from forms on the website, from mobile applications, from the call center or through partner interfaces, this is where fake calls, bot traffic and false confirmations of actions most often appear, because the event is created by a person or a partner and it is difficult to distinguish it from a real signal. As a result, the system is more likely to make mistakes about who to write or call, what discount to give, how to distribute the budget. This leads to unnecessary costs, worse customer retention and lower forecast accuracy. We offer a practical CRMPGuard framework. It works on top of existing processes and does three things. First. Checks the origin and plausibility of the data and sends suspicious batches to quarantine for verification. Second. It looks for atypical clusters and individual records that have too much impact on training, and reduces their contribution. Third. It updates the model on cleaned subsets in a safe loop, compares the results in two branches, and only then returns the solution to work. All steps are recorded for audit and compliance with personal data protection requirements.

We present the results in an understandable form. If the impurity is small, the model error increases in approximately the same proportion. After cutting off suspicious examples, the error decreases. That is, even in the presence of impurities, the quality of solutions is kept under control. Example. About one percent of fake reviews appear in a small segment. The indicator increases, suspicious records are isolated, the model is retrained on cleaned data, and a check in two branches shows the restoration of the accuracy and consistency of predictions. This means fewer false contacts and discounts to the wrong customers, more stable campaign operation, and faster recovery from incidents.

Keywords: CRM models, data contamination, origin verification, anomaly detection, learning with limited exposure to suspicious records, prediction quality.

doi: 10.32403/1998-6912-2025-2-71-13-25

1. P. Paillier, “Public-Key Cryptosystems Based on Composite Degree Residuosity Classes,” in EUROCRYPT 1999, LNCS 1592, 1999.
2. T. ElGamal, “A Public Key Cryptosystem and a Signature Scheme Based on Discrete Logarithms,” IEEE Transactions on Information Theory, vol. 31, no. 4, 1985.
3. C. Gentry, “Fully Homomorphic Encryption Using Ideal Lattices,” in Proceedings of STOC, 2009.
4. C. Gentry, A Fully Homomorphic Encryption Scheme, Ph.D. thesis, Stanford University, 2009.
5. Z. Brakerski and V. Vaikuntanathan, “Fully Homomorphic Encryption from Ring-LWE and Security for Key Dependent Messages,” in CRYPTO, 2011.
6. Z. Brakerski, C. Gentry, and V. Vaikuntanathan, “(Leveled) Fully Homomorphic Encryption without Bootstrapping,” in ITCS, 2012.
7. J. Fan and F. Vercauteren, “Somewhat Practical Fully Homomorphic Encryption,” IACR ePrint 2012/144, 2012.
8. Z. Brakerski, “Fully Homomorphic Encryption without Modulus Switching from Classical GapSVP,” in CRYPTO, 2012.
9. J. H. Cheon, A. Kim, M. Kim, and Y. Song, “Homomorphic Encryption for Arithmetic of Approximate Numbers,” in ASIACRYPT, 2017.
10. L. Ducas and D. Micciancio, “FHEW: Bootstrapping Homomorphic Encryption in Less Than a Second,” in EUROCRYPT, 2015.
11. I. Chillotti, N. Gama, M. Georgieva, and M. Izabachène, “TFHE: Fast Fully Homomorphic Encryption over the Torus,” Journal of Cryptology, vol. 33, 2020.
12. S. Halevi and V. Shoup, “HElib – Algorithms and Design,” IBM Research Report, 2014–2020.
13. Microsoft Research, Microsoft SEAL: Simple Encrypted Arithmetic Library, 2015–2024.
14. C. Dwork, F. McSherry, K. Nissim, and A. Smith, “Calibrating Noise to Sensitivity in Private Data Analysis,” in TCC, 2006.
15. V. Costan and S. Devadas, “Intel SGX Explained,” Foundations and Trends in Electronic Design Automation, 2016.