Authors :
Damla Demir; Gökçe Karahan Adalı
Volume/Issue :
Volume 10 - 2025, Issue 12 - December
Google Scholar :
https://tinyurl.com/576nam3b
Scribd :
https://tinyurl.com/y6ktuc8a
DOI :
https://doi.org/10.38124/ijisrt/25dec513
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Note : Google Scholar may take 30 to 40 days to display the article.
Abstract :
The rapid rise in e-commerce has forced companies to have good knowledge of customer behavior and tailor the
marketing strategies accordingly. This study discusses the appropriateness of the K-Means algorithm for customer
segmentation from behavioral and demographic data obtained by a systematic Likert-scale survey. Clusters with high
interpretability were obtained and validated through silhouette analysis with values up to 0.75, indicating high internal
consistency. Key findings show that female interviewees prefer shopping by mobile to a far greater extent than male
interviewees, while male interviewees are more responsive to promotional emails and SMS. Younger and middle-aged
users are similarly more susceptible to social media advertising, with older segments having more neutral or selective
orientations. These results illustrate the complexity of customers' behavior and that demographic and behavioral data
should be combined in segmentation studies. By its demonstration of the value of clustering techniques in providing
insightful customer profiles, this study contributes to practical and methodological applications to data-driven decision-
making in e-commerce. Future research is encouraged to expand the dataset size and incorporate more advanced methods
such as predictive modeling and sentiment analysis to further improve segmentation precision.
Keywords :
K-Means Clustering, Customer Segmentation, E-commerce Analytics.
References :
- Aggarwal, D. (2023). Exploring the Role of AI for Online Grocery Shopping through Enhancing Personalized Recommendations and Customer Segmentation. Technoarete Transactions On Advances In Computer Applications (TTACA), 2(3).
- Alzahrani, R., Habib, M., & Khan, M. A. (2022). Student Satisfaction Clustering using K-Means with Orange Data Mining Tool. International Journal of Advanced Computer Science and Applications, 13(2), 34-40.
- Analytics Vidhya (2025). What is k-means clustering? https://www.analyticsvidhya.com/blog/2019/08/comprehensive-guide-k-means-clustering/ Accessed on 02.03.2025)
- Cooil, B., Aksoy, L., & Keiningham, T. L. (2008). Approaches to Customer Segmentation. Journal of Relationship Marketing, 6(3–4), 9–39. https://doi.org/10.1300/J366v06n03_02
- Daruvuri, S., Raj, A., & Tripathi, M. (2025). Improving customer segmentation through PCA-enhanced K-Means clustering in online retail datasets. Journal of Big Data, 12(1), 48. https://journalofbigdata.springeropen.com/articles/10.1186/s40537-025-01111-y
- García-Murillo, M., & Annabi, H. (2002). Customer Knowledge Management. Journal of the Operational Research Society, 53(8), 875–884.
- Google Developers. (2024). What is clustering? Accessed on 19.03.2025, https://developers.google.com/machine-learning/clustering/overview
- IBM, (2025, March 02) K-means clustering. https://www.ibm.com/think/topics/k-means-clustering#:~:text=K-means%20is%20an%20iterative,the%20characteristics%20of%20the%20data
- John, I., Shobayo, O., & Ogunleye, O. (2024). Clustering customer segments in UK retail using RFM and machine learning algorithms. ArXiv preprint. https://arxiv.org/abs/2402.04103
- Kashwan, K. R. & C M, Velu. (2013). Customer Segmentation Using Clustering and Data Mining Techniques. International Journal of Computer Theory and Engineering. 5. 856-861. 10.7763/IJCTE.2013.V5.811.
- Kirubakaran, Vignesh Saravanan & G, Sakthi. (2025). Customer Segmentation using Clustering Techniques: Data-Driven Approach to Enhance Marketing Strategy. 10.1109/ICSCNA63714.2024.10864053.
- Kuhn, Max & Johnson, Kjell. (2013). Applied Predictive Modeling. 10.1007/978-1-4614-6849-3.
- Lau, M. M., Cheung, R., Lam, A. Y., & Chu, Y. T. (2019). Segmenting online consumers by motivations for using e-commerce platforms. Journal of Retailing and Consumer Services, 47, 70-78.
- Nimbalkar, A. A., & Berad, A. T. (2021). The increasing importance of AI applications in e-commerce. Vidyabharati International Interdisciplinary Research Journal, 13(1), 388-391.
- Nugroho, M. A., Darmawan, M. A., & Yudhistira, A. (2024). Customer segmentation using optimized K-Means clustering in retail data. Journal of Intelligent Data Science and Systems, 4(2), 112–120. https://www.idss.iocspublisher.org/index.php/jidss/article/view/236
- Rahm, E., & Do, H. H. (2000). Data cleaning: Problems and current approaches. IEEE Data Eng. Bull., 23(4), 3-13.
- Rimakka, H., Öztürk, A., & Yilmaz, S. (2023). User segmentation based on purchasing habits and preferences on the Amazon platform using K-Means clustering. International Journal of Data Science Research, 6(3), 201–210. https://www.researchgate.net/publication/377972483
- Semoglou, V., Papakostas, G. A., & Likas, A. (2025). K-Sil: A silhouette-based improvement of K-Means clustering for better customer segmentation. Expert Systems with Applications, 233, 120900. https://doi.org/10.1016/j.eswa.2025.120900
- Syakur, M. A., Khotimah, B. K., Rochman, E. M. S., & Satoto, B. D. (2018). Integration of K-Means clustering method and elbow method for identification of the best customer profile cluster. IOP Conference Series: Materials Science and Engineering, 336(1), 012017. https://doi.org/10.1088/1757-899X/336/1/012017
- Tuma, M. N., Decker, R., & Scholz, S. W. (2011). A survey-based customer segmentation using clustering of latent class probabilities. European Journal of Operational Research, 213(3), 564-572.
- Wang, T. (2025). Reinforcement learning-based hybrid clustering for customer behavior analysis. Applied Soft Computing, 145, 110401. https://doi.org/10.1016/j.asoc.2025.110401
- Wirth, R., & Hipp, J. (2000, April). CRISP-DM: Towards a standard process model for data mining. In Proceedings of the 4th international conference on the practical applications of knowledge discovery and data mining (Vol. 1, pp. 29-39).
- Wu, R. S., & Chou, P. H. (2011). Customer segmentation of multiple category data in e-commerce using a soft-clustering approach. Electronic Commerce Research and Applications, 10(3), 331-341. https://doi.org/10.1016/j.elerap.2010.11.002
- Zhou, Y., Cheng, L., & Xu, H. (2020). Personalized Recommendation System Based on Customer Segmentation Using K-means Clustering and PCA. Procedia Computer Science, 174, 838-843
The rapid rise in e-commerce has forced companies to have good knowledge of customer behavior and tailor the
marketing strategies accordingly. This study discusses the appropriateness of the K-Means algorithm for customer
segmentation from behavioral and demographic data obtained by a systematic Likert-scale survey. Clusters with high
interpretability were obtained and validated through silhouette analysis with values up to 0.75, indicating high internal
consistency. Key findings show that female interviewees prefer shopping by mobile to a far greater extent than male
interviewees, while male interviewees are more responsive to promotional emails and SMS. Younger and middle-aged
users are similarly more susceptible to social media advertising, with older segments having more neutral or selective
orientations. These results illustrate the complexity of customers' behavior and that demographic and behavioral data
should be combined in segmentation studies. By its demonstration of the value of clustering techniques in providing
insightful customer profiles, this study contributes to practical and methodological applications to data-driven decision-
making in e-commerce. Future research is encouraged to expand the dataset size and incorporate more advanced methods
such as predictive modeling and sentiment analysis to further improve segmentation precision.
Keywords :
K-Means Clustering, Customer Segmentation, E-commerce Analytics.