Clustering and Forecasting Beef Prices: WPGMA and Time Series Regression Integration
DOI:
https://doi.org/10.15649/2346075X.5312Keywords:
Beef Price, Calendar Variation, PCA, TSR, WPGMAAbstract
Introduction. Beef is the second most consumed animal protein in Indonesia after chicken. Its demand increases during Eid al-Fitr and Eid al Adha, driving seasonal price spikes. These fluctuations challenge the government in maintaining price stability, with regional differences further complicating policy decisions. Objectives. This study aims to group Indonesian provinces by monthly beef price trends using time series clustering and to perform group-level forecasting with Time Series Regression that incorporates the effects of Eid al-Fitr and Eid al Adha. This approach simplifies modeling complexity and supports efficient policy-making for beef price stabilization. Materials and Methods. Principal Component Analysis (PCA) was used to optimize the dataset before clustering using the Weighted Pair Group Method with Arithmetic Mean (WPGMA) algorithm, combined with the Autocorrelation-Function (ACF) Distance similarity measure. Time Series Regression (TSR) models with dummy variables for Eid al-Fitr and Eid alAdha effects were then applied to each cluster for forecasting. Results. The analysis identified five optimal clusters, with a silhouette coefficient value of 0.61. Forecasting within each cluster showed in sample MAPE values ranging from 1.05% to 5.15%. All clusters exhibited an increasing trend in beef prices from September 2023 to August 2024, highlighting the impact of calendar events and regional characteristics on price dynamics. Conclusions. The clustering-based forecasting approach effectively simplifies price analysis across regions, supporting targeted policy interventions and improving the accuracy of beef price predictions in Indonesia.
References
1. Firmansyah, Maruli P, Harahap A. Analysis of beef market integration between consumer and producer regions in Indonesia. Open Agric 2023;8. https://doi.org/10.1515/opag-2022-022
2. Kusumaningrum R, Darjanto A, Nurmalina R, Mulatsih S, Suprehatin. Effect of Import Policy on Beef Supply and Demand in Indonesia before and after the COVID-19 Pandemic. Tropical Animal Science Journal 2024;47:242–51. https://doi.org/10.5398/TASJ.2024.47.2.242
3. Hadi SN, Chung RH. Estimation of Demand for Beef Imports in Indonesia: An Autoregressive Distributed Lag (ARDL) Approach. Agriculture (Switzerland) 2022;12. https://doi.org/10.3390/agriculture12081212
4. Kousiga T, Vadivu RS. Hierarchical clustering algorithms in data mining. Int J Sci Dev Res. 2019;4:1–3.
5. Sokhonn L, Park YS, Lee MK. Hierarchical clustering via single and complete linkage using fully homomorphic encryption. Sensors. 2024;24. https://doi.org/10.3390/s24154826
6. Zhou S, Liu F, Song W. Estimating the optimal number of clusters via internal validity index. Neural Process Lett. 2021;53:1013–34. https://doi.org/10.1007/s11063-021-10427-8
7. Thrun MC, Stier Q. Fundamental clustering algorithms suite. SoftwareX. 2021;13:100642. https://doi.org/10.1016/j.softx.2020.100642
8. Radovanovic A, Li J, Milanovic JV, Milosavljevic N, Storchi R. Application of agglomerative hierarchical clustering for clustering of time series data. IEEE PES Innov Smart Grid Technol Conf Eur. 2020;2020-Oct:640–4. https://doi.org/10.1109/ISGT-Europe47291.2020.9248759
9. Zhang Y, Shi Q, Zhu J, Peng J, Li H. Time series clustering with topological and geometric mixed distance. Mathematics. 2021;9. https://doi.org/10.3390/math9091046
10. Wei WWS. Time series analysis: univariate and multivariate methods. 2nd ed. New York: Pearson Education; 2006.
11. Rahman NHA, Lee MH, Suhartono, Latif MT. Artificial neural networks and fuzzy time series forecasting: an application to air quality. Qual Quant. 2015;49:2633–47. https://doi.org/10.1007/s11135-014-0132-6
12. Monika P, Ruchjana BN, Abdullah AS, Budiarto R. Integration of GSTARIMA model with heteroskedastic error and kriging for climate forecasting: a systematic review. Appl Math Inf Sci. 2024;18:551–67. https://doi.org/10.18576/amis/180307
13. Ahmed SE, Aydin D, Yilmaz E. Semiparametric time-series model using local polynomial: an application on the effects of financial risk factors on crop yield. J Risk Financ Manag. 2022;15. https://doi.org/10.3390/jrfm15030141
14. Montgomery DC, Friedman DJ. Prediction using regression models with multicollinear predictor variables. IIE Trans. 1993;25:73–85. https://doi.org/10.1080/07408179308964293
15. Arashi M, Norouzirad M, Roozbeh M, Mamode Khan N. A high-dimensional counterpart for the ridge estimator in multicollinear situations. Mathematics. 2021;9. https://doi.org/10.3390/math9233057
16. Ismail NH, Ahmad N, Mohamed NA, Tahar MR. Analysis of geomagnetic ap index on worldwide earthquake occurrence using principal component analysis and hierarchical cluster analysis. Sains Malays. 2021;50:1157–64. https://doi.org/10.17576/jsm-2021-5004-25
17. Maugeri A, Barchitta M, Favara G, La Mastra C, La Rosa MC, Magnano San Lio R, et al. Application of clustering on principal components for nutritional epidemiology: a workflow to derive dietary patterns. Nutrients. 2023;15. https://doi.org/10.3390/nu15010195
18. Wang T, Jing Z, Zhang S, Qiu C. Utilizing principal component analysis and hierarchical clustering to develop driving cycles: a case study in Zhenjiang. Sustainability (Switzerland). 2023;15. https://doi.org/10.3390/su15064845
19. Jombart T, Devillard S, Balloux F. Discriminant analysis of principal components: a new method for the analysis of genetically structured populations. BMC Genet. 2010;11. https://doi.org/10.1186/1471-2156-11-94
20. Chan JY Le, Leow SMH, Bea KT, Cheng WK, Phoong SW, Hong ZW, et al. Mitigating the multicollinearity problem and its machine learning approach: a review. Mathematics. 2022;10. https://doi.org/10.3390/math10081283
21. Penkova TG. Principal component analysis and cluster analysis for evaluating the natural and anthropogenic territory safety. Procedia Comput Sci. 2017;112:99–108. https://doi.org/10.1016/j.procs.2017.08.179
22. Contreras JM, Molina Portillo E, Fernández Luna JM. Evaluation of hierarchical clustering methodologies for identifying patterns in timeout requests in EuroLeague basketball. Mathematics. 2025;13. https://doi.org/10.3390/math13152414
23. Saraçli S, Doǧan N, Doǧan I. Comparison of hierarchical cluster analysis methods by cophenetic correlation. J Inequal Appl. 2013;2013. https://doi.org/10.1186/1029-242X-2013-203
24. Xie WB, Lee YL, Wang C, Chen DB, Zhou T. Hierarchical clustering supported by reciprocal nearest neighbors. Inf Sci. 2020;527:279–92. https://doi.org/10.1016/j.ins.2020.04.016
25. Vijaya, Aayushi S, Bateja R. A review on hierarchical clustering algorithms. J Eng Appl Sci. 2017;12:7501–7.
26. Lu G, Tian Z, Chen P, Liang Z, Zeng X, Zhao Y, et al. Comprehensive morphological and molecular insights into drought tolerance variation at germination stage in Brassica napus accessions. Plants. 2024;13. https://doi.org/10.3390/plants13233296
27. Dani ATR, Wahyuningsih S, Rizki NA. Penerapan hierarchical clustering metode agglomerative pada data runtun waktu. Jambura J Math. 2019;1:64–78. https://doi.org/10.34312/jjom.v1i2.2354
28. Herden T, Bunzel S. Archetypes of supply chain analytics initiatives—an exploratory study. Logistics. 2018;2:10. https://doi.org/10.3390/logistics2020010
29. Rousseeuw PJ. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math. 1987;20:53–65. https://doi.org/10.1016/0377-0427(87)90125-7
30. Shutaywi M, Kachouie NN. Silhouette analysis for performance evaluation in machine learning with applications to clustering. Entropy. 2021;23. https://doi.org/10.3390/e23060759
31. Babii A, Ghysels E, Striaukas J. Machine learning time series regressions with an application to nowcasting. J Bus Econ Stat. 2022;40:1094–106. https://doi.org/10.1080/07350015.2021.1899933
32. Suhartono, Ashari DE, Prastyo DD, Kuswanto H, Lee MH. Deep neural network for forecasting inflow and outflow in Indonesia. Sains Malays. 2019;48:1787–98. https://doi.org/10.17576/jsm-2019-4808-26
33. Xu W, Peng H, Zeng X, Zhou F, Tian X, Peng X. A hybrid modelling method for time series forecasting based on a linear regression model and deep learning. Appl Intell. 2019;49:3002–15. https://doi.org/10.1007/s10489-019-01426-3
34. Budiantara IN, Ratnasari V, Ratna M, Zain I. The combination of spline and kernel estimator for nonparametric regression and its properties. Appl Math Sci. 2015;9:6083–94. https://doi.org/10.12988/ams.2015.58517
35. Ratnasari V, Budiantara N, Dani ATR. Nonparametric regression mixed estimators of truncated spline and Gaussian kernel based on CV, GCV, and UBR methods. Int J Adv Sci Eng Inf Technol. 2021;11:2400–6. https://doi.org/10.18517/ijaseit.11.6.14464
36. Draper NR, Smith H. Applied regression analysis. 3rd ed. Canada: John Wiley & Sons; 1998.
37. Montgomery DC, Peck EA, Vining GG. Introduction to linear regression analysis. 5th ed. Canada: John Wiley & Sons; 2012.
38. Fitriyah AT, Chamidah N, Saifudin T. Prediction of paddy production in Indonesia using semiparametric time series regression least square spline estimator. Data Metadata. 2025;4:1–12. https://doi.org/10.56294/dm2025527
39. Fibriyani V, Chamidah N, Saifudin T. Estimating time series semiparametric regression model using local polynomial estimator for predicting inflation rate in Indonesia. J King Saud Univ Sci. 2024;36. https://doi.org/10.1016/j.jksus.2024.103549
40. Alabdulrazzaq H, Alenezi MN, Rawajfih Y, Alghannam BA, Al-Hassan AA, Al-Anzi FS. On the accuracy of ARIMA-based prediction of COVID-19 spread. Results Phys. 2021;27:104509. https://doi.org/10.1016/j.rinp.2021.104509
41. Bank Indonesia. Strategic Food Price Information Center (PIHPS) [Internet]. n.d. Available from: https://www.bi.go.id/hargapangan (accessed January 16, 2023 and September 8, 2023).
42. Komalawati K, Malik A, Jatuningtyas RK, Wulanjari ME. Price transmission on the Indonesian beef market. In: IOP Conference Series: Earth and Environmental Science. IOP Publishing Ltd; 2021. https://doi.org/10.1088/1755-1315/788/1/012209
43. Wibowo RP, Pebriyani D, Indrawati T, Khaliqi M. How Beef, Chicken, and Egg Demand Elasticities Vary with Income in Indonesia. Jurnal AGRISEP: Kajian Masalah Sosial Ekonomi Pertanian dan Agribisnis. 2025 Mar 14;361–84. https://doi.org/10.31186/jagrisep.24.01.361-384.
44. Fatimah SN, Zainuddin AF, Mardiana N, Mukhaiyar U. Beef Price Forecasting Based on Temporal, Spatial and Space-Time Parameter Indices. Barekeng 2025;19:1805–24. https://doi.org/10.30598/barekengvol19iss3pp1805-1824
45. Guo J, Tanaka T. The effectiveness of self-sufficiency policy: International price transmissions in beef markets. Sustainability (Switzerland) 2020;12. https://doi.org/10.3390/su12156073
Downloads
Published
How to Cite
Downloads
Issue
Section
License
Copyright (c) 2026 Innovaciencia

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
All articles published in this scientific journal are protected by copyright. The authors retain copyright and grant the journal the right of first publication, with the work simultaneously licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0), which permits sharing the work with authorship recognition and without commercial purposes.
Readers may copy and distribute the material from this journal issue for non-commercial purposes in any medium, provided the original work is cited and credit is given to the authors and the journal.
Any commercial use of the material from this journal is strictly prohibited without written permission from the copyright holder.
For more information on the copyright of the journal and open access policies, please visit our website.










