# The optimization and statistical models and methods in recognizing properties of data sets measured with errors

OSMoMeSIP-IP-2016-06-6545

This project is financed by the Croatian Science Foundation.

Project duration: 1 March 2017 - 28 February 2021

### PROJECT DESCRIPTION

As part of big data analysis, an active and attractive area of research, the project will analyze optimization and statistical aspects of recognizing properties of data sets. Research will focus on clustering problems, deconvolution models, and applications. The observed data are assumed to represent measured values of the variables under study and to contain measurement error. With large data sets it is often appropriate to first cluster the data on the basis of certain characteristics and then to apply to each group a specific model that describes properties of the variables, such as relationships among them, the possibility of separation, edges, a specific form of the set of values, dimensions (length, area or volume) of the set of values, or a general parameter vector that determines them. In many practical situations the problem can be formulated as an optimization problem whose objective function is generally neither differentiable nor convex. To solve such problems effectively, fast and accurate numerical procedures will be developed. Because the data contain errors, statistical models will also be used and important statistical properties characterized, so that the results can be understood and interpreted correctly.

### PROJECT RESEARCHERS

**Principal investigator:** Prof. Rudolf Scitovski, Department of Mathematics, University of Osijek, Croatia

**Project members:** Prof. Andrew Barron (Yale University, USA), Prof. Mirta Benšić (Department of Mathematics, University of Osijek, Croatia), Prof. Dragan Jukić (Department of Mathematics, University of Osijek, Croatia), Prof. Kristian Sabo (Department of Mathematics, University of Osijek, Croatia), Assistant Prof. Karlo Emanuel Nyarko (Faculty of Electrical Engineering, Computer Science and Information Technology Osijek, University of Osijek, Croatia), Dr. Safet Hamedović (Faculty of Metallurgy and Materials, University of Zenica, Bosnia and Herzegovina), Dr. Petar Taler (Department of Mathematics, University of Osijek, Croatia) and Una Radojičić (PhD student 2017 - 2019, Department of Mathematics, University of Osijek, Croatia).

### ACTIVITIES

**Journal Publications (published or accepted)**

1. R. Scitovski, A new global optimization method for a symmetric Lipschitz continuous function and the application to searching for a globally optimal partition of a one-dimensional set, Journal of Global Optimization **68** (2017), 713-727, DOI: 10.1007/s10898-017-0510-4 (SCIE, Mathematics, Applied)

2. D. Jukić, An elementary proof of the quadratic envelope characterization of zero-derivative points, Optimization Letters **12** (2018), 1155-1156 (SCIE, Mathematics, Applied)

3. M. Benšić, P. Taler, S. Hamedović, E.K. Nyarko, K. Sabo, LeArEst: Length and Area Estimation from Data Measured with Additive Error, The R Journal, **9**/2 (2017), 461-473 (SCIE, Probability & Statistics)

4. S. Hamedović, M. Benšić, K. Sabo, P. Taler, Estimating the size of an object captured with error, Cent Eur J Oper Res **26**/3 (2018), 771-781 (SCIE, Operations Research & Management Science)

5. R. Scitovski, M. Vinković, K. Sabo, A. Kozić, A research project ranking method based on independent reviews by using the principle of the distance to the perfectly assessed project, Croatian Operational Research Review **8** (2017), 429-442 (Web of Science Emerging Sources Citation Index (ESCI))

6. L. Jakobek, P. Matić, V. Krešić, A. R. Barron, Adsorption of Apple Polyphenols onto β-Glucan, Czech J. Food Sci. **6** (2017), 476–482 (SCIE)

7. A. Barron, M. Benšić, K. Sabo, A Note on Weighted Least Square Distribution Fitting and Full Standardization of the Empirical Distribution Function, TEST **27**/4 (2018), 946-967 (SCIE, Statistics & Probability)

8. R. Scitovski, K. Sabo, Application of the DIRECT algorithm to searching for an optimal k-partition of the set $\mathcal{A}\subset\mathbb{R}^n$ and its application to the multiple circle detection problem, Journal of Global Optimization **74**/1 (2019), 63-77 (SCIE, Mathematics, Applied)

9. R. Scitovski, U. Radojičić, K. Sabo, A fast and efficient method for solving the multiple line detection problem, Rad HAZU, Matematičke znanosti **23** (2019), 123-140 (Web of Science Emerging Sources Citation Index (ESCI), MRcc)

10. R. Scitovski, K. Sabo, DBSCAN-like clustering method for various data densities, Pattern Analysis and Applications **23** (2020), 541–554 (SCIE)

11. M. Zekić Sušac, M. Knežević, R. Scitovski, Modeling the cost of energy in public sector buildings by linear regression and deep learning, Cent Eur J Oper Res **28** (2019), 1-16 (SCIE, Operations Research & Management Science)

12. R. Scitovski, K. Sabo, The adaptation of the k-means algorithm to solving the multiple ellipses detection problem by using an initial approximation obtained by the DIRECT global optimization algorithm, Applications of Mathematics **64**/6 (2019), 663-678 (SCIE, Mathematics, Applied)

13. S. Hamedović, M. Benšić, K. Sabo, Estimating the width of a uniform distribution under symmetric measurement errors, Journal of the Korean Statistical Society (SCIE, Statistics & Probability), 2019, accepted

14. R. Scitovski, K. Sabo, A combination of k-means and DBSCAN algorithm for solving the multiple generalized circle detection problem, Advances in Data Analysis and Classification (SCIE, Statistics & Probability), 2020, accepted

15. D. Jukić, A necessary and sufficient criterion for the existence of the global minima of a continuous lower bounded function on a noncompact set, Journal of Computational and Applied Mathematics, (SCIE, Mathematics, Applied), 2020, accepted

16. L. Jakobek, P. Margetić, Š. Kraljević, Š. Ukić, M. Benšić, A. Barron, Adsorption between quercetin derivatives and beta-glucan studied with a novel approach to modeling adsorption isotherms, Applied Sciences **10** (2020), 1-16 (SCIE)

17. M. Balog, V. Ivić, R. Scitovski, I. Labak, K.F. Szűcs, R. Gaspar, S.G. Vari, M. Heffer, A mathematical model reveals sex-specific changes in glucose and insulin tolerance during rat puberty and maturation, Croatian Medical Journal **61**/2 (2020), 107-118 (SCIE)

18. M. Lauc, V. Ivić, R. Scitovski, Parameter identification in the mathematical model of glucose and insulin tolerance test – the mathematical markers of diabetes, Croatian Operational Research Review **11** (2020), 121-133 (Web of Science Emerging Sources Citation Index (ESCI))

19. K. Sabo, D. Grahovac, R. Scitovski, Incremental method for multiple line detection problem - iterative reweighted approach, Mathematics and Computers in Simulation **178** (2020), 588–602 (SCIE, Mathematics, Applied)

20. D. Jukić, T. Marošević, An existence level for residual sum of squares of the power-law regression with an unknown location parameter, Mathematica Slovaca (SCIE, Mathematics), 2020, accepted

21. R. Scitovski, S. Majstorović, K. Sabo, A combination of RANSAC and DBSCAN methods for solving the multiple geometrical object detection problem, Journal of Global Optimization (SCIE, Mathematics, Applied), 2020, accepted

22. L. Jakobek, I. Buljeta, J. Ištuk, A. R. Barron, Polyphenols of Traditional Apple Varieties in Interaction with Barley β-Glucan: A Study of the Adsorption Process, Foods **9**/9 (2020), 1278, https://doi.org/10.3390/foods9091278 (SCIE, Food Science & Technology)

23. D. Jukić, K. Sabo, An existence criterion for the nonlinear $\ell_p$-norm fitting problem, Cent Eur J Oper Res (SCIE, Operations Research & Management Science), 2021, accepted

**Software**

M. Benšić, S. Hamedović, K. Sabo, P. Taler, LeArEst R software package (published on CRAN).

**Conference proceedings**

1. P. Taler, S. Hamedović, M. Benšić, E.K. Nyarko, LeArEst - The Software for Border and Area Estimation of Data Measured with Additive Error, 59th International Symposium ELMAR-2017, Zadar, 2017, 259-263

2. R. Scitovski and K. Sabo, A Fast and Efficient Method for Solving the Multiple Generalized Circle Detection Problem, *2018 International Conference on Applied Mathematics & Computational Science (ICAMCS.NET)*, Budapest, Hungary, 2018, pp. 11-117, doi: 10.1109/ICAMCS.NET46018.2018.00010.

3. D. Jukić, K. Sabo, An existence criterion for the sum of squares, In: L. Zadnik Stirn, M. Kljajić Borštnar, J. Žerovnik, S. Drobne, J. Povh (eds), Proceedings of the 15th International Symposium on Operational Research SOR'19 (Bled, Slovenia, September 25-27, 2019), 500-505

4. U. Radojičić, R. Scitovski, K. Sabo, A Fast and Efficient Method for Solving the Multiple Closed Curve Detection Problem, In: Maria De Marsico, Gabriella Sanniti di Baja, Ana Fred (eds), Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods ICPRAM 2019 (Prague, Czech Republic, February 19-21, 2019), 269-276

5. K. Sabo, R. Scitovski, Multiple Ellipse Detection by using RANSAC and DBSCAN Method, In: Maria De Marsico, Gabriella Sanniti di Baja, Ana Fred (eds), Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods ICPRAM 2020 (Valletta, Malta, February 22-24, 2020), 129-135

**Conferences and seminars**

1. ELMAR 2017, Zadar, September 2017: P. Taler, S. Hamedović, M. Benšić, K. E. Nyarko, LeArEst - The Software for Border and Area Estimation of Data Measured with Additive Error (slides)

2. Seminar for optimization and applications, Department of Mathematics, University of Osijek, December, 2017: R. Scitovski, A method for solving the multiple ellipses detection problem

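3. Seminar for optimization and applications, Department of Mathematics, University of Osijek, January 2018: M. Benšić, Estimation of distribution parameters by the generalized least squares method and standardization of the empirical distribution [Procjena distribucijskih parametara generaliziranom metodom najmanjih kvadrata i standardizacija empirijske distribucije]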

4. Statistical seminar, Department of Mathematics, University of Osijek, March, 2018: A. Barron, Proper Statistical Fitting of Adsorption Isotherms

5. ISSCRO'18 (2nd International Statistical Conference in CROatia), Opatija, May 2018: U. Radojičić, Application of Adaptive Annealing method to generalized incremental algorithm (slides)

6. ISSCRO'18 (2nd International Statistical Conference in CROatia), Opatija, May 2018: M. Benšić, K. Sabo, S. Hamedović, The width of a uniform distribution: estimation in additive error models (slides)

7. Euro-Global Conference on Food Science, Agronomy and Technology, 20th and 22nd September 2018, Rome, Italy, L. Jakobek, P. Matić, A. R. Barron, The Application Of Adsorption Isotherms With Proper Fitting To Interpret Polyphenol Bioaccessibility In Vitro (poster)

8. KOI2018 (17th International Conference on Operational Research), Zadar, September 26–28, 2018, P. Nikić, R. Scitovski, K. Sabo, S. Majstorović, A fast algorithm for solving the multiple ellipse detection problem (slides)

9. ICAMCS2018 (International Conference on Applied Mathematics & Computational Science), Budapest, October 6–8, 2018, R. Scitovski, K. Sabo, A fast and efficient method for solving the multiple generalized circle detection problem (slides)

10. Statistical seminar, Department of Mathematics, University of Osijek, December 2018: S. Jelić, K. Sabo, Machine-learning-based models for short-term forecasting of pollen concentration [Modeli kratkoročne prognoze koncentracije peludi bazirani na strojnom učenju] (slides)

11. Seminar for optimization and applications, Department of Mathematics, University of Osijek, December 2018: R. Scitovski, Recognition of some geometric objects in the plane [Prepoznavanje nekih geometrijskih objekata u ravnini] (slides)

12. Statistical seminar, Department of Mathematics, University of Osijek, February 2019: U. Radojičić, *Algorithms for initialization of Gaussian mixture models I, II* [Algoritmi za inicijalizaciju Gaussovih miješanih modela I, II]

13. The International Conference on Pattern Recognition Applications and Methods, 19th - 21st February 2019, Prague, Czech Republic, U. Radojičić, R. Scitovski, K. Sabo, A Fast and Efficient Method for Solving the Multiple Closed Curve Detection Problem

14. Women in Data Science conference Croatia, Osijek, March 2019: M. Benšić, K. Sabo, P. Taler, S. Hamedović, *Determining precise measurements of an object from noisy data* [Određivanje preciznih mjera objekta iz zašumljenih podataka] (slides)

15. BIOSTAT 2019, June, 2019, Andrew R. Barron, Lidija Jakobek Barron, Mirta Benšić, Petra Matić, *Statistical fitting of adsorption isotherms* (slides)

16. The 18th Conference of the Applied Stochastic Models and Data Analysis International Society (ASMDA2019), Mirta Benšić, Kristian Sabo, Safet Hamedović, *Estimating the width of uniform distribution under measurement errors* (slides)

17. 21st European Young Statisticians Meeting, Belgrade, 29 July - 02 August 2019, Una Radojičić, *Algorithms for initialization of Gaussian Mixture Models* (slides)

18. The 15th International Symposium on Operations Research in Slovenia, 25th – 27th September 2019, Bled, Slovenia, Dragan Jukić, Kristian Sabo, *An existence criterion for the sum of squares* (slides)

19. The International Conference on Pattern Recognition Applications and Methods, 22nd - 24th February 2020, Valletta, Malta, K. Sabo, R. Scitovski, *Multiple ellipse detection by using RANSAC and DBSCAN method* (poster)

20. CISTI'2020 - 15th Iberian Conference on Information Systems and Technologies, 24th and 27th of June 2020, Sevilla, Spain, P. Taler, M. Benšić, E. K. Nyarko, *Statistical estimation of the object’s area from the image contaminated with additive noise* (slides)

21. Mathematical Optimization Theory and Operations Research (MOTOR 2020), July 6-10, 2020, Novosibirsk, Russia, K. Sabo, R. Scitovski, *Incremental method for multiple line detection problem* (slides)

**Project promotion**

Sveučilišni glasnik, No. 27, 14 July 2017 (page 11): Optimization and statistical models for recognizing properties of data sets measured with errors [Optimizacijski i statistički modeli prepoznavanje svojstava skupova podataka izmjerenih s pogreškama]