Some methodological conclusions about the generative ai model data analyst
Main Article Content
Abstract
This work is devoted to studying the capabilities of the generative AI (GenAI) ChatGPT-4o model Data Analyst (integrated in Chat GPT in August 2025), which has a specialization in the field of data analysis. Currently, the use of large language models (LLM) for various tasks is already a widespread phenomenon. Data scientists are testing the capabilities of these technologies in their field. Available publications and the authors’ own experience have shown that general-purpose generative models, such as ChatGPT, provide useful feedback in some cases, and in others - not. Their common conclusions are both positive experience in applying these technologies for a number of typical tasks, and also observations about significant shortcomings for certain cases. These publications inspired us to do our own research. Since these technologies are developing extremely rapidly (on the one hand, algorithms are improving, on the other hand, the base and training time are increasing), it would be interesting to find out the current state of affairs. The purpose of this study was to test the capabilities of the Data Analyst model, specialized for performing statistical methods, machine learning methods and other computational algorithms. The question of whether this tool should be involved in teaching relevant disciplines was also studied. In this paper, we analyze the suitability of the ChatGPT-5 Data Analyst model for training in statistical methods using the example of applying this resource to the clustering problem. The capabilities of Data Analyst are considered for three data sets with different degrees of cluster separation and using different clustering methods, including k-means, single linkage, and the fuzzy clustering method c-means. The possibilities of visualization, code creation in the R program, and interpretation of results are also considered. The paper establishes the capabilities and limitations of this software tool for training in this topic. It was found that the simplest basic tasks Data Analyst has performed quite effectively, while tasks of medium complexity may not be within its power at the moment. For clustering well-separated data, clustering was effectively performed using the k-means and single linkage methods, clusters were visualized, and the working code in the R program was provided. The task was not performed for the c-means method. Regarding the interpretation of the results and comparison of the effectiveness of different methods, the obtained answers can be considered acceptable if we consider these answers as advisory, supporting human decision-making. The result of our study is the conclusion about the need to teach students to use AI for data analysis along with a discussion of its limitations, consequences, ethical aspects and challenges for professionals in this field.
Article Details
Issue
Section

This work is licensed under a Creative Commons Attribution 4.0 International License.
References
1. Ministry of Education and Science and Ministry of Digital Transformation of Ukraine, Recommendations on the responsible implementation and use of artificial intelligence technologies in higher education institutions, Government Report, 2025. URL: https://mon.gov.ua/static-objects/mon/ sites/1/news/2025/04/24/shi-v-zakladakh-vyshchoi-osvity-24-04-2025.pdf.
2. The Massachusetts Institute of Technology (MIT), Generative ai use at mit, Institutional Web Page, 2023. URL: https://ist.mit.edu/ai.
3. The Massachusetts Institute of Technology (MIT), Guidance for use of generative ai tools, Institutional Web Page, 2023. URL: https://ist.mit.edu/ai-guidance.
4. Stockholm University. Department of Computer and Systems Sciences, Dsv’s ai policy, Institutional Policy, 2024. URL: https://www.su.se/institutionen-for-data-och-systemvetenskap/utbildning/ under-utbildningen/dsv-s-ai-policy-1.705900.
5. Melbourne Institute of Technology, Use of generative artificial intelligence in learning, teaching and research policy and procedure, Institutional Policy, 2024. URL: https://www.mit.edu.au/about-mit/institute-publications/policies-procedures-and-guidelines/ GenAIinLearningTeachingAndResearch.
6. Ministry of Education and Science of Ukraine, gem-ini ai pro google, 2025. URL: https://mon.gov.ua/news/ bezoplatnyi-richnyi-dostup-do-gemini-z-pidpyskoiu-ai-pro-dlia-ukrainskykh-studentiv-vid-google.
7. J. P. Andersen, L. Degn, R. Fishberg, E. K. Graversen, S. P. J. M. Horbach, E. K. Schmidt, J. W. Schneider, M. P. Sorensen, Generative artificial intelligence (genai) in the research process – a survey of researchers’ practices and perceptions, Technology in Society 81 (2025) 102813. URL: https://doi.org/10.1016/j.techsoc.2025.102813. doi:10.1016/j.techsoc.2025.102813.
8. N. McDonald, A. Johri, A. Ali, A. H. Collier, Generative artificial intelligence in higher education: Evidence from an analysis of institutional policies and guidelines, Computers in Human Behavior: Artificial Humans 3 (2025) 100121. URL: https://doi.org/10.1016/j.chbah.2025.100121. doi:10.1016/ j.chbah.2025.100121.
9. P. Slobodsky, M. Durcheva, Using chatgpt errors to encourage student exploration and self-learning in mathematics, International Journal of Mathematical Education in Science and Technology (2025) 1–25. URL: https://doi.org/10.1080/0020739X.2025.2505200. doi:10.1080/0020739X.2025. 2505200.
10. O. Taani, S. Alabidi, Chatgpt in education: Benefits and challenges of chatgpt for mathematics and science teaching practices, International Journal of Mathematical Education in Science and Technology (2024) 1–30. URL: https://doi.org/10.1080/0020739X.2024.2357341. doi:10.1080/ 0020739X.2024.2357341.
11. A. Apps, Openai releases code interpreter plugin, significantly expanding chatgpt’s capabilities, Neurohive, 2023. URL: https://neurohive.io/en/ai-apps/code-interpreter-plugin-chatgpt/.
12. J. Schwarz, The use of generative ai in statistical data analysis and its impact on teaching statistics at universities of applied sciences, Teaching Statistics: An International Journal for Statistics and Data Science Teaching 47 (2025) 118–128. URL: https://doi.org/10.1111/test.12398. doi:10.1111/test.12398.
13. Talmo.uk, Bridging innovation and practice: Integrating modern ai technologies into the statistics curriculum, Workshop, 9 April 2025, Glasgow, 2025. URL: https://talmo.uk/2025/AIinStats.html.
14. A. R. Ellis, E. Slade, A new era of learning: Considerations for chatgpt as a tool to enhance statistics and data science education, Journal of Statistics and Data Science Education 31 (2023) 128–133. URL: https://doi.org/10.1080/26939169.2023.2223609. doi:10.1080/26939169.2023.2223609.
15. D. Prandner, D. Wetzelhutter, S. Hese, Chatgpt as a data analyst: an exploratory study on ai-supported quantitative data analysis in empirical research, Frontiers in Education, Section Digital Education 9 (2025). URL: https://doi.org/10.3389/feduc.2024.1417900. doi:10.3389/feduc.2024. 1417900.
16. T. H. Davenport, R. Bean, Five trends in ai and data science for 2025, MIT Sloan Management Review (2025). URL: https://sloanreview.mit.edu/article/five-trends-in-ai-and-data-science-for-2025/, published January 08, 2025.
17. Development of software for neuromarketing based on artificial intelligence and data science using high-performance computing and parallel programming technologies / Oleh Pastukh, Vasyl Yatsyshyn // Scientific Journal of TNTU. — Tern.: TNTU, 2024. — Vol 113. — No 1. — P. 143–149.
18. OpenAI, Chatgpt data analyst, Web Tool, 2025. URL: https://chatgpt.com/g/ g-HMNcP6w7d-data-analyst.
19. Tilburg.ai, How to use github copilot in rstudio, Web Article, 2023. URL: https://tilburg.ai/2023/12/ github-copilot-rstudio/.
20. Tilburg Science Hub, Github copilot in rstudio and vs code, Web Article, 2023. URL: https:// tilburgsciencehub.com/topics/automation/ai/gpt-models/github-copilot/.