ON THIS PAGE

Capabilities and Application of Artificial Intelligence (AI) Models in Qualitative and Quantitative Data Mining, Data Processing and Data Analysis

Dr. Moises C. Torrentira Jr. EnP1
1University of Southeastern Philippines, , Davao City, 8000 Davao del Sur, Philippines.

Abstract

The study was conducted to determine the capabilities and application of Artificial Intelligence in the development of themes in qualitative and quantitative data analysis. Multi-stage data mining as a tool in AI data collection was employed by the researcher to process data reduction and the extraction of codes. This technique was able to generate the final themes pertaining to the capabilities and application of AI models which include pattern recognition analysis, thematic analysis, sentiment analysis, efficient data processing, objectivity and bias reduction, and insight generation.

These capabilities and application of AI models can be used to highlight the benefits and ethical use of AI in research especially in data mining, data processing, and data analysis.

1. Introduction

The recent innovation and advancement of technology in terms of machine learning and data mining has brought a complete shift of paradigm in research from traditional to Artificial Intelligence or AI-driven research. According to Kelly et al. [1], Artificial intelligence (AI) is the crucial technology that will shape the future of human civilization because of its capacity to upscale the capability of man without spending too much. The traditional conduct of qualitative and quantitative research is rigorous, time-consuming, laborious, and highly technical while the AI-driven way of conducting research is lenient, fast, convenient, time efficient, and sometimes contactless. This contrast however is inevitable and has even invited debates arising from its ethical consideration. Many have been alarmed about the capability of ChatGPT, Solab, Ernie Bot, Gemini, Claude, Grok, and Llama, Copilot [2], to name a few, to eventually depose the human intelligence, facilitate plagiarism, and feed or misrepresent information. But the internet of things has revolutionized the way researchers gather and analyze data through the internet. In fact, majority of the journals have already started accepting research articles that are AI-driven for possible publication. This means that Artificial Intelligence especially ChatGPT and the like has infiltrated the world of research [3]. Mainstreaming the AI in research therefore, is not far off from reality considering its practical acceptability.

In particular, artificial intelligence is very useful in data analysis. In terms of qualitative research, the bulk of non-numerical data gathered through interview, focus group discussion, stakeholders’ consultation, delphi technique, field reconnaissance, field notes and observations can be very tedious to process. It will require the right skills in data processing and analysis to engage in a meaningful play of words, codes, and themes. Although qualitative data processing can be done by one person alone, sometimes, collaboration of two or more researchers will result to a better formulation of themes. On the other hand, under the quantitative research, thematic analysis is necessary to come up with themes in exploratory factor analysis (EFA). Although the statistical software does the grouping of items, the naming of themes or dimensions will purely rely on the meaningful analysis of the researcher. This is where the artificial intelligence works in a powerful and efficient way. The AI may be inquired of the right codes or themes based on the pattern of ideas that was statistically grouped using the software.

This research therefore identified the capabilities of artificial intelligence models particularly in the development of themes for qualitative and quantitative data. Specifically, it answered the acceptable capabilities and applications of AI models in theme development of the gathered data sets from the AI models themselves.

2. Methodology

This study made use of artificial intelligence in generating the answers to the research questions. Considering the technical limitations, the study utilized the following AI models only: ChatGPT, Solab, Claude, YesChat, Gemini, and Copilot. Generally, these AI or language models are conceived to grasp and formulate human-like script based on the concepts or inquiries that they receive. They have the capacity to analyze vast amounts of text data and utilize a pattern of algorithms to process and respond to the queries or commands [4].

To collect the needed data or response from the AI models, a multi-stage data mining was used. This is a kind of collecting data that is done by subjecting the AI responses to different levels of data reduction inquiries until the final themes are identified. This type of data mining is also commonly used in grasping the everchanging trend of the stock market [5]. But in this study, the multi-stage data mining was utilized to reduce the responses to the final themes. For the first stage of data mining, the same inquiry was loaded in each of the AI’s input field specifically: “How helpful is Artificial Intelligence in theme development to analyze the collected data?” The selected AI models then generated very general responses. These responses are still raw data. The second stage induced human interaction to AI by manually extracting and identifying the codes generated pertaining to the capabilities and application of Artificial Intelligence in data mining and analysis. The third stage involving the AI impelled the researcher to load manually the identified codes to ChatGPT’s input field which has the capability to deal with challenging language understanding and generation tasks in a form of conversation [6]. The AI then generated the themes in an instant providing an opportunity for the researcher to gain impartial finalization of themes. Note further that without the help of AI in the production of themes, the human-generated themes are sometimes biased and limited to the mind frame of the researcher due to the dearth of information and exposure to data analysis [7]. The fourth stage consisted of the analysis of AI-generated data that shall be considered in the final selection of themes. Although there is high participation of human decision in this stage, the AI-generated themes were all included in this study and were corroborated with other related literature for triangulation purposes. This is done to further validate the final themes and ensure the legitimacy and precision [8] of the whole process.

3. Results and Discussion

The process generated several groupings that emerged in an instant and suggested the following themes on the capabilities and application of Artificial Intelligence in the analysis of qualitative and quantitative data:

A. Pattern recognition analysis

The algorithms are designed to recognize the keywords, phrases, and concepts within the data sets. In turn, the AI efficiently processes the large volume of this data to identify patterns, trends and even correlations. Apparently, Artificial Intelligence excels at identifying subtle, recurrent themes across large data sets to represent the phenomenon being studied. Additionally, techniques like Latent Dirichlet Allocation (LDA) automatically discover abstract topics or themes present in textual data which can form a network of modeling. Suen et al. [9] emphasized how pattern recognition and artificial intelligence can be used to forecast further discoveries based on the observed sequence of eventualities.

B. Thematic Analysis

AI algorithms extract meaningful themes and topics from textual, numerical, and multimedia data. Social researchers can explore underlying structures and thematic clusters with the qualitative data [10]. Further, AI generates initial thematic frameworks and presents the identified constructs representing the groups of similar trends. Natural Language Processing (NLP) techniques [11], for example, categorize information and help discover underlying themes and concepts. This is because, with thematic analysis, the recurring ideas from the qualitative data sets can be analyzed [12].

C. Sentiment analysis

A capability that is very limited to human understanding is the capability of AI to assess the emotional tone and sentiment expressed in qualitative data. This aids in understanding the participant’s attitudes and opinions in qualitative research. Aside from that, AI can recognize emotions that are expressed in texts [13] By analyzing sentiment and emotion, the AI model helps uncover implicit themes and context behind words expressed as a qualitative data. In fact, according to Chakriswaran et al. [14], AI can uncover the exact human emotion in the context.

D. Efficient data processing

Processing large volume of data can be burdensome and time-consuming for researchers. It can affect the productive use of time by spending so much on a bulk of data that can either be used or dumped. But the AI models can efficiently process large volumes of data which enables the researchers to identify patterns, trends, and correlations quickly. AI assists in generalizing, categorizing, organizing, cleaning, and transforming datasets, streamlining the whole research process. Researchers can handle vast amounts of information more efficiently with the assistance of AI which makes the qualitative analysis very scalable. Especially for unstructured data, according to Baviskar, et al. [15], millions of dollars are spent by businessmen annually just to extract information using the traditional techniques. But with AI models, the bulk of data can be processed in an instant with ease and remarkable acceptability and credibility.

E. Objectivity and bias reduction

Qualitative data processing and analysis that involve human intelligence are sometimes challenged by objectivity and bias. This can usually happen when researchers select information from their own understanding and use them to label a certain phenomenon or experience [16]. On the contrary, Artificial Intelligence approaches data analysis objectively, minimizing the influence of human bias. Consistency in coding and analysis is ensured by applying the same rules and algorithms across the dataset. Unlike human decision making which can be flawed and shaped by individual and societal biases [17], the AI then accelerates the process of coding and categorizing data, reducing the potential for human error and bias.

F. Insight generation

In Philosophy, insight can be defined as the product of deep and clear understanding of certain phenomenon leading to the abstraction of significant and extraordinary ideas. The AI models can uncover these hidden insights, formulate emergent themes, and intensify the connections within data. Commonly, researchers draw more robust conclusions and generate valuable insights from collected data. On the other hand, AI models usually excel at recognizing subtle patterns and connections, identifying latent themes not immediately apparent to human analysts. Because of their capacity to process large-scale and unstructured data sets, AI models can develop insights that cannot be predicted easily by a human mind [18].

4. Conclusion

The use of Artificial Intelligence in data analysis increases the capabilities of researchers to expedite research findings without much of a cost. It streamlines the research process allowing the researchers to focus more on the interpretation rather than on the data processing and analysis. Also, the integration of artificial intelligence in data mining especially in the development of themes significantly enhances the researchers’ capabilities to analyze the data comprehensively while extracting important insights on the research findings. However, despite of the important role of AI models in data analysis, it must be noted that there is no greater knowledge than that of the human being himself. Human interpretation and contextual understanding remain crucial and should complement the power of AI.

References

  1. Kelly, S., Kaye, S. A., & Oviedo-Trespalacios, O. (2023). What factors contribute to the acceptance of artificial intelligence? A systematic review. Telematics and Informatics, 77, 101925.

  2. O’Brien, M., & Fingerhut, H. (2023). Artificial Intelligence Technology Behind ChatGPT Was Built in Iowa—With a Lot of Water. AP News.

  3. Marshall, D. T., & Naff, D. B. (2023). The ethics of using artificial intelligence in qualitative research. Journal of Empirical Research on Human Research Ethics, 15562646241262659.

  4. Asatryan, B., Bleijendaal, H., & Wilde, A. A. (2023). Towards advanced diagnosis and management of inherited arrhythmia syndromes: Harnessing the capabilities of artificial intelligence and machine learning. Heart Rhythm.

  5. Huang, CJ., Chen, PW. & Pan, W. T. (2012). Using multi-stage data mining technique to build forecast model for Taiwan stocks. Neural Comput & Applic 21, 2057–2063.

  6. Wu, T., He, S., Liu, J., Sun, S., Liu, K., Han, Q. L., & Tang, Y. (2023). A brief overview of ChatGPT: The history, status quo and potential future development. IEEE/CAA Journal of Automatica Sinica, 10(5), 1122-1136.

  7. Belotto, M. J. (2018). Data analysis methods for qualitative research: Managing the challenges of coding, interrater reliability, and thematic analysis. The Qualitative Report, 23(11), 2622-2633.

  8. Turner, S. F., Cardinal, L. B., & Burton, R. M. (2017). Research design for mixed methods: A triangulation-based framework and roadmap. Organizational Research Methods, 20(2), 243-267.
  9. Suen, Y. T., Chan, R. C., & Wong, E. M. Y. (2021). An exploratory study of factors associated with difficulties in accessing HIV services during the COVID-19 pandemic among Chinese gay and bisexual men in Hong Kong. International Journal of Infectious Diseases, 106, 358-362.
  10. Terry, G., Hayfield, N., Clarke, V., & Braun, V. (2017). Thematic analysis. The Sage Handbook of Qualitative Research in Psychology, 2(17-37), 25.

  11. Chowdhary, K., & Chowdhary, K. R. (2020). Natural language processing. Fundamentals of Artificial Intelligence, 603-649.

  12. Riger, S. T. E. P. H. A. N. I. E., & Sigurvinsdottir, R. A. N. N. V. E. I. G. (2016). Thematic analysis. Handbook of methodological approaches to community-based research: Qualitative, Quantitative, and Mixed Methods, 33-41.

  13. Taherdoost, H., & Madanchian, M. (2023). Artificial intelligence and sentiment analysis: A review in competitive research. Computers, 12(2), 37.

  14. Chakriswaran, P., Vincent, D. R., Srinivasan, K., Sharma, V., Chang, C. Y., & Reina, D. G. (2019). Emotion AI-driven sentiment analysis: A survey, future research directions, and open issues. Applied Sciences, 9(24), 5462.

  15. Baviskar, D., Ahirrao, S., Potdar, V., & Kotecha, K. (2021). Efficient automated processing of the unstructured documents using artificial intelligence: A systematic literature review and future directions. IEEE Access, 9, 72894-72936.

  16. Sun, W., Nasraoui, O., & Shafto, P. (2020). Evolution and impact of bias in human and machine learning algorithm interaction. Plos one, 15(8), e0235502.

  17. Silberg, J., & Manyika, J. (2019). Notes from the AI frontier: Tackling bias in AI (and in humans). McKinsey Global Institute, 1(6), 1-31.

  18. Ma, L., & Sun, B. (2020). Machine learning and AI in marketing–Connecting computing power to human insights. International Journal of Research in Marketing, 37(3), 481-504.

Related Articles
Cansu Aykut Kolay1, İsmail Hakkı Mirici2
1Hacettepe University Graduate School of Educational Sciences, Ankara, Turkey.
2Hacettepe University, Faculty of Education, Ankara, Turkey.
Shatha M. AlHosian1
1College of Business Adminisrtation, King Saud University, Saudi Arabia.
Mustafa N. Mnati1, Ahmed Salih Al-Khaleefa2, Mohammed Ahmed Jubair3, Rasha Abed Hussein4
1Department of electrical engineering, Faculty of Engineering, University of Misan, Misan, Iraq.
2Department of Physics, Faculty of Education, University of Misan, Misan, Iraq.
3Department of Computer Technical Engineering, College of Information Technology, Imam Ja’afar Al-Sadiq University, Iraq.
4Department Of Dentistry, Almanara University for Medical Science, Iraq.
Samirah Dunakhir, Mukhammad Idrus1
1Faculty of Economics and Business, Universitas Negeri Makassar, Indonesia.

Citation

Dr. Moises C. Torrentira Jr. EnP. Capabilities and Application of Artificial Intelligence (AI) Models in Qualitative and Quantitative Data Mining, Data Processing and Data Analysis[J], Archives Des Sciences, Volume 74 , Issue 3, 2024. 198-200. DOI: https://doi.org/10.62227/as/74331.