แบบจำลองการวิเคราะห์ความเสี่ยงของการเกิดภาวะซึมเศร้าโดยใช้เทคนิคการเรียนรู้ของเครื่อง A Depression Risk Analysis Model using Machine Learning Techniques

แบบจำลองการวิเคราะห์ความเสี่ยงของการเกิดภาวะซึมเศร้าโดยใช้เทคนิคการเรียนรู้ของเครื่อง
A Depression Risk Analysis Model using Machine Learning Techniques

Suda Tipprasert, Thara Angskun, Jitimon Angskun

Abstract

ภาวะโรคซึมเศร้าเป็นหนึ่งในสาเหตุหลักที่ก่อให้เกิดปัญหาการฆ่าตัวตาย โดยผู้ป่วยเป็นโรคซึมเศร้าส่วนใหญ่ไม่ทราบว่า ตนเองเกิดภาวะซึมเศร้าและมักแสดงออกผ่านทางสื่อสังคมออนไลน์ผ่านข้อความหรือรูปภาพ เนื่องจากสื่อสังคมออนไลน์เป็นรูปแบบการสื่อสารผ่านช่องทางที่ไม่ต้องอาศัยการแสดงออกทางสีหน้า นอกจากนี้งานวิจัยที่มีอยู่เน้นไปที่การใช้ข้อความในเครือข่ายสังคมออนไลน์มาวิเคราะห์เพียงอย่างเดียว งานวิจัยนี้จึงนำเสนอแบบจำลองการวิเคราะห์ความเสี่ยงของการเกิดภาวะซึมเศร้าโดยใช้เทคนิคการเรียนรู้ของเครื่อง โดยใช้ข้อมูลจากการทำแบบประเมินภาวะซึมเศร้า 9 คำถาม และจากผู้ใช้งานทวิตเตอร์ จำนวน 405 คน ร่วมกับเทคนิคการเรียนรู้ของเครื่องประกอบด้วย เทคนิคนาอีฟเบย์ เทคนิคซัพพอร์ตเวกเตอร์แมชชีน เทคนิคต้นไม้การตัดสินใจ เทคนิคเพอร์เซ็ปตรอนหลายชั้น และเทคนิคการสุ่มป่าไม้ ผลการทดลองพบว่า แบบจำลองการวิเคราะห์ความเสี่ยงของการเกิดภาวะซึมเศร้าด้วยเทคนิคการสุ่มป่าไม้ให้ค่าประสิทธิภาพโดยรวมสูงที่สุด คือร้อยละ 87.39 ในขณะที่ผลการเปรียบเทียบประสิทธิภาพของแบบจำลองจากการใช้ตัวแปรนำเข้าที่แตกต่างกันพบว่า การใช้คุณลักษณะจากทวิตเตอร์ทั้งประเภทข้อความและรูปภาพให้ค่าประสิทธิภาพโดยรวมสูงที่สุด นอกจากนี้ การหาคุณลักษณะที่สำคัญพบว่า คุณลักษณะที่สำคัญที่สุดในการวิเคราะห์ความเสี่ยงภาวะซึมเศร้า คือ ค่าเฉลี่ยคะแนนความรู้สึกซึ่งเป็นคุณลักษณะของทวิตเตอร์ประเภทข้อความ

Depression is one of the major causes of suicide. Most people with depression are unaware that they have depression and often express themselves on social media through text or images because social media is a form of communication through channels that do not rely on facial expressions. Moreover, the existing research focuses solely on the use of texts in social networks for analysis. This research proposes a depression risk analysis model using machine learning techniques. The model construction applies data from the Patient Health Questionnaire-9 and Twitter data in combination with machine learning techniques. The Twitter data are collected from 405 Twitter users. There are five machine learning techniques explored in this research which are Naïve Bayes, Support Vector Machine, Decision Tree, Multilayer Perceptron, and Random Forest. The experimental results indicated that the depression risk analysis model using a Random Forest technique provided the highest F-measure at 87.39%. While a comparison of the model's performance using different input variables revealed that the use of Twitter attributes, both text and image, achieved the highest F-measure. In addition, the feature finding revealed that the most important feature in the depression risk analysis was the mean sentiment score, which is a Twitter’s text attribute.

Keywords

References

[1] The Excellence Center for Depression Disorder, Knowledge and essence about depression, World Health Day 2017. Depression: Let's talk, 2017, pp 1–2.

[2] PUEY UNGPHAKORN Institute for Economic Research. (2021, May). Mental health problems in Thailand during the Covid-19 crisis from the perspective of an economist. [Online]. (in Thai). Available: https://www.pier.or.th/ abridged/2021/08/

[3] Department of mental health, “Strategic Plan for the Department of Mental Health during the 12th National Economic and Social Development Plan (2017–2021),” Mental Health Strategy Department of Mental Health, 2021, pp 19–20.

[4] Institute for population and social research Mahidol University, Thai health report 2560. Bangkok: Amarin Printing & Publishing Public Company Limited, 2017, pp 88–89 (in Thai).

[5] K. Phanichsiri and B. Tuntasood, “Social media addiction and attention deficit and hyperactivity symptoms in high school students in bangkok,” Journal of the Psychiatrist Association of Thailand, vol. 61, no. 3, pp. 191–2004, 2016 (in Thai).

[6] M. Aldarwish and H. Ahmad, “Predicting depression levels using social media posts,” in IEEE 13th International Symposium on Autonomous Decentralized System (ISADS), Bangkok, Thailand, 2017, pp. 277–280.

[7] M. Choudhury, S. Counts and E. Horvitz, “Social media as a measurement tool of depression in populations,” in Proceedings of the 5th Annual ACM Web Science Conference (WebSci '13), New York, USA: ACM, 2013, pp. 47–56.

[8] P. Vateekul and T. Koomsubha, “A study of sentiment analysis using deep learning techniques on Thai Twitter data,” in Proceeding of 2016 13th International Joint Conference on Computer Science and Software Engineering (JCSSE), 2016, pp 1–6.

[9] M. Park, C. Cha and M. Cha, “Depressive moods of users portrayed in Twitter,” In Proceedings of the ACM SIGKDD Workshop on Healthcare Informatics (HI-KDD). , SIGKDD, 2012, pp. 1-8.

[10] A.G. Reece and C.M. Danforth, “Instagram photos reveal predictive markers of depression,” EPJ Data Science, vol. 6, no. 15, pp. 1–12. 2017.

[11] Department of mental health. (2018). Patient Health Questionnaire-9 (9Q). [Online]. (in Thai). Available: https://www.dmh.go.th/test/ download/files/2Q%209Q%208Q%20(1).pdf

[12] A. Esuli and F. Sebastiani, “SENTIWORDNET: A publicly available lexical resource for opinion mining,” in Proceedings of LREC-06, the 5th Conference on Language Resources, 2006, pp. 417–422.

[13] S. Baccianella, A. Esuli, and F. Sebastiani, “ S e n t i w o r d n e t 3 . 0 : a n e n h a n c e d lexical resource forsentiment analysis and opinion mining,” in Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10), 2010, pp. 2200–2204.

[14] C. Musto, G. Semeraro and M. Polignano, “A comparison of Lexicon-based approaches for Sentiment Analysis of microblog posts,” in CEUR Workshop Proceedings, 2014, 1314, pp. 59–68.

[15] V. A. Rao, K. Anuranjana, and R. Mamidi, “A SentiWordNet strategy for curriculum learning in sentiment analysis,” eprint arXiv, 2005.04749, 2020.

[16] S. Tipprasert. (2022). Depression dataset. [Online]. (in Thai). Available: https://www. twothai.com/dep/.

[17] P. Viola and M. Jones, (2001). “Rapid object detection using a Boosted cascade of simple features,” in Proceedings of the IEEE conference on computer vision and pattern Recognition, 2001, pp. 1–9.

[18] L. Wilms and D. Oberfeld, “Color and emotion: effects of hue, saturation, and brightness,” Psychological Research, vol. 82, pp. 896–914, 2018.

[19] P. Sanguansat. Artificial Intelligence with Machine Learning, Digital Technology. 1st ed. Nonthaburi: IDC Premier, 2019 (in Thai).

[20] A. G. Reece, A. J. Reagan, K. L. M. Lix, P. S. Dodds, C.M. Danforth and E.J. Langer, “Forecasting the onset and course of mental illness with Twitter data,” Scientific Reports 7, pp. 13006, 2017.

[21] M. F. Kabir, C. M. Rahman, A. Hossain and K. Dahal, “Enhanced classification accuracy on naive bayes data mining models,” International Journal of Computer Applications, vol. 28, no. 3, pp 9–16. 2011.

Full Text: PDF

DOI: 10.14416/j.kmutnb.2024.06.003

ISSN: 2985-2145

Username
Password
Remember me

The Journal of King Mongkut's University of Technology North Bangkokวารสารวิชาการพระจอมเกล้าพระนครเหนือ

Abstract

Keywords

References