Phd proposal: At-risk users behaviors modeling on social networks
Main topic
Data mining (text, images, videos, sounds), social networks
Context
Suicide is a person's deliberate act of ending his/her own life.
Suicide reveals serious personal problems but also often reflects a deterioration of the social context in which an individual lives.
According to a recent and alarming WHO (World Health Organisation) report (September 4, 2014), one person dies of suicide every 40 seconds in the world - more than all the yearly victims of wars and natural disaster – more than 1,100,000 by year.
Most suicide attempts are supported by hospital emergency units.
Suicide is a major public health issue with strong socio-economic consequences.
For example, the economic cost of suicide was estimated to 5 billion euros in 2009 in France. In the framework of the 2013-2020 Mental Health Action Plan, WHO member states plan 10% reduction in suicide rates in each country before 2020.
Research hypothesis
It is possible to design semi-automatic tools to exploit massive data issued from social networks, to allow dynamic and interactive knowledge discovery used in order to detect at-risk individuals.
Scientific and technological objectives. The main objective of the thesis is to design and develop new approaches for the early identification of at risk individuals through their use of the social media. The model of semi-automatic detection of suicidal profiles will be used by psychiatrists to follow on social networks, patients who stayed in their services after a first suicide attempt.
We intend to capture a possible deterioration in their mental state in order to offer assistance when needed. In this thesis, the PhD student will design and implement an approach integrating different data mining methods that will be used in a multicenter randomized controlled trial in order to prevent recurrences.
Topic
This subject is a result of a collaboration between LIRMM and the psychiatric emergency department of University Hospital of Montpellier. The main objective is to design and implement new approaches for early detection of at risk individuals through their use of the social media. The model, developed as part of this thesis, semi-automatic detection of suicidal profiles will be used by psychiatrists to follow patients on social networks, who have stayed in their service after a first suicide attempt.
We intend to capture a possible deterioration in their mental state to be able to offer assistance when needed. One of the first deliverables of this thesis is a prototype integrating different methods of text mining and will be used as part of a multicenter randomized controlled trial to test the usefulness of the model for recurrence prevention. The design and implementation of this first prototype will allow the acquisition of data on real users of social networks, users who have committed a suicide attempt and supported by the appropriate emergency services. To design this prototype, data are already available and used in Advanse team (tweets, letters...).
An important element of this thesis is the design of interactive methods dedicated to health care professionals (psychiatrists, ...) to allow, when an alarm is trigger, the best possible restitution of different the information collected about the patient.
Methods
Multi-layer Classification (bagging, boosting, staking) to detect risks symptoms then aggregated via a score defined within the framework of the thesis that makes sense for health professionals ;
Deep learning to create a specific indicator for images, videos and sounds by comparing new media to millions of streaming media available on social networks and labelled with information such as "anorexia", "scarification"… ;
Consideration of the temporal evolution of the previous indicators (topics drifts, martingale, etc.) ;
Aggregation of all the previous indicators in the form of a dashboard, recommendation and alerts for an aid to effective decision healthcare professional ;
Active learning to take into account interactions with the health professionals who validate or invalidate indicators.