Study of Violence Against Women and its Characteristics Using Application of Data Mining Techniques
DOI:
https://doi.org/10.32628/IJSRSET2411314Keywords:
Violence Against Women, Web Scraping, Feature Selection, TF IDF, Text Mining, Classification, Performance AnalysisAbstract
The Internet serves as a vast source of information, offering diverse data that can be gathered and analysed to create an extensive repository. Articles addressing the crucial issue of Violence Against Women (VAW) published online significantly enhance our understanding of this subject. In this study, we employed web scraping to collect VAW-related news, processed the data using a feature selection model to create a comma-separated dataset, and applied text mining techniques for comprehensive analysis. This included exploratory analysis and Topic Modelling to uncover latent topics. We also utilized classification algorithms such as Naive Bayes, Random Forest, Support Vector Machine (SVM), AdaBoost, and Artificial Neural Networks (ANN) to categorize the types of violence physical, psychological, sexual, or a combination. By integrating these techniques, our study provides a nuanced understanding of VAW, revealing patterns and trends that can inform targeted interventions and support mechanisms.
Downloads
References
World Health Organization. Public Health Surveillance. (2020). Availableonline at: https://www.who.int/immunization/monitoring_surveillance/burden/vpd/en (accessed September, 2021).
Australian Bureau of Statistics. Bridging the Data Gaps for Family, Domesticand Sexual Violence. (2013). Available online at: https://www.abs.gov.au/statistics/people/crime-and- justice/bridging-data-gaps- family-domestic-and-sexual- violence/latest-release (accessed September, 2021).
Australian Institute of Health and Welfare. Family, Domestic and SexualViolence in Australia. (2018). Available online at: https://www.aihw.gov.au/reports/domestic-violence/family- domestic-sexual-violence- in-australia-2018/contents/summary (accessed September, 2021).
Australian Institute of Health and Welfare. Family, Domestic and SexualViolence in Australia: Continuing the National Story. (2019). Availableonline at: https://www.aihw.gov.au/reports/domestic-violence/family-domestic-sexual- violence-australia-2019/contents/summary (accessedSeptember, 2021).
Dowse L, Soldatic K, Spangaro J, Van Toorn G. Mind the gap: the extent ofviolence against women with disabilities in Australia. Austr J Soc Issues. (2016)51:341–59. doi: 10.1002/j.1839-4655.2016.tb01235.x DOI: https://doi.org/10.1002/j.1839-4655.2016.tb01235.x
KPMG. The cost of violence against women and their children.(2016). Available online at: https://www.dss.gov.au/sites/default/files/documents/08_2016/the_cost_of_violence_against_women_and_their_children_in_australia_-_summary_report_may_2016.pdf (accessed September, 2021).
Abbe A, Grouin C, Zweigenbaum P, Falissard B. Text mining applications inpsychiatry: a systematic literature review. Int J Methods Psychiatr Res. (2016)25:86–100. doi: 10.1002/mpr.1481 DOI: https://doi.org/10.1002/mpr.1481
Spasic I, Nenadic G. Clinical text data in machine learning: systematic review.JMIR Med Inf. (2020) 8:e17984. doi: 10.2196/17984 DOI: https://doi.org/10.2196/17984
Chau M, Xu JJ, Chen H. Extracting meaningful entities from police narrativereports. In: Proceedings of the 2002 Annual National Conference on DigitalGovernment Research, Digital Government Society of North America. LosAngeles, CA (2002).
Ananyan S. Crime pattern analysis through text mining. In: AMCIS 2004Proceedings: 236. New York, NY (2004).
Chen H, Chung W, Xu JJ, Wang G, Qin Y, Chau M. Crime datamining: a general framework and some examples. Computer. (2004) 37:50–6.doi: 10.1109/MC.2004.1297301 DOI: https://doi.org/10.1109/MC.2004.1297301
Poelmans J, Elzinga P, Viaene S, Dedene G. Formally analyzing theconcepts of domestic violence. Expert Syst Appl. (2011) 38: 3116–30.doi: 10.1016/j.eswa.2010.08.103 DOI: https://doi.org/10.1016/j.eswa.2010.08.103
Haleem MS, Han L, Harding PJ, Ellison M. An automated text miningapproach for classifying mental-ill health incidents from police incidentlogs for data-driven intelligence. IN: 2019 IEEE International Conference onSystems, Man and Cybernetics (SMC). Bari: IEEE (2019). DOI: https://doi.org/10.1109/SMC.2019.8914240
Victor B, Perron BE, Sokol R, Fedina L, Ryan JP. Automated identificationof domestic violence in written child welfare records: leveraging text miningand machine learning to enhance social work research and evaluation. Soc SocWork Rese. (2020) 12. doi: 10.1086/712734 DOI: https://doi.org/10.1086/712734
Laan AM, Tollenaar N. Text mining for cybercrime in registrations of thedutch police. In: Weulen Kranenbarg M, Leukfeldt ER, editors. Cybercrimein Context. Cham: Springer (2021) p. 327–50. DOI: https://doi.org/10.1007/978-3-030-60527-8_18
Karystianis G, Adily A, Schofield P, Knight L, Galdon C, Greenberg D.Automatic extraction of mental health disorders from domestic violencepolice narratives: text mining study. J Med Internet Res. (2018) 20:e11548.doi: 10.2196/11548 DOI: https://doi.org/10.2196/11548
Karystianis G, Adily A, Schofield PW, Greenberg D, Jorm L, Nenadic G.Automated analysis of domestic violence police reports to explore abuse typesand victim injuries. J Med Internet Res. (2019) 21:e13067. doi: 10.2196./13067 DOI: https://doi.org/10.2196/13067
Karystianis G, Simpson A, Adily A, Schofield P, Greenberg D, Wand H.Prevalence of mental illnesses in domestic violence police records: text miningstudy. J Med Internet Res. (2020) 22:e23725. doi: 10.102196./23725 DOI: https://doi.org/10.2196/23725
Published
Issue
Section
License
Copyright (c) 2024 International Journal of Scientific Research in Science, Engineering and Technology
This work is licensed under a Creative Commons Attribution 4.0 International License.