DataSys 2020 Congress
September 27, 2020 to October 01, 2020 - Lisbon, Portugal

  • AICT 2020, The Sixteenth Advanced International Conference on Telecommunications
  • ICIW 2020, The Fifteenth International Conference on Internet and Web Applications and Services
  • ICIMP 2020, The Fifteenth International Conference on Internet Monitoring and Protection
  • SMART 2020, The Ninth International Conference on Smart Cities, Systems, Devices and Technologies
  • IMMM 2020, The Tenth International Conference on Advances in Information Mining and Management
  • INFOCOMP 2020, The Tenth International Conference on Advanced Communications and Computation
  • MOBILITY 2020, The Tenth International Conference on Mobile Services, Resources, and Users
  • SPWID 2020, The Sixth International Conference on Smart Portable, Wearable, Implantable and Disability-oriented Devices and Systems
  • ACCSE 2020, The Fifth International Conference on Advances in Computation, Communications and Services

InfoSys 2020 Congress
September 27, 2020 to October 01, 2020 - Lisbon, Portugal

  • ICNS 2020, The Sixteenth International Conference on Networking and Services
  • ICAS 2020, The Sixteenth International Conference on Autonomic and Autonomous Systems
  • ENERGY 2020, The Tenth International Conference on Smart Grids, Green Communications and IT Energy-aware Technologies
  • WEB 2020, The Eighth International Conference on Building and Exploring Web Based Environments
  • DBKDA 2020, The Twelfth International Conference on Advances in Databases, Knowledge, and Data Applications
  • SIGNAL 2020, The Fifth International Conference on Advances in Signal, Image and Video Processing
  • BIOTECHNO 2020, The Twelfth International Conference on Bioinformatics, Biocomputational Systems and Biotechnologies

SoftNet 2020 Congress
October 18, 2020 to October 22, 2020 - Porto, Portugal

  • ICSEA 2020, The Fifteenth International Conference on Software Engineering Advances
  • ICSNC 2020, The Fifteenth International Conference on Systems and Networks Communications
  • CENTRIC 2020, The Thirteenth International Conference on Advances in Human-oriented and Personalized Mechanisms, Technologies, and Services
  • VALID 2020, The Twelfth International Conference on Advances in System Testing and Validation Lifecycle
  • SIMUL 2020, The Twelfth International Conference on Advances in System Simulation
  • SOTICS 2020, The Tenth International Conference on Social Media Technologies, Communication, and Informatics
  • INNOV 2020, The Ninth International Conference on Communications, Computation, Networks and Technologies
  • HEALTHINFO 2020, The Fifth International Conference on Informatics and Assistive Technologies for Health-Care, Medical Support and Wellbeing

InfoWare 2020 Congress
October 18, 2020 to October 22, 2020 - Porto, Portugal

  • ICCGI 2020, The Fifteenth International Multi-Conference on Computing in the Global Information Technology
  • ICWMC 2020, The Sixteenth International Conference on Wireless and Mobile Communications
  • VEHICULAR 2020, The Ninth International Conference on Advances in Vehicular Systems, Technologies and Applications
  • INTERNET 2020, The Twelfth International Conference on Evolving Internet
  • COLLA 2020, The Tenth International Conference on Advanced Collaborative Networks, Systems and Applications
  • INTELLI 2020, The Ninth International Conference on Intelligent Systems and Applications
  • VISUAL 2020, The Fifth International Conference on Applications and Systems of Visual Paradigms
  • HUSO 2020, The Sixth International Conference on Human and Social Analytics
  • BRAININFO 2020, The Fifth International Conference on Neuroscience and Cognitive Brain Information

NexTech 2020 Congress
October 25, 2020 to October 29, 2020 - Nice, France

  • UBICOMM 2020, The Fourteenth International Conference on Mobile Ubiquitous Computing, Systems, Services and Technologies
  • ADVCOMP 2020, The Fourteenth International Conference on Advanced Engineering Computing and Applications in Sciences
  • SEMAPRO 2020, The Fourteenth International Conference on Advances in Semantic Processing
  • AMBIENT 2020, The Tenth International Conference on Ambient Computing, Applications, Services and Technologies
  • EMERGING 2020, The Twelfth International Conference on Emerging Networks and Systems Intelligence
  • DATA ANALYTICS 2020, The Ninth International Conference on Data Analytics
  • GLOBAL HEALTH 2020, The Ninth International Conference on Global Health Challenges
  • CYBER 2020, The Fifth International Conference on Cyber-Technologies and Cyber-Systems

ComputationWorld 2020 Congress
October 25, 2020 to October 29, 2020 - Nice, France

  • SERVICE COMPUTATION 2020, The Twelfth International Conference on Advanced Service Computing
  • CLOUD COMPUTING 2020, The Eleventh International Conference on Cloud Computing, GRIDs, and Virtualization
  • FUTURE COMPUTING 2020, The Twelfth International Conference on Future Computational Technologies and Applications
  • COGNITIVE 2020, The Twelfth International Conference on Advanced Cognitive Technologies and Applications
  • ADAPTIVE 2020, The Twelfth International Conference on Adaptive and Self-Adaptive Systems and Applications
  • CONTENT 2020, The Twelfth International Conference on Creative Content Technologies
  • PATTERNS 2020, The Twelfth International Conference on Pervasive Patterns and Applications
  • COMPUTATION TOOLS 2020, The Eleventh International Conference on Computational Logics, Algebras, Programming, Tools, and Benchmarking
  • BUSTECH 2020, The Tenth International Conference on Business Intelligence and Technology

NetWare 2020 Congress
November 15, 2020 to November 19, 2020 - Valencia, Spain

  • SENSORCOMM 2020, The Fourteenth International Conference on Sensor Technologies and Applications
  • SENSORDEVICES 2020, The Eleventh International Conference on Sensor Device Technologies and Applications
  • SECURWARE 2020, The Fourteenth International Conference on Emerging Security Information, Systems and Technologies
  • AFIN 2020, The Twelfth International Conference on Advances in Future Internet
  • CENICS 2020, The Thirteenth International Conference on Advances in Circuits, Electronics and Micro-electronics
  • ICQNM 2020, The Fourteenth International Conference on Quantum, Nano/Bio, and Micro Technologies
  • FASSI 2020, The Sixth International Conference on Fundamentals and Advances in Software Systems Integration
  • GREEN 2020, The Fifth International Conference on Green Communications, Computing and Technologies

DigitalWorld 2020 Congress
November 21, 2020 to November 25, 2020 - Valencia, Spain

  • ICDS 2020, The Fourteenth International Conference on Digital Society
  • ACHI 2020, The Thirteenth International Conference on Advances in Computer-Human Interactions
  • GEOProcessing 2020, The Twelfth International Conference on Advanced Geographic Information Systems, Applications, and Services
  • eTELEMED 2020, The Twelfth International Conference on eHealth, Telemedicine, and Social Medicine
  • eLmL 2020, The Twelfth International Conference on Mobile, Hybrid, and On-line Learning
  • eKNOW 2020, The Twelfth International Conference on Information, Process, and Knowledge Management
  • ALLSENSORS 2020, The Fifth International Conference on Advances in Sensors, Actuators, Metering and Sensing
  • SMART ACCESSIBILITY 2020, The Fifth International Conference on Universal Accessibility in the Internet of Things and Smart Environments

 


ThinkMind // eKNOW 2018, The Tenth International Conference on Information, Process, and Knowledge Management // View article eknow_2018_7_30_60026


Ranking Subreddits by Classifier Indistinguishability in the Reddit Corpus

Authors:
Faisal Alquaddoomi
Deborah Estrin

Keywords: Natural language processing; Web mining; Clustering methods

Abstract:
Reddit, a popular online forum, provides a wealth of content for behavioral science researchers to analyze. These data are spread across various “subreddits”, subforums dedicated to specific topics. Social support subreddits are common, and users' behaviors there differ from reddit at large; most significantly, users often use 'throwaway' single-use accounts to disclose especially sensitive information. This work focuses specifically on identifying depression-relevant posts and, consequently, subreddits, by relying only on posting content. We employ posts to r/depression as labeled examples of depression-relevant posts and train a classifier to discriminate posts like them from posts randomly selected from the rest of the Reddit corpus, achieving 90% accuracy at this task. We argue that this high accuracy implies that the classifier is descriptive of "depression-like" posts, and use its ability (or lack thereof) to distinguish posts from other subreddits as discriminating the "distance" between r/depression and those subreddits. To test this approach, we performed a pairwise comparison of classifier performance between r/depression and 229 candidate subreddits. Subreddits which were very closely related thematically to r/depression, such as r/SuicideWatch, r/offmychest, and r/anxiety, were the most difficult to distinguish. A comparison this ranking of similar subreddits to r/depression to existing methods (some of which require extra data, such as user posting co-occurrence across multiple subreddits) yields similar results. Aside from the benefit of relying only on posting content, our method yields per-word importance values (heavily weighing words such as "I", "me", and "myself"), which recapitulate previous research on the linguistic phenomena that accompany mental health self-disclosure.

Pages: 128 to 133

Copyright: Copyright (c) IARIA, 2018

Publication date: March 25, 2018

Published in: conference

ISSN: 2308-4375

ISBN: 978-1-61208-620-0

Location: Rome, Italy

Dates: from March 25, 2018 to March 29, 2018

SERVICES CONTACT
2010 - 2017 © ThinkMind. All rights reserved.
Read Terms of Service and Privacy Policy.