Optimal Proxy Selection for Socioeconomic Status Inference on Twitter

Joint Authors

Levy Abitbol, Jacob
Fleury, Eric
Karsai, Márton

Source

Complexity

Issue

Vol. 2019, Issue 2019 (31 Dec. 2019), pp.1-15, 15 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2019-05-19

Country of Publication

Egypt

No. of Pages

15

Main Subjects

Philosophy

Abstract EN

Individual socioeconomic status inference from online traces is a remarkably difficult task.

While current methods commonly train predictive models on incomplete data by appending socioeconomic information of residential areas or professional occupation profiles, little attention has been paid to how well this information serves as a proxy for the individual demographic trait of interest when fed to a learning model.

Here we address this question by proposing three different data collection and combination methods to first estimate and, in turn, infer the socioeconomic status of French Twitter users from their online semantics.

We assess the validity of each proxy measure by analyzing the performance of our prediction pipeline when trained on these datasets.

Despite having to rely on different user sets, we find that training our model on professional occupation provides better predictive performance than open census data or remote sensed expert annotation of habitual environments.

Furthermore, we release the tools we developed in the hope it will provide a generalizable framework to estimate socioeconomic status of large numbers of Twitter users as well as contribute to the scientific discussion on social stratification and inequalities.

American Psychological Association (APA)

Levy Abitbol, Jacob& Fleury, Eric& Karsai, Márton. 2019. Optimal Proxy Selection for Socioeconomic Status Inference on Twitter. Complexity،Vol. 2019, no. 2019, pp.1-15.
https://search.emarefa.net/detail/BIM-1132318

Modern Language Association (MLA)

Levy Abitbol, Jacob…[et al.]. Optimal Proxy Selection for Socioeconomic Status Inference on Twitter. Complexity No. 2019 (2019), pp.1-15.
https://search.emarefa.net/detail/BIM-1132318

American Medical Association (AMA)

Levy Abitbol, Jacob& Fleury, Eric& Karsai, Márton. Optimal Proxy Selection for Socioeconomic Status Inference on Twitter. Complexity. 2019. Vol. 2019, no. 2019, pp.1-15.
https://search.emarefa.net/detail/BIM-1132318

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1132318