The limits of racial prejudice Kevin Lewisa,1 a

Department of Sociology, University of California, San Diego, La Jolla, CA 92093

The racial segregation of romantic networks has been documented by social scientists for generations. However, because of limitations in available data, we still have a surprisingly basic idea of the extent to which this pattern is generated by actual interpersonal prejudice as opposed to structural constraints on meeting opportunities, how severe this prejudice is, and the circumstances under which it can be reduced. I analyzed a network of messages sent and received among 126,134 users of a popular online dating site over a 2.5-mo period. As in face-to-face interaction, online exchanges are structured heavily by race. Even when controlling for regional differences in meeting opportunities, site users—especially minority site users—disproportionately message other users from the same racial background. However, this high degree of self-segregation peaks at the first stage of contact. First, users from all racial backgrounds are equally likely or more likely to cross a racial boundary when reciprocating than when initiating romantic interest. Second, users who receive a cross-race message initiate more new interracial exchanges in the future than they would have otherwise. This effect varies by gender, racial background, and site experience; is specific to the racial background of the original sender; requires that the recipient replied to the original message; and diminishes after a week. In contrast to prior research on relationship outcomes, these findings shed light on the complex interactional dynamics that —under certain circumstances—may amplify the effects of racial boundary crossing and foster greater interracial mixing. social networks

| preemptive discrimination | OkCupid | assortative mating

R

ace is a uniquely divisive characteristic of American social life. Unlike sexual orientation, it often is immediately visible. Unlike gender, it is innately unrelated to procreation, and so conflict between racial groups is unmitigated by domestic and/or sexual dependence between men and women. Also, unlike socioeconomic status, it is a characteristic we are born with but powerless to change. Aside from its nature, the repercussions of racial background also are distinctly powerful in scope: Its consequences span from economic stratification—in which almost all racial groups experience lower median family incomes and higher rates of unemployment and poverty than do whites (1, 2)—to spatial segregation—in which minorities are disproportionately likely to live in neighborhoods characterized by crime, disorder, and concentrated disadvantage (3, 4)—to the deep-seated interpersonal hostility that has tainted American race relations since the birth of this country (5, 6). Social scientists have measured this hostility using a variety of methods. Many of these instruments, however, are biased by subjects’ desire not to appear prejudiced (5) as well as by the limits of conscious awareness (7). A common alternative is to infer the magnitude of racial prejudice from the patterning of social networks—in other words, the degree to which individuals affiliate with others who are racially similar and avoid those who are racially dissimilar. Scientists have examined this patterning across a wide variety of settings and relationships (8–10) and found that Americans’ preference for same-race alters exceeds their preference for similarity based on any other characteristic (11). This is true particularly for American romantic networks— in which two individuals are much more likely to marry, live together, date, and even “hook up” if they share the same racial background than if they do not (12–15). www.pnas.org/cgi/doi/10.1073/pnas.1308501110

Although we know that racial bias in assortative mating exists, we still know strikingly little about it. Most studies of romantic ties rely on marriage records—but as couples marry later (16) and divorce often (17), these data tell us less today than ever before. Data on romantic networks tend to focus on outcomes rather than interactions (12), which obscures the underlying processes responsible for generating observed patterns. (For instance, two individuals from the same racial background may date because they prefer to date each other or because no one else is willing to date them.) Finally, data on romantic partnerships rarely are accompanied by information on the opportunity structure from which they were selected. However, to pinpoint the degree of in-group preference/out-group exclusion accurately, it is essential to have data on the partnerships that formed as well as the partnerships that could have formed but did not (8, 9). To overcome these limitations, I analyzed data on user interactions on a popular online dating site. These data have several advantages. First, unlike traditional datasets focused on marriage, these data focus on the very early expression of romantic interest— a stage in which most people participate and at which demographic characteristics such as race are particularly salient (because interpersonal chemistry is yet unknown). Second, exchanges are recorded digitally so we can watch them unfold in continuous time. Third, because the composition of site membership is known, it is relatively straightforward to disentangle preferences from opportunity constraints. Fourth, these data are unhindered by the limitations of self-report, given that they capture actual behaviors in a “natural” (if digital) environment. Finally, online dating is not just theoretically important and methodologically useful, it is increasingly consequential for the formation of long-term, offline relationships (18). Data for these analyses include 126,134 users of the dating site OkCupid (www.okcupid.com) as well as all messages sent among these users over a 2.5-mo period (October to mid-December 2010). OkCupid advertises itself as a generalist dating Web site (as opposed to the many “niche” sites available online), and membership is completely free. All identifiers (e.g., photos, user Significance Racial segregation in romantic networks is a robust and ubiquitous social phenomenon—but one we understand remarkably poorly. In this paper, I analyze a large network of interactions among users of a popular online dating site. First, I find that users from all racial backgrounds are equally likely or more likely to cross a racial boundary when reciprocating than when initiating romantic contact. Second, I find that certain subsets of users who receive—and reply to—a cross-race message initiate more new interracial exchanges in the short-term future than they would have otherwise. These findings illustrate an important mechanism whereby racial biases in assortative mating may be reduced temporarily by the actions of others. Author contributions: K.L. designed research, performed research, analyzed data, and wrote the paper. The author declares no conflict of interest. This article is a PNAS Direct Submission. 1

E-mail: [email protected].

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10. 1073/pnas.1308501110/-/DCSupplemental.

PNAS Early Edition | 1 of 6

SOCIAL SCIENCES

Edited by Mary C. Waters, Harvard University, Cambridge, MA, and approved October 8, 2013 (received for review May 4, 2013)

Males

A

B

Iniaons

0.6

0.6

0.4

0.4

0.4

0.2

0.2

0.2

0

0

0

-0.2

-0.2

-0.2

-0.4

-0.4

-0.4

-0.6

-0.6

-1 A

B

I

H

A

B

I

W H

-0.6

-0.8 -1 A

B

W

D

I

H

A

B

I

W H

-0.8 -1 A

W

E

F

0.6

0.6

0.6

0.4

0.4

0.4

0.2

0.2

0.2

0

0

0

-0.2

-0.2

-0.2

-0.4

-0.4

-0.4

-0.6

-0.6

-0.8 -1 A

B

I

H

A

B

I

W H

Difference

C

0.6

-0.8

Females

Replies

B

I

H

A

B

I

W H

W

-0.6

-0.8 -1 A

B

W

I

H

A W

B

I

W H

-0.8 -1 A

B

I

H

A

B

I

W H

W

Fig. 1. Affiliation indices representing rates of inter- and intraracial contact among site users (n = 126,134). A and D represent patterns of initiations for men and women, respectively; B and E represent patterns of responses; and C and F represent the difference between the two. The racial background of the sender (receiver) appears across the width (depth) of the plot. Colored columns emphasize in-group behavior: green signifies a positive index (preference) and red signifies a negative index (aversion). A, Asian; B, black; H, Hispanic; I, Indian; W, white.

IDs) and open-ended text (e.g., profile essays, message content) were stripped before I acquired the data. The above study sample consists of all site users who self-identified as single, straight,† living in the United States, and “looking for” short-term and/or longterm dating (as opposed to casual sex, long-distance pen pals, etc.). It also includes only users who self-identified with one of the five largest racial categories on the site—Asian, black, Indian, Hispanic/Latin, and white‡—and who joined the Web site between October 1 and November 30, 2010. The latter restriction is to prevent artificial truncation of exchanges; otherwise, a message sent on October 1 (or any other date) might equally plausibly represent the initiation of a new exchange as the continuation of a prior exchange that began before the window of available data (Study Population). Because the length of user exchanges is difficult to interpret sans content (two users who stop communicating might equally plausibly have lost interest as taken their interaction offline), I focus only on the first message sent from A to B and, if applicable, B’s first reply. I also exclude all same-gender messages. Consequently, the resulting social network plausibly may be interpreted to reflect the expression and reciprocation of romantic interest. Importantly, the Web site does not limit interaction to matches it “recommends” but allows users to †

My focus on heterosexuals is an important limitation of this study. However, given the reliance of prior research on marriage data, we know little about whether or how samesex mate selection varies from heterosexual mate selection, such that it would be unwarranted to include both types of dynamics in the same analyses.



Racial categories are reproduced here as they appear on the site, which appears to be oriented on common use rather than logical consistency (e.g., “Asian” and “Indian” are treated as distinct categories).

2 of 6 | www.pnas.org/cgi/doi/10.1073/pnas.1308501110

search the general site population on the basis of personalized criteria (e.g., racial background and geographic proximity). Further, insofar as the site does recommend matches to its users, it does so using a transparent algorithm that rates any two users’ compatibility on the basis of self-reported priorities. Strictly speaking, therefore, all recommendations are endogenous to users’ own expressed preferences, and overall patterns of interaction should reflect individual choices rather than the exogenous interference of the dating site (Site Description). I report the results from two analyses of dating site users’ behavior. First, I examine the racial patterning of messages and responses, and the degree of in-group preference/out-group aversion characterizing each. Second, I estimate the causal effect of receiving a cross-race message on the quantity of cross-race exchanges one initiates in the future. Results Fig. 1 provides a first look at inter- and intraracial contact rates among dating site users—and how these patterns vary from initiation messages to replies. Positive (negative) columns indicate that the given sender/receiver combination occurs more (less) frequently than chance would predict given the composition of site membership, and colored columns highlight same-race contact (Materials and Methods). Men and women from all racial backgrounds disproportionately initiate contact with other site users from the same racial background. Interestingly, however, the size of this preference decreases—in some cases, to the point of aversion—when we shift focus to replies. This difference is summarized in Fig. 1 C and F, which show that same-race Lewis

Other noteworthy gendered findings are that most men (except black men) are unlikely to initiate contact with black women (Fig. 1A), all men (including Asian men) are unlikely to reply to Asian women (Fig. 1B), and although women from all racial backgrounds tend to initiate contact with men from the same background (Fig. 1D), women from all racial backgrounds also disproportionately reply to white men (Fig. 1E).

Fig. 2. Box plots of parameter estimates from ERG models of messages sent among site users (n = 102,540). I ran 44 independent models, one for each twodigit ZIP code region with more than 1,000 users in the sample; results from 43 of these models are presented here (one failed to converge) (Methodological Details). Plots follow the Tukey method: boxes represent quartiles, whiskers extend to the most extreme data point within 1.5 times the interquartile range from the edge of the box, and points represent outliers.

or not a cross-race message was received) during the first week of November, and the outcome variable (the quantity of new crossrace exchanges initiated) during the second week of November. I required all individuals in this analysis to join the Web site in October and remain members of the site through the end of the treatment period, thereby (i) incorporating users with a diversity of membership lengths while (ii) maximizing the population who had the opportunity to receive the treatment.k Among the 48,378 users eligible for this analysis, 4,263 (8.8%) received at least one interracial tie during the treatment period. Of these, 3,918 (91.9%) were matched successfully with at least one control case. Whereas individuals in the control condition initiated an average of 0.103 new interracial exchanges during the outcome period, individuals in the treatment condition initiated an average of 0.141 interracial exchanges, an increase of 37.3% (P < 0.001; Fig. 3A). The magnitude of this effect varies by account age, gender, and racial background (Fig. 3 B– D). Among women, the effect is strongest for those who joined the site 17–24 d (P < 0.001) or 25–31 d (P < 0.05) earlier. Among men, the effect is strongest for those who joined the site in the past 1–8 d (P < 0.001). The effect also is strongest for Asian women (who initiated 0.18 more interracial exchanges as a result of receiving an interracial message, an increase of 238%; P < 0.01), black women (106% increase; P < 0.05), Asian men (222% increase; P < 0.01), and white men (49% increase; P < 0.05). Fig. 4 unpacks these results even further. First, the effect of the treatment is strongest for those who have not previously been contacted by someone from a different racial background (P < 0.001 vs. P = 0.065; Fig. 4A). Second, the treatment is “effective” (in terms of producing future interracial contact) only when the recipient responds to the treatment message; such users initiate 115% more interracial exchanges during the outcome period compared with matched controls (P < 0.001; Fig. 4B).** Third,

§

{

Although 63.6% of initiation messages were sent between two users in the same twodigit ZIP code region, there are many more possible interregional pairs than intraregional ones; initiation messages between two users in the same region were 87.6 times more common than messages between two users in different regions.

Lewis

k

I restricted measurement to the second week of November so that results would not be contaminated by Thanksgiving. Robustness checks suggest they were not (Supplementary Analyses).

**In fact, when analyses from Fig. 3 are limited to users who replied to the treatment message (as in Fig. 4B), the treatment effect is statistically significant for virtually all women, regardless of account age or racial background; men who joined the site in the past 1–8 d; Asian men; and Hispanic men. For results and interpretation, see Supplementary Analyses.

PNAS Early Edition | 3 of 6

SOCIAL SCIENCES

preference is almost always greater for initiations than for replies (the sole exception is Indian men).§ To move beyond this descriptive portrait, I used a statistical model. Exponential random graph (ERG) modeling is a sophisticated technique for understanding how patterns in social networks are generated (19). Specifically, these models can statistically disentangle the contribution of various possible reasons one user might message another, including distinguishing and directly comparing patterns of initiations with patterns of replies, while inherently controlling for the structural constraints of group size (Materials and Methods). To ensure that geographic distance is not conflated with social distance (20), I estimated a separate model for each two-digit ZIP code region and considered only intraregional messages.{ Because racial minorities were absent from some regions, I also include only regions with more than 1,000 members represented in the sample. Time is suppressed in this analysis, al though with minimal ramifications for results (Methodological Details). Fig. 2 summarizes parameter estimates across these models. Unsurprisingly, the baseline likelihood of any two users contacting each other (“density”) is extremely low; however, the log odds of B sending a message to A increase tremendously if A has contacted B first (“reciprocity”). Also unsurprising (to most heterosexual dating site users) is that men are much more likely to contact women than women are to contact men (“female-receiver”). Corresponding to Fig. 1, “matching” coefficients across all five racial categories are almost always positive, indicating a high degree of in-group preference net of opportunity structures, particularly for minority site users. Additionally, however, parameter estimates for the interaction of racial matching and reciprocity (“matching*reciprocity”) are predominantly negative for Indian, Asian, and Hispanic users, and roughly distributed around zero for white and black users. In other words, although site users are still generally more likely to respond to a same-race message than a cross-race message—given that matching coefficients tend to be slightly greater in absolute value than interaction term coefficients—this tendency is either equally pronounced (for white and black users) or less pronounced (for Indian, Asian, and Hispanic users) than it is for initiating contact. (For the full array of coefficients from all models and a direct comparison of the log odds of all possible messaging scenarios, see Methodological Details.) Next, I take a counterfactual approach to estimating the causal effect of receiving a cross-race message on the quantity of new cross-race exchanges one initiates in the future (Materials and Methods). In other words, for every “treatment” case of someone who received a cross-race message, I selected one or more “control” cases who were as similar to the treatment case as possible, but who did not receive a cross-race message. These cases serve as the counterfactual to estimate the average treatment effect on the treated (21), or the average quantity of new cross-race initiations “created” per person as a consequence of receiving a cross-race message from someone else. (Estimating the average treatment effect for the entire population requires a more stringent set of assumptions that likely do not apply here.) I matched treatment and control cases exactly according to gender, racial background, and two-digit ZIP code (thereby controlling for differences in opportunity structures as well as for regional differences in prejudice) and “coarsely” according to account age, quantity of previous initiations, and quantity of previous interracial initiations (22). For each individual, I measured control/ matching variables during October, the treatment variable (whether

B

0.2

0.15 Interracial contacts

Interracial contacts

A

0.1

0.05

T

Interracial contacts

0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0

25-31 days 17-24 days 9-16 days

0

C

Male- treatment Male- control Female- treatment Female- control

C

1-8 days

Account age

1

Treatment

0.8

Control

0.6 0.4 0.2 0 Asian

Black

Indian

Hispanic

White

Hispanic

White

Females Interracial contacts

D5 4 3 2 1 0 Asian

Black

Indian

Males Fig. 3. Average quantity of interracial exchanges initiated among the treatment group (white bars) and matched controls (black bars). The difference between bars represents the average treatment effect on the treated, where 95% confidence intervals quantify the precision of this effect as estimated by negative binomial regression (confidence intervals appear asymmetrically because they are converted from a logarithmic scale). Results are presented separately for (A) the overall effect (n = 30,495), (B) comparisons by account age and gender, and (C and D) comparisons by gender and racial background. The relatively low baseline rate of interracial contact among white site users is not itself surprising, given that whites constitute the majority of the study sample and therefore have the fewest opportunities for interracial exchange. Results in their original logged form are presented in Methodological Details.

the effect of the treatment is specific to the racial background of the treatment sender: If a dating site user receives an interracial message from a member of racial group X, then that recipient is likely to initiate additional exchanges only with other members of group X in the future (P < 0.01; Fig. 4C). Finally, it is unlikely that the treatment effect is an artifact of unobserved differences between treatment and control groups. If this were the case, then we would expect the difference in outcomes between the two groups to persist over time. However, as early as 2 wk beyond the treatment period, the quantity of new interracial exchanges initiated by members of both groups becomes statistically indistinguishable (P = 0.125; Fig. 4D). Discussion Despite the centrality of romantic ties to day-to-day life, typical data on relationship outcomes obscure the dynamic interplay of attitudinal and relational forces that occurs at these relationships’ inception. I have shown that there is a “limit” to the degree of racial prejudice displayed by dating site users. First, racial boundaries are equally or more permeable when reciprocating than when initiating romantic contact; in fact, the larger the in-group bias for initiations (with Indian and Asian site users at one extreme and white site users at the other), the larger is the reversal of this bias for replies. Second, after receiving a cross-race message and sending a crossrace reply, many site users exhibit greater interracial openness in the short-term future, an effect that also is stronger for certain 4 of 6 | www.pnas.org/cgi/doi/10.1073/pnas.1308501110

categories of minorities (e.g., Asian women, Asian men, and black women) than for whites. What explains these results? One possibility, consistent with both sets of findings, is that dating site users engage in “preemptive discrimination.” In other words, part of the reason site users, and especially minority site users, do not express interest in individuals from a different racial background is because they anticipate—based on a lifetime of experiences with racism—that individuals from a different background will not be interested in them. In the event that this expectation is falsified by personal experience, however (i.e., the appearance of a cross-race suitor), the recipient not only is receptive to this overture, but begins considering future prospects she would not have otherwise considered (i.e., other site users from that same background). This would explain why the treatment effect is specific to the racial background of the sender, and also why its effects are relatively short in duration (because most subsequent suitors will again be from the same racial background as the recipient, reconfirming her initial expectations and restoring her original initiation patterns). Although a direct test of this hypothesis is not possible with these data,†† it is supported by supplementary analyses of profile views (Supplementary Analyses) as well as by research on indirect reciprocity, everyday discrimination, and social categorization. We know that people tend to “pay forward” both positive and negative experiences (23, 24) and that subtle displays of white racism toward minorities create “psychological tensions and cultural adaptations” that have a cyclical impact on minorities’ subsequent interaction with whites (25). On the other hand, researchers also have shown that the very process of racial encoding can be diminished, if not erased, in laboratory settings, suggesting that race, and hence racial prejudice, is a “volatile, dynamically updated cognitive variable” that can be “overwritten” by new circumstances (26). Approaching mate choice through the lens of online dating inevitably confronts limitations. In general, examining digital traces of interaction patterns exchanges data accuracy, reliability, and scale for issues of privacy, data management, and interpretation (27, 28). The tradeoffs between observational and experimental data of any kind are well known, and causal claims using observational data are necessarily tentative (21). Additionally, it is an open empirical question whether racial dynamics on this site are generalizable to the myriad alternative dating sites on the internet (not to mention the many different ways prospective partners interact offline). I do not have data on the “depth” of online interaction or how subsequent romantic behavior is structured by the first two exchanges, and the findings described here are limited to heterosexual American users pursuing “dating,” with unclear implications for singles living in other countries, pursuing alternative ends, or identifying with alternative sexual orientations. We know that certain types of people (29) with certain types of experiences (30) are more or less likely to cross racial boundaries. Much less common, however, is evidence demonstrating the subtle ways our own everyday actions may temporarily erode the prejudicial behavior of others. These results show that under certain circumstances, not only is the interracial expression of romantic interest more likely to be met with reciprocation than baseline rates of reciprocity and in-group preference would predict, but the consequences of this action are self-reinforcing, and might potentially set in motion a chain of future interracial contact among others. Materials and Methods I used three analytical approaches to produce the above results: one descriptive (affiliation indices) and two inferential (ERG modeling and coarsened exact matching). I describe these approaches below.

††

To test this hypothesis, it is necessary to ascertain whether the above effects are specific to certain racial combinations—for instance, minorities responding to whites—rather than focusing solely on the racial background of the recipient. However, there are too few messages between certain combinations of minority users to address this possibility adequately with the methods used here.

Lewis

B 0.5

0.5

0.4

0.4

0.3

0.3

0.2

0.2

0.1

0.1

0 No

Yes

Received interracial e previously?

Interracial contacts

Control

0 Yes

C

Treatment

No

Replied to treatment message?

D

0.5

0.5

0.4

0.4

0.3

0.3

0.2

0.2

ways. First, as a descriptive index, each subpopulation of interest (men vs. women; initiations vs. replies) must be examined separately, in contrast to a statistical model that can disentangle and directly compare the contribution of various mechanisms to observed patterns of interaction. Second, although the index has a straightforward interpretation (the proportion of time individuals from group X specifically seek out or avoid individuals from group Y), it does not consider that, because of differences in group size, individuals from some racial backgrounds (e.g., Indian site users) are encountered much less frequently—or are much “harder to find”—than individuals from other racial backgrounds (e.g., white site users). Consequently, indices are artificially inflated in proportion to the recipient’s group size. Third, for clarity of presentation, above I calculated indices only for the entire population of site membership. Not unlike studies of national intermarriage rates, this runs the risk that apparent social distance between groups may be a spurious consequence of geographic segregation (20) (e.g., individuals from group X may not actually prefer partners from group X to those from group Y, but rather more partners from group X are locally available). To complement this approach, therefore, I turned to a statistical model. In the rapidly advancing field of network methods, ERG models have received particular attention because of their breadth, flexibility, and ability to address the difficult computational problem of how networks are generated given the complex interdependencies between observations that characterize most network datasets (32). In ERG modeling, the possible ties among actors—here, messages among dating site users—are regarded as random variables. ERG models have the following general form: PrðY = yÞ =

0.1

0.1

0

0 Specific

Generalized

1st week

2nd week

Time horizon of effect

Scope of effect

Fig. 4. Average treatment effect on the treated, presented separately by (A) whether the recipient of the treatment had received an interracial message previously, (B) whether the recipient of the treatment replied to the treatment message, (C) whether the outcome is defined as the quantity of interracial exchanges the recipient initiated with individuals from the same racial background as the treatment sender (“specific” effect) or the quantity of interracial exchanges the recipient initiated with individuals from a racial background different from that of the treatment sender (“generalized” effect), and (D) whether the outcome period is defined as the first or the second week following the treatment period. Confidence intervals are calculated using the same method as in Fig. 3. Results in their original logged form are presented in Methodological Details.

Affiliation Indices. The descriptive index I use was developed by Heckathorn (31) in the context of respondent-driven sampling, but he notes it also may be used to describe patterns of association in a given social structure. If we define Sxy as the proportion of ties group X sends to group Y, and Py as the proportion of Y in the population, then the affiliation between any two groups X and Y is defined as Axy =

Py − Sxy if Py < Sxy Py − 1

Axy =

Sxy − Py if Py > Sxy : Py

[1]

According to Eq. 1, the minimum value of Axy is −1, meaning that group X never forms ties with group Y (complete avoidance); the maximum value of Axy is 1, meaning that group X forms ties only with group Y (complete preference); and Axy is zero if group X forms ties with group Y only in proportion to group Y’s representation in the population, i.e., the degree of interracial mixing we would expect by chance. Both sets of affiliation indices (initiations and replies) were calculated conditional on the appropriate opportunity structure. In other words, indices of inter- and intraracial initiation for men (women) were calculated based on the racial composition of women (men) in the general population, whereas indices of male (female) replies were calculated based on the racial composition of female (male) initiations. ERG Modeling. Heckathorn’s index helps provide an overview of the gendered racial patterning of messages and replies, but it is limited in three

Lewis

( )   X 1 exp ηA gA ðyÞ : κ A

[2]

“Dependence assumptions,” or the various possible reasons one user might contact another, determine the precise shape of the model. Specifically, the analyst posits any number of interpersonal micromechanisms that may explain observed patterns of interaction—for instance, the tendency for messages to be reciprocated (reciprocity) or the tendency for users from the same racial background to contact one another (racial matching). Each micromechanism corresponds to a particular network configuration (e.g., a “mutual” dyad consisting of both a message and a reply, or a message sent between two users from the same racial background, respectively). The presence of each configuration in the actual empirical network is quantified by gA(y) in Eq. 2, where ηA is a parameter measuring the importance of the given effect to the overall network structure. The summation is over all configurations A, and κ is a normalizing constant. Ultimately, therefore, the above expression has a straightforward and intuitive interpretation. It represents the probability of observing the empirical network that actually was observed as a function of the various underlying micromechanisms that might have produced it (19). It may be helpful to think of this method as somewhat similar to logistic regression—except that instead of a dichotomous individual variable, the outcome of interest is a dichotomous dyadic variable indicating the presence or absence of a message between any two users in the sample. In fact, interpretation of model coefficients (at least for the effects presented here) is virtually identical to those from logistic regression: the log odds of any given message can be determined simply by adding the parameter estimates for all effects that describe that message. (So, for instance, to determine the log odds of an Asian male initiating contact with an Asian female, one adds the coefficients for the density effect, the female-receiver effect, and the Asian matching effect.) However, because of the dependence between ties explicitly represented by the various reciprocity effects, violating the independence among observations, these models cannot be estimated in closed form. Instead, I used Markov chain Monte Carlo maximum likelihood estimation, a simulation-based procedure that involves simulating distributions of networks on the basis of beginning parameter estimates, comparing these network simulations against the actual observed network data, refining parameter estimates accordingly, and repeating this process until the estimates reach an acceptable degree of stabilization (33). I estimated all models using ergm, the cornerstone of the statnet suite of packages for statistical network analysis (34). Additional details regarding model specification, parameter interpretation, and checks for model degeneracy are presented in Methodological Details. Coarsened Exact Matching. The fundamental problem in any attempt to estimate the causal effect of some treatment using observational data is that treatment and control cases are likely to differ in characteristics associated with the outcome of interest as well as the probability of

PNAS Early Edition | 5 of 6

SOCIAL SCIENCES

Interracial contacts

A

receiving the treatment. Counterfactual approaches to causality attempt to address this concern (21, 35). In this framework, the analyst attempts to pair every case that has received the treatment to at least one identical (or approximately identical) control case that serves as the counterfactual outcome for the treatment case. The average treatment effect on the treated then may be estimated as the difference in average outcomes between treatment cases and their controls. (Generally, we do not have data available on how those in the control group would have behaved if they had instead received the treatment, and given that individuals who did not receive a cross-race message may have been deliberately avoided by interracial suitors for unobserved reasons, it is unwarranted to generalize to these individuals.) Ideally, sufficient data are available that treatment cases can be matched exactly on all available covariates. In other words, the data are perfectly balanced. In practice, however, this rarely is possible because of curse-ofdimensionality issues (particularly with continuous covariates). The central idea behind coarsened exact matching, therefore, is to temporarily “coarsen” one or more variables into substantively meaningful groups; exactly match on these coarsened data, thereby partitioning the data into unique strata defined by every possible combination of covariates; and then retain only the original (uncoarsened) values of the matched data and drop any observation whose stratum does not contain at least one treated and one control unit. Once completed, these strata are the foundations for calculating the treatment effect; the only inferences necessary are those relatively close to the data, leading to less model dependence and reduced statistical bias, among other advantages (22). I matched site users exactly based on gender, racial background, and two-digit ZIP code region, and coarsely based on overall quantity of initiation messages sent during the control period (dichotomized as one or more messages compared with no messages), quantity of interracial initiation messages sent during the control period (dichotomized as one or more interracial messages compared with no messages), and account length (divided into four approximately equal periods based on whether users joined in the first, second, third, or fourth quarter of

October). ‡‡ First, we may expect the greatest degree of variation in outcomes to be explained by users’ geographic region, which encompasses differences in opportunity structures for interracial contact as well as regional differences in average prejudice. Second, we may expect that users who initiated more exchanges of any kind in the past, especially interracial exchanges, would be particularly likely to initiate new interracial exchanges in the future. Third, we may expect that users who have been members of the site longer may be more or less likely to initiate exchanges of any kind and, therefore, more or less likely to initiate interracial exchanges. One problem is the possibility of unobserved differences between treatment and control groups; I address this possibility in Supplementary Analyses. I executed all matching using Stata’s cem command (22), then estimated the magnitude and statistical significance of the average treatment effect on the treated using negative binomial regression (because the outcome variable—quantity of interracial exchanges initiated—is a count variable with overdispersion).

1. Waters MC, Eschbach K (1995) Immigration and ethnic and racial inequality in the United States. Annu Rev Sociol 21:419–446. 2. Morris M, Western B (1999) Inequality in earnings at the close of the twentieth century. Annu Rev Sociol 25:623–657. 3. Sampson RJ, Raudenbush SW, Earls F (1997) Neighborhoods and violent crime: A multilevel study of collective efficacy. Science 277(5328):918–924. 4. Charles CZ (2003) The dynamics of racial residential segregation. Annu Rev Sociol 29: 167–207. 5. Schuman H, Steeh C, Bobo L, Krysan M (1997) Racial Attitudes in America: Trends and Interpretations (Harvard Univ Press, Cambridge, MA). 6. Quillian L (2006) New approaches to understanding racial prejudice and discrimination. Annu Rev Sociol 32:299–328. 7. Stanley DA, Sokol-Hessner P, Banaji MR, Phelps EA (2011) Implicit race attitudes predict trustworthiness judgments and economic trust decisions. Proc Natl Acad Sci USA 108(19):7710–7715. 8. Currarini S, Jackson MO, Pin P (2010) Identifying the roles of race-based choice and chance in high school friendship network formation. Proc Natl Acad Sci USA 107(11): 4857–4861. 9. Wimmer A, Lewis K (2010) Beyond and below racial homophily: ERG models of a friendship network documented on Facebook. AJS 116(2):583–642. 10. Berry B (2006) Friends for better or for worse: Interracial friendship in the United States as seen through wedding party photos. Demography 43(3):491–510. 11. McPherson M, Smith-Lovin L, Cook JM (2001) Birds of a feather: Homophily in social networks. Annu Rev Sociol 27:415–444. 12. Kalmijn M (1998) Intermarriage and homogamy: Causes, patterns, trends. Annu Rev Sociol 24:395–421. 13. Joyner K, Kao G (2005) Interracial relationships and the transition to adulthood. Am Sociol Rev 70(4):563–581. 14. Blackwell DL, Lichter DT (2004) Homogamy among dating, cohabiting, and married couples. Sociol Q 45(4):719–737. 15. McClintock EA (2010) When does race matter? Race, sex, and dating at an elite university. J Marriage Fam 72(1):45–72. 16. Schoen R, Standish N (2001) The retrenchment of marriage: Results from marital status life tables for the United States, 1995. Popul Dev Rev 27(3):553–563. 17. Goldstein JR (1999) The leveling of divorce in the United States. Demography 36(3): 409–414. 18. Cacioppo JT, Cacioppo S, Gonzaga GC, Ogburn EL, VanderWeele TJ (2013) Marital satisfaction and break-ups differ across on-line and off-line meeting venues. Proc Natl Acad Sci USA 110(25):10135–10140.

19. Robins G, Pattison P, Kalish Y, Lusher D (2007) An introduction to exponential random graph (p*) models for social networks. Soc Networks 29(2):173–191. 20. Harris DR, Ono H (2005) How many interracial marriages would there be if all groups were of equal size in all places? A new look at national estimates of interracial marriage. Soc Sci Res 34(1):236–251. 21. Winship C, Morgan SL (1999) The estimation of causal effects from observational data. Annu Rev Sociol 25:659–706. 22. Blackwell M, Iacus S, King G, Porro G (2009) cem: Coarsened exact matching in Stata. Stata J 9(4):524–546. 23. Nowak MA, Sigmund K (2005) Evolution of indirect reciprocity. Nature 437(7063): 1291–1298. 24. Gray K, Ward AF, Norton MI Paying it forward: Generalized reciprocity and the limits of generosity. J Exp Psychol Gen, in press. 25. Waters MC (1999) Black Identities: West Indian Immigrant Dreams and American Realities (Harvard Univ Press, Cambridge, MA). 26. Kurzban R, Tooby J, Cosmides L (2001) Can race be erased? Coalitional computation and social categorization. Proc Natl Acad Sci USA 98(26):15387–15392. 27. Lazer D, et al. (2009) Social science. Computational social science. Science 323(5915): 721–723. 28. Garton L, Haythornthwaite C, Wellman B (1997) Studying online social networks. J Comput Mediat Commun 3(1). Available at http://onlinelibrary.wiley.com/doi/10.1111/ j.1083-6101.1997.tb00062.x/full. Accessed May 4, 2013. 29. Qian Z (1997) Breaking the racial barriers: Variations in interracial marriage between 1980 and 1990. Demography 34(2):263–276. 30. Emerson MO, Kimbro RT, Yancey G (2002) Contact theory extended: The effects of

6 of 6 | www.pnas.org/cgi/doi/10.1073/pnas.1308501110

ACKNOWLEDGMENTS. I thank Felix Elwert for methodological guidance and Misiek Piskorski for collaboration in acquiring the dataset on which this research is based. I am also grateful for helpful conversations with Chana Teeger and Elizabeth Bruch; valuable feedback from the Social Psychology Brown Bag Series at the University of Southern California, the Family Working Group at the University of California, Los Angeles, the 2013 American Sociological Association session on Assortative Mating, and three anonymous reviewers; and encouragement from colleagues in the Department of Sociology at the University of California, San Diego, Kurt Gray, Kristen Lindquist, and Erin Everett. This work was supported by the Division of Research and Faculty Development at Harvard Business School.

‡‡

I replicated all analyses while also controlling for the overall quantity of initiation messages received during the control period, i.e., each user’s prior “popularity.” In general, this increased rather than decreased the size of the treatment effect, and no result in Fig. 3 or 4 that previously was statistically significant ceased to be so.

prior racial contact on current social ties. Soc Sci Q 83(3):745–761. 31. Heckathorn DD (2002) Respondent-driven sampling II: Deriving valid population estimates from chain-referral samples of hidden populations. Soc Probl 49(1):11–34. 32. Snijders TAB (2011) Statistical models for social networks. Annu Rev Sociol 37:131–153. 33. Hunter DR, Handcock MS (2006) Inference in curved exponential family models for networks. J Comput Graph Statist 15(3):565–583. 34. Handcock MS, Hunter DR, Butts CT, Goodreau SM, Morris M (2003) statnet: Software tools for the statistical modeling of network data. Available at http://statnetproject. org. Accessed May 4, 2013. 35. Morgan SL, Winship C (2007) Counterfactuals and Causal Inference: Methods and Principles for Social Research (Cambridge Univ Press, Cambridge, UK).

Lewis

Supporting Information Lewis 10.1073/pnas.1308501110 Site Description Despite the rapid rise of the internet as a romantic intermediary (1, 2), analyses of behavioral data from online dating sites still are rare. The site used for these analyses, OkCupid, is one of the more popular dating sites in the United States (3), although direct comparisons with other sites are difficult because membership data are not available. OkCupid does not market itself to any particular demographic group and therefore attracts a wide swath of the population. The absence of cost also eliminates a significant barrier to entry and likely increases the generalizability of these results. On the other hand, individuals who use a free online dating site might be considered less “serious” about finding a mate than those willing to pay for a subscription-based site, and, importantly, individuals who are particularly interested in finding a partner from one or another racial background likely will subscribe to one of the many available niche sites specifically tailored to this interest (4). Therefore, estimates of racial prejudice on OkCupid may be conservative relative to the general population. Personal Profiles. Upon joining the site, users create a personal profile not unlike those on other popular social network sites, such as Facebook and LinkedIn. Rather than revealing their actual names or identifying information, however, users choose a screen name by which they will be identified on the site. The only information users are required to provide is their gender (female/ male), sexual orientation (straight/gay/bisexual), relationship status (single/seeing someone/married), birth date, and five-digit ZIP code, all of which (except the full ZIP code) are visible to other users. Beyond this, users have the option of uploading any number of photographs to their profile (one of which serves as the primary photo used for identification around the site) and providing two types of personal information. First, users may describe themselves using several closed-ended prompts regarding their racial background, income, educational attainment, religious views, body type, and several other demographic, physical, and lifestyle descriptors. Second, users have the option of describing themselves in response to several open-ended essay prompts that serve as the main text of user profiles. Contacting Others. Upon logging into the site (which is possible

from a computer or smart phone application), users are greeted by a personalized welcome screen. Four general courses of action are available: (i) viewing and editing one’s own profile; (ii) searching for, viewing the profiles of, and/or potentially contacting other site users through a variety of means; (iii) viewing one’s inbox of messages received from other site users and potentially replying to these messages; and (iv) answering questions about one’s romantic preferences (see below). There are six methods through which users can locate other users on the site. Each method, in order of how commonly it is used (least to most common), is described below (5): i) “Quickmatch” (1% of all profile views). OkCupid’s most basic matching function simply recommends a single match upon clicking a button on the welcome screen. An abbreviated version of this person’s profile (including some text, photos, and a match percentage) then is presented to the user, who has the option of indicating her level of interest and deciding whether to view that person’s full profile. This process may be repeated as frequently as the user wishes. Lewis www.pnas.org/cgi/content/short/1308501110

ii) “Quiver” (5% of all profile views). OkCupid’s Quiver feature is similar in function to Quickmatch, except that the site provides new recommendations automatically rather than in response to user-initiated queries. Users can view the recommendations in their Quiver at any time and indicate their level of interest in each recommendation; if a certain recommendation is dismissed, it is repopulated with a new recommendation after a period of some days. iii) Seeing who visited one’s profile (5% of all profile views). On their home page, users can view a list (including thumbnail photos and match percentages) of all users who recently visited their profile. Users may disable this feature, but they then no longer appear on this list on the home pages of others. (In other words, to view someone’s profile without that person knowing, one must sacrifice the ability to know who has visited one’s own profile.) iv) Home page newsfeed (21% of all profile views). On their home page, users also are provided with a list that displays recent activity of other site members (somewhat akin to Facebook’s “News Feed” function), such as when other members have uploaded a new photograph or edited something on their profiles. Thumbnail photographs of these users also are provided. v) Viewing the profiles of others (23% of all profile views). Every time a user views the profile of another user, she is presented with a list (including thumbnail photos) of other users who are similar to the person currently being viewed, in whom the viewer may be interested. vi) Search (45% of all profile views). The most common means of locating other users on OkCupid is to use the site’s personalized search feature. Users have the option, first, of entering any number of search criteria that correspond to the closed-ended responses users provide on their profile and, second, of ordering search results any number of ways (e.g., by geographic proximity, match percentage, or how recently the user joined the site). Search results appear as a list of thumbnail photos, along with each user’s screen name, match percentage, and a very brief excerpt from one of their openended essay responses. Upon arriving at the profile of another user, regardless of which of the above methods is used, the person viewing the profile has the opportunity to send that user a message using the Web site’s internal e-mail system. Match Percentages. OkCupid differentiates itself from other Web sites not only through its absence of membership fees, but also through its unique approach to matching. After joining the site, users have the option of “improving their matches” by answering a virtually unlimited supply of questions about themselves and their mating preferences. The more questions a user answers, the more data the site acquires about the user and her preferences, and therefore the “smarter” the site becomes at recommending matches. The substance of these questions is extremely broad— ranging from the importance of sex vs. love, to one’s views about homosexuality, to how likely one is to smile at a child walking down the street—and only rarely have to do with demographic preferences (presumably because one can easily realize these preferences using the search function). For each question, users must provide three responses, respectively indicating (i) their answer to the question, (ii) what responses to the question they will accept from a potential match, and (iii) how important the question is (options are “irrelevant,” “a little important,” “some1 of 16

what important,” “very important,” and “mandatory”). Users also have the option of skipping the current question and moving on to the next. As a consequence of completing these questions, a “match percentage” appears on every profile a user visits. This percentage also appears virtually any time the site recommends a user via any of the methods described above. Match percentages are calculated based on the values of i, ii, and iii above for all questions that both site users have answered, using a straightforward (if nuanced) algorithm that OkCupid publicizes on its Web site (www.okcupid.com/help/match-percentages).

expect mating dynamics to vary substantially according to relationship status, sexual orientation, and interpersonal objectives on the Web site (other options included “new friends,” “activity partners,” “long-distance pen pals,” and “casual sex”); and that enforcing a time window prevents the truncation issue described in the main text, these criteria seemed reasonable as a starting point for exploring racial dynamics in online dating. I also acquired the timestamp, sender, receiver, and length (in characters) of all messages sent among this study population from October 1, 2010, through December 15, 2010. I placed several additional restrictions on these data for the purposes of this paper.

The Problem of Site Interference. If individual preferences and prejudices are to be inferred from patterns of interaction on an online dating site, then we must be sure these patterns reflect the autonomous choices of individuals rather than the external influence of the site (or rather, that site interference explains, at most, a negligible proportion of this behavior). There are three reasons to believe this is the case for OkCupid. First, match percentages, which constitute the backbone of OkCupid’s approach to matchmaking, are calculated transparently based exclusively on user-provided preferences. In other words, the match percentage, like a sophisticated version of the search function, simply feeds back to the user what she herself has requested. Among the six possible ways one user can find another, only the Quickmatch and Quiver functions are relatively opaque in how these matches are generated; however, (i) these collectively account for only 6% of all profile views and (ii) there is every reason to expect that match percentages constitute the basis for these recommendations, as well. Second, even when the site recommends a match via one of the above six methods, it still is the user’s decision whether to view the full profile of that person and, even more importantly, whether to send that person a message. At least two prior teams of researchers studying online dating have taken the approach of examining messaging only conditional on profile views (6, 7). However, because race is a characteristic one can search for as well as a characteristic that is visible, this approach ignores perhaps the two most important stages of selection in which racial prejudice will operate. If the degree of out-group aversion among individuals from a certain racial background is strong enough, they will never search for or view the profiles of outgroup members in the first place (8). Third, even recognizing that the ultimate decision to contact someone is in the hands of the individual user—and recognizing that site recommendations essentially are a means of preselecting a pool of eligible partners on the basis of users’ explicit preferences—it still is possible that the site interferes with these recommendations in some unknown way. For this interference to influence the findings of this paper, however, (i) such interference would have to be related directly or indirectly to racial background (interference that is unlikely to avoid detection); (ii) such interference would run directly contrary to the site’s marketing strategy regarding its unique, transparent, user-driven approach to matching; and (iii) one would have to tell a complex story for how the site might be interfering in such a way as to spuriously produce the two central findings of this paper: that cross-race replies are more likely than cross-race initiations, and that receiving a crossrace initiation causes users to extend more cross-race initiations to other users in the future.

Individual Data. Among the OkCupid users, 165,525 satisfied the individual criteria listed above. A long tradition of research has documented the importance of controlling for opportunity structures in any estimation of in- and out-group preferences; it is impossible to form relationships with individuals from another background if no such individuals are available in the first place (or if they live unreasonably far away to consider contacting). Although all users provide their five-digit ZIP code to OkCupid as a condition for site registration, I received access to only a truncated version of these codes to protect against the possibility of reidentification. Further, regional data were stripped entirely from individuals who lived in regions with comparatively few site members. Therefore, my first step was to eliminate from the baseline population the 3,143 users (1.9%) for whom regional data were not available. Among the closed-ended demographic descriptors that appear on users’ profiles, site users have the option to choose from several racial categories: Asian, Middle Eastern, black, Native American, Indian, Pacific Islander, Hispanic/Latin, white, or “other.” It also is possible to select more than one category or not to select a category at all. For the sake of simplicity, as well as to ensure that the methodological tools of this paper (which require a minimum number of inter- and intragroup messages) could be used, I focused only on users who (i) provided data on their racial background, (ii) selected only one category of racial affiliation, and (iii) belonged to one of the five largest racial categories. Of the baseline population of 162,382 users now available, 140,395 (86.5%) selected at least one racial category. Of these, 130,159 (92.7%) selected only one racial category, and of these, 126,134 (96.9%) selected one of the five largest racial categories. This constituted the final study sample for these analyses, with the demographic composition described in Table S1.

Study Population The baseline population from which I selected the sample for this study consisted of all OkCupid users who self-identified as single, straight, living in the United States, and “looking for” short-term and/or long-term dating, and who joined the site between October 1 and November 30, 2010. Given that this paper focuses on racial prejudice in the United States; that it is reasonable to Lewis www.pnas.org/cgi/content/short/1308501110

Relational Data. I first limited the sample of messages to only those sent and received by one of the 126,134 users described above. Otherwise, some messages would be sent to or received from users outside the study sample, whereas a concrete network “boundary” is required for the network analytic techniques in this paper. Next, as described in the main text, I focused only on first messages and first replies. Further, I defined a “reply” as a response sent within 2 wk of the initial message. Two weeks seemed a conservative window within which to expect a reply if the romantic interest indeed was reciprocal. Because relational data were available through mid-December—but to prevent the same kind of truncation problem that I avoided at the start of the time window—I therefore considered initiation messages sent between October 1 and November 30 (the same interval in which new users were allowed to join) and responses sent through December 14. (Otherwise, if I had included initiation messages sent through mid-December, it would be impossible to distinguish a message that never received a reply from one that received a reply beyond the time window of available data.) Finally, because the focus of this paper is on romantic contact among heterosexual users, I eliminated all same-sex messages (which accounted for less than 1% of total messages among this sample). 2 of 16

Sample Representativeness. An important question is the extent to which online dating site users—and OkCupid users in particular— are representative of the broader population of singles in the United States. We know from previous research that age, education, and income are strongly associated with the likelihood of internet access (9), and online daters are younger, make less money, and are more likely to be used compared with nononline daters (10). However, once Sautter et al. (11) limited consideration to single internet users in the United States (i.e., people “at risk” of online dating), they found that no sociodemographic variables are significantly associated with the likelihood of internet dating. Comparing the specific sample of OkCupid users in these analyses with the broader population of unmarried, internet-using adults in the United States from the relevant five racial backgrounds, the sample contains fewer blacks and Hispanics, more whites, and slightly more men than the general population. Users in the sample also are more educated, more likely to be in their 20s and 30s, and make slightly less money than we would expect by chance (although only household income data are available for the broader population, which makes the latter difference unsurprising).†

Methodological Details Exponential Random Graph Modeling. Exponential random graph (ERG) modeling is increasingly common in mainstream social scientific literature (12–15). However, the computational demands required to estimate these models place hard limits on a combination of model complexity and the size of the population over which such models may be estimated. Consequently, most applications of ERG models involve networks on the order of, at most, hundreds or a few thousand nodes (if not far fewer), and applying this method to the entire study sample used here is far outside the realm of possibility. Fortunately, it also is outside the realm of desirability; site users interested in dating rarely will initiate contact beyond a certain geographic radius (as noted in the main text), so it is hardly reasonable to consider the entire study sample as the practical opportunity structure for all users. Therefore, I estimated a separate model for each two-digit ZIP code region‡ (for all regions with more than 1,000 users represented in the sample) and assessed the overall importance of each micromechanism by looking at summary statistics of parameter estimates across all models. This technique also was used by Goodreau et al. (12) in their study of homophily and triadic closure in American high schools. The full set of parameter estimates from all models is presented in Table S2. As noted in the main text, a primary strength of these models is their great degree of flexibility; ERG models can, in principle, incorporate any number of optional parameters that correspond to virtually any possible reason one site user might contact another. Here, however, I model online messaging behavior solely as the joint product of gender, racial background, and reciprocity to present results that are as parsimonious as possible and tailored to the primary questions of this study. A few notes of caution about these models are in order. First, although I included only regions with more than 1,000 users represented in the sample (collectively accounting for 81.3% of the entire study sample) to increase the likelihood that each region would contain users from all five racial backgrounds, it is clear from Table S2 that the issue of “cell size” still was a problem. Specifically, even if there were more than one individual from a particular racial background in a given region (or, more precisely, at least one male and at least one female), it is not



Comparisons are based on the October 2010 Internet Use Supplement of the Current Population Survey.



A helpful map representing the geographic size and location of all 2-digit ZIP Code regions can be found at http://benfry.com/zipdecode/.

Lewis www.pnas.org/cgi/content/short/1308501110

possible to estimate parameters for racial matching and the interaction of racial matching and reciprocity unless there is at least one same-race tie and at least one same-race reciprocal tie among those users, respectively. Consequently, it is important to recognize that the box plot for each effect in Fig. 2 pertains only to regions in which a parameter estimate for that effect could actually be obtained (ranging from 100% of all regions for racial matching among white users, e.g., to 18% of all regions for the interaction of racial matching and reciprocity among Indian users). Second, despite the recent advances in ERG modeling used here (including simulation-based estimation), ERG models traditionally have been plagued by a problem known as “degeneracy” when a model is poorly specified and parameter estimates for that model produce empirically implausible networks, such as a graph with no ties at all or with all nodes connected to all others (16). To ensure that degeneracy was not a problem here, I used the recommended technique in the literature for assessing the fit of ERG models: running a large number of network simulations (here, 100) based on the final parameter estimates for each model, and comparing global properties of these simulated networks to the global properties of the actually observed social network (17). A typical goodness-of-fit output appears in Fig. S1, reflecting model fit for the out-degree distribution (the quantity of messages sent by each user), in-degree distribution (the quantity of messages received by each user), and geodesic distance distribution (the quantity of network steps between any two users). Although, in general, global properties of simulated networks departed substantially from global properties of the observed networks, this is not surprising given the simplicity of models used here (e.g., we would expect that a great deal of variation in in-degree, out-degree, and geodesic distance would be explained by other factors, such as individual attractiveness). Further, the fact that the fits of these models are as good as they are is in some sense quite remarkable and reflects how strongly site behavior is, in fact, patterned simply by the joint dynamics of gender, reciprocity, and racial background. Most importantly, however, virtually none of the models estimated in this paper shows any sign of degeneracy; in other words, almost all models successfully “converged” (the sole exception is noted in Table S2). Third, it is important to emphasize that all effects in each model are estimated controlling for all other effects in the model, and should be interpreted in this light. In other words, it is not necessarily the case that dating site users prefer to reciprocate ties to individuals from a different racial background in an absolute sense, but rather that the log odds of reciprocating a tie to someone from a different racial background (as captured by the matching*reciprocity interaction effects) generally are greater than (for Indian, Asian, and Hispanic users) or equal to (for white and black users) what we would expect by chance conditional on the baseline tendency toward reciprocity and the great degree of in-group bias displayed by most users when initiating contact. In other words, matching*reciprocity interaction coefficients should not be interpreted out of the context of the distinct matching and reciprocity effects that also are estimated in each model. To illustrate this point further, Fig. S2 presents the same results as in Fig. 2, but in an entirely different fashion. Whereas Fig. 2 presents summaries of the actual model coefficients across all regions, Fig. S2 presents summaries of the log odds of all possible messaging scenarios across all regions. In other words, whereas Fig. 2 disentangles the magnitude of the various effects included in each ERG model and presents these effects side by side, Fig. S2 quantifies the direct implications of these effects for the log odds of all possible messaging scenarios among site users. All results in Fig. S2 therefore are derived directly from the data featured in Fig. 2 (and presented in full in Table S2) based on very straightforward calculations. To determine the log odds of any given 3 of 16

messaging scenario, one simply adds all relevant parameter estimates to describe that scenario. So, for instance, the log odds of a female site user from one racial background initiating contact with a male site user from a different racial background (“female initiation–cross-race”) are simply equivalent to the “density” coefficient, because no other model terms describe this scenario. The log odds of a male black user initiating contact with a female black user (“male initiation–both black”) are equivalent to the density coefficient added to the “female-receiver” coefficient and the “black-matching” coefficient. Also, the log odds of a male Indian user replying to an Indian female user (“male reply-both Indian”) are equivalent to the density coefficient added to the “reciprocity” coefficient, the female-receiver coefficient, the “Indian-matching” coefficient, and the “Indian-matching*reciprocity” coefficient (to choose one of the most complex possible scenarios represented by this model). Finally, the advantage of ERG models is their ability to statistically disentangle and quantify the importance of various underlying mechanisms in producing overall network structures (13), and particularly to compare patterns of initiation with patterns of reply. Their primary disadvantage in the context of this study—and the reason affiliation indices are still helpful—is that because these models were designed for cross-sectional data, the dimension of time must be suppressed in model results.§ In other words, the network I model is merely a “snapshot” of all interactions over the course of the study period, with the restrictions described in the main text. However, the consequences of this fact for the findings presented in the main text actually are quite minimal. The reasons for this are twofold. First, although ERG models are indeed designed for cross-sectional data, they assume that these data were produced by dynamic underlying processes; in fact, the simulation-based techniques used to estimate these models respect this underlying dynamism. Second, the upshot of this consideration is that relatively little information relevant to this study actually is lost; in fact, the only practical ramification of the suppression of time for the current paper is that the ERG models, in circumstances in which both an initiation and a reply are present between two users, cannot determine which of the two users initiated the exchange. For this reason, three-way interactions among race, gender, and reciprocity are not possible. However, as is evident from Fig. S2, this does not mean that gender is altogether absent from patterns of reciprocity, because these patterns still consider the female-receiver effect (which incorporates females’ tendency to receive initiations as well as replies). Coarsened Exact Matching. To help understand the implementation of my counterfactual approach, a visualization of the partitioning of the study period into control, treatment, and outcome periods is presented in Fig. S3 (20). Although I present findings in the main text in terms of the average quantity of interracial exchanges initiated to facilitate interpretation of results, I replicate all findings from the matching analyses in Table S3 in their original, logged form (including SEs, significance levels, and the quantity of observations in each analysis). It is important to note that all effects presented in the main text are conservative for two reasons. First, I estimated all models without controlling for the additional imbalance that remained in the matched data as a consequence of the fact that the treatment and control groups

§

Recently, Krivitsky and Handcock (18) developed an extension of ERG modeling for dynamic data. However, these models require that data be recorded in discrete snapshots (rather than continuous time) and that we can properly speak of relationship “formation” and “dissolution,” whereas in the case of online dating, an instance of two people who suddenly stop messaging could equally plausibly reflect a relationship “failure” (i.e., one or the other lost interest) as a “success” (i.e., the interaction transitioned offline). One promising direction for future work is the relational event framework developed by Butts (19).

Lewis www.pnas.org/cgi/content/short/1308501110

were matched coarsely (instead of exactly) on some variables. The reason for this omission is, first, to facilitate interpretation of the causal effect (as simply the difference in mean values of the outcome variable between treatment and control groups, instead of the average difference between groups controlling for other factors) and second, in one case (the model for Asian women) the model with added controls failed to converge. However, in nearly all instances in which I replicated these models but also included controls, the average treatment effect on the treated increased in magnitude compared with models without controls (see Supplementary Analyses below). The second reason these findings are conservative is that the outcome variable—like all other relational variables in these analyses (including prior exchanges initiated, prior interracial exchanges initiated, and the treatment variable itself)—was measured only with respect to ties sent and received among the original study sample of 126,134 users. In other words, a strict interpretation of the baseline effect (Fig. 3A) is not that users who received the treatment initiated an average of 0.038 new interracial exchanges per person than they would have otherwise (0.141 minus 0.103), but rather that users who received the treatment initiated an average of 0.038 new interracial exchanges with other users in the study sample. Users in the treatment group, like all other users in the sample, were almost certainly interacting with other individuals on the site besides simply those users who had also joined the site recently, expressed that they were interested in “dating,” and so forth. Additionally, this causal estimate does not include interracial contact with any site user who belonged to one of the other racial categories on the site that were not included in these analyses. Naturally, according to this logic, it also is possible that individuals who were considered part of the control group for these analyses in fact received an interracial message during the treatment period from a user who was not herself part of the study sample; therefore, such individuals were not counted as receiving the treatment. However, this again would create a conservative bias in results. It is possible that some users in the control condition “actually” received the treatment (which would lessen the gap in outcomes between treatment and control groups, not widen it), but by definition no one in the treatment condition could have actually been untreated. A final detail of these models is worth noting. For almost all analyses, I maximized available data by allowing multiple cases to serve as controls for each treatment. The sole exception to this approach is the analysis presented in Fig. 4C (“scope of effect”). Here, I estimated the average treatment effect on the treated for two distinct outcome variables: the quantity of interracial exchanges initiated with users from the same racial background as the treatment sender, and the quantity of interracial exchanges initiated with users from a different racial background as the treatment sender. However, both these outcomes are defined in relation to the racial background of the treatment sender, although by definition the sender of the treatment does not exist for the control group and, therefore, it is unclear how the counterfactual outcome should be measured. To address this problem, rather than allowing multiple cases to serve as controls for each treatment, I required that each treatment case be assigned to precisely one control case and I personalized measurement of the counterfactual outcomes for that control case in precisely the same way as I measured the actual outcome for the treatment case. Specifically, for instance, if a certain treatment case was an Asian male who received an interracial message (treatment) from a black female, I defined this person’s outcome variables as the quantity of future exchanges initiated with other black females (specific effect) and the quantity of future exchanges initiated with Indian, Hispanic, or white females (generalized effect), and I measured the outcome variables for this person’s control case (another Asian male who did not receive a treatment) identically (i.e., the quantity of future exchanges initiated with black females, for the specific 4 of 16

effect, and with Indian, Hispanic, or white females, for the generalized effect). I also limited this analysis to users in the treatment group who received one and only one interracial message during the treatment period to avoid ambiguous situations in which site users were “treated” by exposure to individuals from more than one racial category. Such users accounted for the majority (78%) of treated cases. Supplementary Analyses I conducted five sets of supplementary analyses to clarify and confirm results presented in the main text. I present the nature and results of these analyses below. “Polite Rejections.” An important alternative explanation for why cross-race messages are more common (or, equivalently, samerace messages are less common) for replies than for initiations is that dating site users fear being perceived as racist; therefore, they are more likely to send a polite rejection (e.g., “thanks, but no thanks”) to someone from a different racial background than to someone from the same racial background rather than simply ignoring the initial message altogether.{ To test for this possibility, I first compared the distributions of message length (in characters) between cross-race replies and same-race replies, because it seems reasonable to expect that a polite rejection would tend to be shorter in length than a reply conveying genuine interest. As presented in Fig. S4, it is indeed the case that relatively short messages are slightly more common among crossrace replies than among same-race replies. Second, to assess whether this difference is driving results, I replicated the analyses presented in Fig. 1 four times—once for each reply length quartile—and focused primarily on the difference between rates of same-race initiation and same-race response (i.e., the figures presented on the diagonal of Fig. 1 C and F). As is clear from Fig. S5, the same-race affiliation index is always higher for initiations than for replies (i.e., the “difference” bar is negative) when we focus only on messages in the shortest quartile (1–37 characters), regardless of gender and racial background. However, this difference almost always exists for replies in each of the three other message length quartiles as well (and in fact, in the case of Hispanic men, black women, and Hispanic women, the difference between rates of same-race initiation and samerace reply widens the longer the message). Further, the only instances in which same-race affiliation indices were higher for replies than for initiations were for Indian men (when the length of their reply was in the second, third, or fourth quartile) and Asian women (when the length of their reply was in the second quartile), the first of which can already be observed in Fig. 1C in the main text. In other words, although it is clear that the precise difference between the likelihood of a same-race initiation and a same-race reply indeed depends on the length of the reply— and in some cases, this difference is larger the shorter the response (particularly for white dating site users, who might be especially concerned about perceptions of racism)—the difference between initiations and replies occurs almost universally regardless of reply length, and there is little reason to fear that polite rejections are driving this finding.k Same-Race “Treatment.” Turning next to the matching analyses, one alternative explanation is that the apparent effect of the

{

I thank an anonymous reviewer for this suggestion.

k

Unfortunately, it is not possible to replicate the ERG analyses presented in Fig. 2 using the different subsets of replies examined in Fig. S5, because there would be too few replies in the data for most two-digit ZIP code regions. (For instance, even considering replies of all lengths, we can see from Table S2 that it was not always possible to obtain parameter estimates for the interaction of racial matching and reciprocity because the relevant type of tie—a reply sent between two site users from the same racial background—was not present in the data.)

Lewis www.pnas.org/cgi/content/short/1308501110

treatment does not stem from having been contacted by someone from a different racial background but from having been contacted by anyone at all. In other words, one can imagine a scenario in which receiving an interracial message per se does not make dating site users more likely to reach across racial boundaries in the future, but that receiving a message from anyone makes dating site users more likely to initiate contact with anyone (including individuals from a different racial background) in the future (because perhaps receiving a message from anyone boosts site users’ confidence, or perhaps users who receive messages are more likely to remain on the site longer and therefore more likely to send out messages in the future). To check for this possibility, I conducted two additional analyses. First, if the effect documented in this paper is an artifact of receiving a tie from anyone, then we would expect that users who receive a same-race version of the treatment also would initiate more interracial exchanges in the future. We therefore can consider three categories of site users: (i) users who received at least one cross-race message during the treatment period (i.e., the actual treatment in this study), (ii) users who received only same-race messages during the treatment period, and (iii) users who did not receive any messages at all during the treatment period. I find that users in category ii indeed initiate significantly more interracial exchanges during the outcome period than users in category iii (P < 0.01). However, the size of this effect is smaller than the size of the effect when I compare users in category i with users in category iii, which also is statistically significant (P < 0.01; Fig. S6A). Second, to directly assess the difference between the two treatment groups, I dropped all individuals in category iii and instead estimated the average treatment effect on the treated when I matched all users in category i with at least one user in category iii. I found that users who received an interracial treatment initiated significantly more interracial exchanges during the outcome period than did users who received a same-race treatment (P < 0.001; Fig. S6B). In other words, simply receiving a message from anyone indeed causes dating site users to initiate more cross-race exchanges in the future, likely because receiving a message from anyone makes users more likely to initiate exchanges with anyone in the future. However, the results I present in this paper cannot be reduced to such an effect; the fact that the sender of the treatment message is from a different racial background than the receiver is indeed causally consequential. Controlling for Remaining Imbalance. A second alternative explanation for these findings is that although I matched treatment and control groups exactly according to gender, racial background, and two-digit ZIP code region, I matched these two groups only “coarsely” according to previous quantity of initiation messages sent, previous quantity of cross-race initiation messages sent, and account age. Therefore, the apparent difference between treatment and control groups could be explained simply by the remaining imbalance between the two groups with respect to the latter three variables. To check for this possibility, I ran the same model presented in Fig. 3A, but I also included the original versions of these three variables as controls. Individuals in the treatment category continued to initiate significantly more interracial exchanges during the outcome period than individuals in the control category (P < 0.001); in fact, the effect size increased, rather than decreased, in magnitude (Fig. S7). I repeated this robustness check for all other analyses presented in the main text (results not pictured). Inclusion of these controls almost always resulted in an increase in the magnitude (and a decrease in the P value) of the treatment effect; in no case did an effect that was statistically significant previously cease to be so, and in two cases an effect that was not statistically significant previously became statistically significant with the inclusion of controls (the case of female site users whose account was 9–16 5 of 16

d old, P < 0.001, and the case of site users who had received an interracial message previously, P < 0.001). The Problem of Unobservables. The final and potentially most problematic alternative explanation for these findings is that the sample is not balanced on unobserved characteristics associated with both the treatment and the outcome. In other words, there may be one or more individual characteristics that make certain dating site users both (i) more likely to attract interracial messages and (ii) more likely to send interracial messages, such that the effect of the treatment is not truly causal. This is a different problem from the alternative explanation above, which had to do with lack of balance on observed characteristics and in which the degree of imbalance was bound strictly by the level of coarsening; it also is the primary advantage of experimental research over observational research (i.e., randomized assignment to the treatment condition ensures balance on both observed and unobserved characteristics). There are two reasons the findings of this paper are unlikely to be a spurious consequence of unobserved differences. First, in order for unobserved differences to explain the effect of the treatment on the treated, not only would these differences have to be correlated with both the treatment and the outcome variables, but the differences must not be highly correlated with any of the observed variables on which treatment and control groups were matched. Given that one of the observed control/matching variables is prior degree of interracial contact (i.e., quantity of interracial exchanges initiated before the treatment period, or a pretreatment version of the outcome variable), such an explanation is very unlikely. In other words, the unobserved characteristic would somehow have to be associated with the likelihood of receiving an interracial tie during the first week of November and the likelihood of sending interracial ties during the second week of November, but not associated with the likelihood of sending interracial ties during October. It is difficult to think of any such characteristic. Second, if there is in fact some unobserved difference between treatment and control groups, then we would expect that differences in outcomes between the two groups should persist over time. In other words, if the effect of the treatment on the treated is not genuinely causal, then there is no reason we should expect the strength of this “effect” to diminish over time. To assess this possibility, I reanalyzed these data using two alternative versions of the outcome variable: one measured during the third week of November (two weeks after the treatment) and one measured during the fourth week of November (three weeks after the treatment). The results of this analysis are presented in Fig. S8, in which I also control for the remaining imbalance between treatment and control groups on observed dimensions, as in Fig. S7 (a decision that influenced the size of the treatment effect, but not its statistical significance). I find that, contrary to what we would expect if unobserved differences were motivating the treatment effect, the effect of the treatment on the treated decreases to statistical insignificance as early as 2 wk beyond the treatment. In other words, there is convincing evidence that the causal effect documented in this paper is indeed veracious, and that it endures for only a week beyond the time of treatment. After this point, treatment and control groups again become statistically indistinguishable with respect to their quantity of interracial initiation. An additional and suggestive observation from the above exercise is worth noting. This observation is that although the effect of the treatment decreases to statistical insignificance as early as the third week of November—and remains statistically insignificant during the fourth week of November—the size of the (insignificant) effect actually increases from week 3 to week 4. The implications of this difference are twofold. First, given that Thanksgiving falls on the fourth Thursday of November, it is possible that this slight increase in effect size relates to unusual Lewis www.pnas.org/cgi/content/short/1308501110

activity surrounding the holiday. For instance, site users may be particularly interested in finding a mate and, therefore, unusually susceptible to the effect of their prior treatment. Second, however, because the size of the treatment effect decreases from the second week of November to the third week of November—and only then increases from the third week of November to the fourth week of November—there is little reason to fear that any such possible “contaminating” effects of Thanksgiving may have influenced the focal outcome of this study, which was measured during the second week of November. Understanding the Causal Mechanism. In addition to addressing alternative explanations for my findings, I conducted two final analyses to better understand the causal mechanism driving results and the specific subpopulations to whom this mechanism is applicable. First, we know from the main text that site users who receive an interracial message are more likely to initiate new interracial exchanges in the future. However, this effect might be driven by one or more of the following three mechanisms: First, as a consequence of the treatment, site users might be more likely to consider (i.e., view the profiles of) individuals from a different racial background; second, site users might be more likely to contact a prospective partner from a different racial background, conditional on viewing that person’s profile; or third, site users might be more likely to contact prospective partners from a different racial background without viewing these individuals’ profiles at all. In Fig. S9, therefore, I present results from a replication of the central analysis in the main text (presented in Fig. 3A), except that I redefine the outcome variable in three different ways corresponding to the three different scenarios above: first, as the quantity of cross-race profile views (Fig. S9A); second, as the cross-race initiation rate conditional on a crossrace profile view (Fig. S9B); and third, as the quantity of crossrace initiations that were not preceded by a profile view (Fig. S9C). As in the original analysis, I also matched treatment and control cases coarsely based on pretreatment overall and racespecific versions of the outcome variable (e.g., the total quantity of profile views and the total quantity of cross-race profile views, the overall initiation rate and the cross-race initiation rate); I measured all outcomes during the outcome period; and, although I continued to use negative binomial regression for the first and third analyses, I used a binomial generalized linear model with a logit link function for the second analysis (because the outcome variable, contact rate, is measured as a proportion rather than a count). Results clearly demonstrate that the finding in the main text is driven by the first and third mechanisms above. In other words, it is not the case that site users who receive the treatment are more likely to initiate contact with prospective partners from a different racial background after viewing these users’ profiles; in fact, after the treatment they are significantly less likely to do so (P < 0.05). However, this lower rate of contact conditional on a profile view is overcompensated by the fact that site users who receive the treatment are more likely to view the profiles of prospective partners from a different racial background in the first place (P < 0.001). They also are more likely to contact individuals from a different racial background without viewing these individuals’ profiles at all (P < 0.001). In other words, the causal effect of receiving a cross-race message is that it leads dating site users to “consider” (and also to “blindly” contact) prospective partners they would not have considered (or blindly contacted) otherwise, lending additional support to the “preemptive discrimination” interpretation proposed in the main text. Second, we know (from Fig. 3) that the treatment effect on the treated varies in magnitude and statistical significance across men and women from different racial backgrounds who have been members of the Web site for shorter vs. longer periods. However, we also know (from Fig. 4B) that the treatment produces a causal 6 of 16

effect only when the recipient replies to the treatment message. Therefore, how do the subgroup results from Fig. 3 change when we replicate analyses separately for site users who did and did not reply to the treatment message, as in Fig. 4B? Results from these replications are presented in Fig. S10 (in which, unlike Figs. 3 and 4B but like many other analyses in SI Materials and Methods, results are presented in terms of the actual parameter estimate and confidence interval for the treatment effect, not the difference in the quantity of interracial initiations between treatment and control groups). Once we distinguish between those who did and did not reply to the initial treatment message, it is clear that in many—but not all—circumstances, the observed causal effect in Fig. 3 is indeed driven by the subgroup of users who replied to the treatment message; in fact, if we focus only on the users who replied to the treatment message, the effect of the treatment is statistically significant for many more subcategories of users. All women who replied to the treatment message, regardless of account age, initiated significantly more interracial exchanges during the outcome period than women who did not receive a treatment message (P < 0.01). Almost all women who replied to the treatment message, regardless of racial background, initiated significantly more interracial exchanges during the outcome period (P < 0.05) (the only exception is Indian women—although the confidence interval for these users also is unusually large because of their small representation in the sample). Among men who replied to the treatment message, there is a positive, significant effect of receiving the treatment for men who joined the Web site in the past 1–8 d (P < 0.001), Asian men (P < 0.01), and Hispanic men (P < 0.01). What explains this finding, and what explains the heterogeneity in this effect across individuals from different backgrounds? Two explanations are possible. One explanation is that there is a subset of site users for whom (for unknown reasons) racial boundaries are present but unusually fragile; in some sense, all it takes is a “push” from an external source to diminish the bias that previously prevented these individuals from considering cross-race prospects. This external push (i.e., receiving cross-race romantic interest) then causes these individuals both to reply to the original treatment message and to consider more interracial prospects in the short-term future. The second explanation (note that the two are not mutually exclusive) is that the act of replying to the original treatment message itself carries causal weight. In other words, some subset of individuals who receive a treatment message reply to this message. It might be that something is unique about those

circumstances when the reply occurs—whether an unobserved characteristic of the sender, an unobserved characteristic of the receiver, an unobserved characteristic of the message, or a particularly high level of compatibility between the two users—or it might be that these replies are more or less distributed by chance. Either way, it is then this very act of replying that temporarily reduces the level of preemptive discrimination these users display in the future—whether because the reply instigated a longer chain of interaction, and perhaps even a date, resulting in greater openness to considering future romantic prospects from that same racial background (21), or because users, observing their own reply, conclude that they must be open to considering individuals from that background and alter their subsequent search behavior accordingly (22). If this is the case, however, why does this effect apply only to certain subsets of site users? All women, regardless of account age and racial background (with the exception of Indian women noted above), display the effect; yet, men display the effect only if they have joined the site relatively recently. If indeed it is the case, as suggested in the main text, that the treatment effect is driven by a change in site users’ perceptions of others’ racial prejudice—and therefore a change in site users’ perceptions of their own likelihood of “success” with individuals from a different racial background—then we might expect exactly this difference given baseline gender dynamics in online dating. Women know their likelihood of receiving a reply from anyone is very high, whereas men know the vast majority of their advances will be ignored. In other words, even if men are persuaded that their chances with cross-race dating prospects are better than they originally thought, as long as they still think their chances with same-race prospects are better, they may continue to statistically discriminate in favor of the latter—unless they have joined the site relatively recently and therefore (i) have not yet learned how rare it is for men to receive a reply and (ii) are particularly impressionable regarding site norms and perceive that interracial contact is more common than it actually is. Meanwhile, the effect also is absent among black men, for whom the odds of receiving a reply from anyone are especially low, regardless of the racial background of the recipient (Fig. 1); Indian men, for whom the confidence interval again is especially large; and white men, for whom the effect of the treatment actually is significant only for those who do not reply to the treatment message. However, this likely is just a statistical artifact, given that the magnitude of the effect for those who do and do not reply to the treatment is quite similar, but the confidence interval is slightly larger among the former.

1. Rosenfeld MJ, Thomas RJ (2012) Searching for a mate: The rise of the internet as a social intermediary. Am Sociol Rev 77(4):523–547. 2. Cacioppo JT, Cacioppo S, Gonzaga GC, Ogburn EL, VanderWeele TJ (2013) Marital satisfaction and break-ups differ across on-line and off-line meeting venues. Proc Natl Acad Sci USA 110(25):10135–10140. 3. Paumgarten N (2011) Looking for someone: Sex, love, and loneliness on the Internet. New Yorker. Available at www.newyorker.com/reporting/2011/07/04/110704fa_fact_ paumgarten. Accessed May 4, 2013. 4. Finkel EJ, Eastwick PW, Karney BR, Reis HT, Sprecher S (2012) Online dating: A critical analysis from the perspective of psychological science. Psychol Sci Public Interest 13(1): 3–66. 5. Piskorski MJ Social Strategy: Why Social Media Platforms Work and How to Leverage Them for Competitive Advantage (Princeton Univ Press, Princeton), in press. 6. Hitsch GJ, Hortaçsu A, Ariely D (2010) What makes you click? Mate preferences in online dating. Quant Mark Econ 8:393–427. 7. Skopek J, Schulz F, Blossfeld H-P (2011) Who contacts whom? Educational homophily in online mate selection. Eur Sociol Rev 27(2):180–195. 8. Lin K-H, Lundquist J Mate selection in cyberspace: The intersection of race, gender, and education. Am J Sociol, in press. 9. DiMaggio P, Hargittai E, Neuman WR, Robinson JP (2001) Social implications of the Internet. Annu Rev Sociol 27:307–336. 10. Madden M, Lenhart A (2006) Online Dating: Americans Who Are Seeking Romance Use the Internet to Help Them in Their Search, but There Is Still Widespread Public Concern About the Safety of Online Dating (Pew Internet and American Life Project, Washington). 11. Sautter JM, Tippett RM, Morgan SP (2010) The social demography of internet dating in the United States. Soc Sci Q 91(2):554–575.

12. Goodreau SM, Kitts JA, Morris M (2009) Birds of a feather, or friend of a friend? Using exponential random graph models to investigate adolescent social networks. Demography 46(1):103–125. 13. Wimmer A, Lewis K (2010) Beyond and below racial homophily: ERG models of a friendship network documented on Facebook. AJS 116(2):583–642. 14. Srivastava SB, Banaji MR (2011) Culture, cognition, and collaborative networks in organizations. Am Sociol Rev 76(2):207–233. 15. Papachristos AV, Hureau DM, Braga AA (2013) The corner and the crew: The influence of geography and social networks on gang violence. Am Sociol Rev 78(3): 417–447. 16. Handcock MS (2003) Statistical models for social networks: Degeneracy and inference. Dynamic Social Network Modeling and Analysis, eds Breiger R, Carley K, Pattison P (National Academies Press, Washington), pp 229–240. 17. Hunter DR, Goodreau SM, Handcock MS (2008) Goodness of fit of social network models. J Am Stat Assoc 103(481):248–258. 18. Krivitsky PN, Handcock MS A separable model for dynamic networks. J R Stat Soc B, in press. 19. Butts CT (2008) A relational event framework for social action. Sociol Methodol 38: 155–200. 20. Harding DJ (2003) Counterfacutal models of neighborhood effects: The effect of neighborhood poverty on dropping out and teenage pregnancy. Am J Sociol 109(3):676–719. 21. Allport GW (1954) The Nature of Prejudice (Addison–Wesley, Reading, MA). 22. Festinger L, Carlsmith JM (1959) Cognitive consequences of forced compliance. J Abnorm Psychol 58(2):203–210.

Lewis www.pnas.org/cgi/content/short/1308501110

7 of 16

Fig. S1. Goodness-of-fit plots for the ERG model of ZIP code region 75xxx (n = 1,780). Solid lines represent observed network distributions, and box plots represent summaries of the same distributions from 100 simulated networks.

Fig. S2. Direct comparisons of the log odds of all possible messaging scenarios, derived from ERG model results summarized in Fig. 2 of the main text (and presented in full in Table S2). Cross-race scenarios are presented in red to facilitate comparisons with the same-race scenarios beneath them.

Treatment period Outcome period Control period Measure matching/control variables

October 2010

Measure whether Measure quanty of cross-race or not each user received a cross- exchanges each user iniated race message

1st week November 2010

2nd week November 2010

Fig. S3. Identifying the causal effect of receiving a cross-race message using a counterfactual framework. Matched treatment and control groups are identical (or coarsely identical) on observed characteristics before the treatment period. The difference in outcomes between matched treatment and control groups is the effect of the treatment on the treated.

Lewis www.pnas.org/cgi/content/short/1308501110

8 of 16

Fig. S4. Kernel density plots comparing the message length distribution for same-race replies (blue) and cross-race replies (red). To enhance visibility, only pictured are replies less than 1,000 characters in length, which account for 98.4% of all replies.

1st quarle

2nd quarle

3rd quarle

4th quarle

0.1

0 -0.1

Males

-0.2 -0.3 -0.4 -0.5 -0.6 -0.7 Asian 1st quarle

Black 2nd quarle

Indian 3rd quarle

Hispanic

White

4th quarle

0.1 0

Females

-0.1 -0.2 -0.3 -0.4 -0.5

-0.6 Asian

Black

Indian

Hispanic

White

Fig. S5. Replication of Fig. 1 C and F from the main text across replies of different lengths, focusing only on same-race indices. In other words, this figure represents only the quantities featured on the diagonal of Fig. 1 C and F—the difference in same-race affiliation indices between patterns of initiations and patterns of replies—considered separately for replies in the first quartile (1–37 characters), second quartile (37–104 characters), third quartile (104–252 characters), and fourth quartile (252–10,182 characters) of message length.

Lewis www.pnas.org/cgi/content/short/1308501110

9 of 16

Parameter esmate

A

B 0.8

0.8

0.7

0.7

0.6

0.6

0.5

0.5

0.4

0.4

0.3

0.3

0.2

0.2

0.1

0.1 0

0 Effect of same-race treatment vs. no treatment

Effect of cross-race treatment vs. no treatment

Effect of cross-race treatment vs. same-race treatment

Fig. S6. Robustness check to ensure that the treatment effect is not simply an artifact of receiving a message from anyone. First, I compared the effect of receiving only same-race messages during the treatment period (relative to those who did not receive any messages) with the effect of receiving at least one cross-race message during the treatment period (relative to those who did not receive any messages) (A). Next, I assessed the effect of receiving at least one cross-race message during the treatment period relative to those who received only same-race messages (B).

0.7

Parameter esmate

0.6 0.5 0.4

0.3 0.2 0.1 0 Effect of treatment without controls

Effect of treatment with controls

Fig. S7. Average treatment effect on the treated, estimated without and with additional controls (quantity of prior exchanges initiated, quantity of prior interracial exchanges initiated, account age) to account for remaining imbalance between treatment and control groups on observed dimensions.

Lewis www.pnas.org/cgi/content/short/1308501110

10 of 16

0.7 0.6

Parameter esmate

0.5 0.4 0.3 0.2 0.1 0 -0.1 -0.2 2nd week of November

3rd week of November

4th week of November

Fig. S8. Average treatment effect on the treated, where outcome variable (quantity of cross-race exchanges initiated) is measured over three distinct periods (second, third, and fourth weeks of November) corresponding to the three weeks following the treatment period (first week of November).

Cross-race profile views

1.6 1.4 1.2

1 0.8 0.6 0.4 0.2 0

C

0.12

Cross-race iniaons without view

B

1.8

Cross-race iniaon rate aer view

A

0.1 0.08 0.06 0.04 0.02 0

T

C

0.025 0.02 0.015 0.01

0.005 0

T

C

T

C

Fig. S9. Average treatment effect on the treated, where outcome variable is defined as (A) the quantity of cross-race profile views (negative binomial regression), (B) the cross-race initiation rate conditional on a profile view (binomial generalized linear model with logit link function), and (C) the quantity of cross-race initiations that were not preceded by a profile view (negative binomial regression).

Lewis www.pnas.org/cgi/content/short/1308501110

11 of 16

Parameter esmate

A 3

B3

2

2

1

1

0

0

-1

-1

Reply

No reply

-2

-2

25-31 days17-24 days 9-16 days 1-8 days

25-31 days17-24 days 9-16 days 1-8 days

Account age (females)

Account age (males)

Parameter esmate

C 4 3 2 1 0 -1 -2 -3 Asian

Black

Indian

Hispanic

White

Hispanic

White

Racial background (females)

Parameter esmate

D 4 3 2 1 0 -1 -2 -3 Asian

Black

Indian Racial background (males)

Fig. S10. Replication of Fig. 3 B–D from the main text, repeated separately for users who did and did not reply to the treatment message (as in Fig. 4B). Specifically, results are presented separately for (A) women with varying account ages, (B) men with varying account ages, (C) women from different racial backgrounds, and (D) men from different racial backgrounds.

Lewis www.pnas.org/cgi/content/short/1308501110

12 of 16

Table S1.

Demographic composition of study sample

Attribute

Male (n = 67,054)

Female (n = 59,080)

3.61 5.21 0.99 6.64 83.55

4.11 6.46 0.63 5.90 82.91

8.15 7.07 5.87 4.34 3.28 2.08 1.64 1.81 1.88 0.72 0.26 0.11 0.69 22.72 39.38

7.53 5.27 3.69 2.42 1.70 0.92 0.53 0.72 0.44 0.16 0.05 0.03 0.25 28.28 48.02

13.20 11.34 47.21 8.95 1.43 0.85 2.02 2.34 12.68

10.30 10.70 47.59 13.80 1.53 1.10 1.96 1.21 11.81

10.73 8.52 28.18 2.54 11.21 0.12 0.40 1.22 10.96 26.13

8.01 3.99 35.64 3.78 13.83 0.07 0.28 1.00 10.11 23.29

30.06 9.97

31.34 10.82

Race Asian Black Indian Hispanic/Latin White Income, $ 1,000,000 Rather not say [Unreported] Education High school 2-y college College/university Master’s program Law school Medical school PhD program Space camp [Unreported] Religion Agnosticism Atheism Christianity Judaism Catholicism Islam Hinduism Buddhism Other [Unreported] Age Mean SD

All statistics are percentages unless otherwise indicated. Categories above are reproduced exactly as they appear on the dating site. Naturally, it is not possible to assess the veracity of these self-reported data; in at least one instance (“space camp”), the response option is clearly meant to be facetious.

Lewis www.pnas.org/cgi/content/short/1308501110

13 of 16

Lewis www.pnas.org/cgi/content/short/1308501110

14 of 16

01 02 06 07 08 10 11 14 15 19 20 21 22 23 27 28 29 30 32 33 37 43 44 45 46 48 49 53 55 60 75 76 77 78 80 85 90 91 92 93 94

Twodigit ZIP code region

Table S2.

1,809 4,136 1,646 2,035 1,715 5,599 4,952 1,174 1,158 2,902 2,465 1,807 1,359 1,133 1,509 1,658 1,115 2,632 2,528 2,872 1,329 1,337 1,762 1,232 1,509 2,750 1,056 1,400 2,127 5,054 1,780 1,345 2,072 2,685 2,307 2,237 3,689 2,315 3,853 1,118 4,777

N

Reciprocity

6.67 7.07 6.56 7.31 7.55 7.24 7.55 7.32 6.36 7.08 7.34 7.38 6.46 7.28 7.33 7.14 7.35 7.55 7.37 7.41 6.51 7.19 7.00 7.15 6.36 7.29 6.59 7.16 6.55 7.16 6.90 6.55 6.91 7.10 6.52 7.63 7.12 8.15 7.84 7.32 7.30

Density

−8.75 −8.16 −7.63 −8.85 −8.56 −8.87 −9.02 −8.13 −7.51 −8.36 −8.70 −8.48 −8.20 −8.41 −8.41 −8.41 −8.29 −8.85 −8.49 −8.75 −7.72 −8.03 −8.02 −7.90 −7.95 −8.34 −7.52 −7.87 −7.61 −8.49 −8.01 −7.82 −8.03 −8.14 −7.54 −8.65 −8.57 −9.35 −9.12 −8.17 −8.66 1.59 1.57 1.84 1.79 1.83 1.63 1.65 2.01 1.64 1.58 1.77 1.85 1.73 2.08 1.45 1.84 1.76 2.01 1.78 2.01 1.49 1.92 1.60 1.72 1.65 1.70 1.57 1.48 1.62 1.75 1.69 1.73 1.75 1.57 1.48 2.22 1.67 2.11 2.03 2.01 1.60

Femalereceiver

0.84

1.07

2.94

1.75 2.84

2.17 0.47 0.49

0.88 1.22 1.35 0.64 1.90 2.14 −0.14 1.44 1.00 1.22 1.21

Blackmatching 1.39 0.43 1.35 1.02 1.44 1.21 1.04 0.95 0.97 1.34 0.67 0.80 1.37 1.39 0.74 0.23 1.36 0.65 1.02 0.01 0.33 0.65 1.19 0.39 1.80 1.11 0.14 1.14 1.54 0.98 0.63 0.81 1.07 −0.37 0.45 0.83 0.49 0.66

0.82 1.13 −0.06 1.15 0.73 1.05 0.88 2.02 1.62 1.71 1.08 1.40 1.71 2.37

Asianmatching

Parameter estimates from all exponential random graph models

1.59

2.23

4.13

2.29

4.18

2.97

2.13 3.27

2.65

1.63 1.95

3.40 1.56 2.38 0.34

2.80

Indianmatching

1.00 0.91 0.32 1.08 0.54 0.71 0.54 0.72 1.02 0.87 0.62 0.76

0.74 0.55 1.17 1.35

2.28 1.57

2.02 1.02 1.55

1.56 0.68 0.71 1.77 1.49 2.11 1.00

1.72 0.71 0.58 1.47 1.53 1.28 1.13

Hispanicmatching 0.66 0.06 −0.11 0.66 0.36 0.40 0.48 0.36 0.30 0.42 0.59 0.59 0.23 0.86 1.03 0.31 0.43 0.88 0.21 0.36 0.06 0.47 0.56 0.26 0.21 0.32 −0.25 0.37 −0.08 0.20 0.34 0.11 0.19 0.40 −0.20 0.51 0.41 0.78 0.59 −0.10 0.23

Whitematching

−0.75

−0.78 −1.49 −1.05 −0.72

−1.89 −2.11

−0.42 −1.79 −1.22

−4.79

−1.35

−2.27 1.14 2.15

−0.79

−1.15 −1.13

−1.38 0.30 −0.60 −0.22 −1.12

84.79 −0.87

Asianmatching* reciprocity

1.14

1.36 −2.01 −1.10 −0.42 0.38 0.12 −1.33 1.78 0.28 −0.03 0.78

−1.53 −1.14 −0.55 −0.51 −0.93 −0.15 −0.62 0.37 −0.29 −0.42 0.22 −0.30 −0.48 −0.71 0.68 1.02 −0.87 −0.73 −0.61 1.55 0.17 0.11 −1.88 1.72 −1.35

Blackmatching* reciprocity

−1.60

−1.28

−0.85 −1.46

−0.34 −2.15

−1.85

Indianmatching* reciprocity

−0.38 −0.07 0.52 −0.82 −0.09 −1.02 −0.38 −0.17 −0.59 −0.56 0.09 −0.57

−1.31 −1.07 −1.45

−0.49

−2.17 0.47 0.62

−3.22 −0.92 0.07 −0.98 −0.94 −1.21 −0.96

Hispanicmatching* reciprocity 0.67 0.21 0.45 −0.14 −0.27 0.18 0.01 −0.43 0.04 −0.06 −0.07 −0.44 0.57 −0.66 −0.70 0.40 −0.20 −0.55 0.01 0.10 0.56 −0.46 −0.34 −0.37 0.60 −0.09 0.43 −0.34 0.23 0.13 −0.06 0.33 0.27 −0.16 0.51 −0.36 0.10 −0.50 −0.29 0.34 0.31

Whitematching* reciprocity

Lewis www.pnas.org/cgi/content/short/1308501110

15 of 16

2,564 2,411 3,627

N

Reciprocity

7.61 7.41 7.26

Density

−8.73 −8.31 −8.37 1.76 1.82 1.75

Femalereceiver 1.34 1.17 0.86

Asianmatching 1.05 0.94 1.16

Blackmatching

1.30

1.73

Indianmatching 0.28 1.27 −0.86

Hispanicmatching 0.29 0.13 −0.11

Whitematching −0.34 −0.45

−0.71

Blackmatching* reciprocity

−1.56

Asianmatching* reciprocity

1.15

Indianmatching* reciprocity 0.50 −1.09

Hispanicmatching* reciprocity

0.04 0.06 0.40

Whitematching* reciprocity

I estimated a separate model for each two-digit ZIP code region with more than 1,000 users represented in the study sample. Some parameters could not be estimated because of an absence of the relevant type of tie. All models converged except for the model for ZIP code region 01xxx, as evinced by the unusually high parameter estimate for Asian same-race reciprocity; results from this model were omitted from Fig. 2.

95 97 98

Twodigit ZIP code region

Table S2. Cont.

Table S3.

Numerical presentation of results visualized in the main text Constant

Treatment

Fig.

Subgroup

Coef.

SE

Coef.

SE

N

3A 3B 3B 3B 3B 3B 3B 3B 3B 3C 3C 3C 3C 3C 3D 3D 3D 3D 3D 4A 4A 4B 4B 4C 4C 4D 4D

Overall effect Female, account age 25–31 d Female, account age 17–24 d Female, account age 9–16 d Female, account age 1–8 d Male, account age 25–31 d Male, account age 17–24 d Male, account age 9–16 d Male, account age 1–8 d Asian female Black female Indian female Hispanic female White female Asian male Black male Indian male Hispanic male White male Received interracial tie previously Did not receive interracial tie previously Replied to treatment message Did not reply to treatment message Specific effect Generalized effect First week after treatment Second week after treatment

−2.273*** −3.373*** −3.778*** −2.511*** −3.075*** −1.299*** −1.361*** −0.997*** −1.398*** −2.558*** −2.098*** −2.590*** −1.622*** −4.438*** −1.424*** −0.363* −0.470 −0.237* −2.092*** −2.322*** −2.438*** −1.768*** −2.480*** −2.905*** −3.465*** −2.273*** −2.230***

0.035 0.126 0.135 0.125 0.095 0.092 0.067 0.062 0.058 0.276 0.180 0.700 0.154 0.093 0.211 0.153 0.406 0.107 0.042 0.101 0.042 0.036 0.041 0.130 0.141 0.035 0.037

0.317*** 0.572* 0.971*** 0.071 0.209 −0.038 0.296 −0.222 0.660*** 1.218** 0.723* 0.511 −0.302 −0.092 1.168** −0.014 0.470 0.292 0.396* 0.323 0.511*** 0.765*** −0.079 0.511** 0.099 0.317*** 0.157

0.093 0.282 0.275 0.291 0.202 0.399 0.257 0.258 0.184 0.392 0.331 0.960 0.255 0.250 0.425 0.316 0.727 0.242 0.169 0.175 0.127 0.140 0.122 0.177 0.197 0.093 0.103

30,495 3,103 4,523 4,479 4,916 2,193 3,203 3,343 4,735 682 747 56 932 14,604 190 365 55 672 12,192 4,999 22,256 15,926 26,873 6,138 6,138 30,495 30,495

All models were estimated using negative binomial regression. Results are presented here in their original logged form. In all models, treatment and control cases are matched exactly based on gender, racial background, and two-digit ZIP code region, and coarsely based on overall quantity of initiation messages sent during the control period (dichotomized as one or more messages compared with no messages), quantity of interracial initiation messages sent during the control period (dichotomized as one or more interracial messages compared with no messages), and account length (divided into four approximately equal periods based on whether users joined in the first, second, third, or fourth quarter of October). *P < 0.05; **P < 0.01; ***P < 0.001 (two-tailed tests).

Lewis www.pnas.org/cgi/content/short/1308501110

16 of 16

The limits of racial prejudice.

The racial segregation of romantic networks has been documented by social scientists for generations. However, because of limitations in available dat...
2MB Sizes 0 Downloads 0 Views