Human faces are one of the richest sources of social information in our environment (see e.g., Jack & Schyns, 2015; Jack & Schyns, 2017; Zebrowitz & Montepare, 2008 for reviews). From facial appearance alone, observers spontaneously (e.g., Klapper et al., 2016), implicitly (e.g., Swe et al., 2020), and readily (e.g., Willis & Todorov, 2006) infer important personal characteristics of others, such as how trustworthy or dominant they are. Though fleeting, such judgments can have significant downstream consequences, ranging from dating preferences (e.g., South Palomares & Young, 2018) to professional success (e.g., Menegatti et al., 2021) and voting choices (e.g., Joo et al., 2015). Given the central relevance of these judgments to human social life, a longstanding goal in the human behavioral sciences has been to understand which types of faces drive these perceptions.
Current influential models posit that fundamental social trait judgments, such as those of trustworthiness and dominance (Oosterhof & Todorov, 2008), are driven by specific facial features (e.g., Freeman & Ambady, 2011; Todorov & Oosterhof, 2011; Zebrowitz, 2017). For instance, smaller faces with upturned mouth corners, arched eyebrows, and a lighter skin tone are judged as more trustworthy (Jaeger et al., 2020; Said et al., 2009; Todorov & Oosterhof, 2011; Vernon et al., 2014; Zebrowitz & McDonald, 1991), while larger faces with a more prominent brow ridge and jaw, and a darker skin tone are judged as more dominant (Albert et al., 2021; Mileva et al., 2014; Todorov & Oosterhof, 2011; Zebrowitz et al., 2003). However, though human faces vary remarkably in both shape and skin tone (e.g., Farkas et al., 2005; Maddox, 2004), existing models of the facial features that drive social trait perception are based almost exclusively on White European faces (Cook & Over, 2021), which fundamentally limits their generalizability. For example, face ethnicity impacts social trait perception by biasing judgments of other-ethnicity faces towards ethnic stereotypes (e.g., Blair et al., 2002; Eberhardt et al., 2004; Eberhardt et al., 2006; Hutchings et al., 2024; Kleider-Offutt et al., 2018; Xie et al., 2021). However, because current models (e.g., Oosterhof & Todorov, 2008) do not represent ethnically diverse facial features, they cannot provide a causal explanation of which facial features drive these judgments. With mounting evidence now revealing the inherent limitations of WEIRD psychological science (see Cook & Over, 2021 for further discussion; see also e.g., Henrich et al., 2010; Jones et al., 2021; Rad et al., 2018 for related discussion), representing ethnic diversity is increasingly important both for theoretical accounts of human social perception and for developing interventions that aim to address inequality and bias in cross-ethnicity interactions.
Here, we address this critical knowledge gap by modelling the specific facial features that drive the perception of two key social traits—trustworthiness and dominance—from three face ethnicities—Black African, East Asian, and White European. We include these three broad ethnic groups because they are anthropometrically distinct in terms of skin tone and/or facial structure (e.g., Farkas et al., 2005) and implicated in cross-ethnicity social trait perception differences (e.g., Xie et al., 2021). To model the facial features, we used a high-fidelity 3D generative model of the human face (Yu et al., 2012; Zhan, Garrod, et al., 2019) combined with the classic psychophysical method of reverse correlation used in ethology (e.g., Tinbergen, 1948), vision science (e.g., Mangini & Biederman, 2004), neuroscience (e.g., Hubel & Wiesel, 1959; Nestor et al., 2016; Zhan, Ince, et al., 2019), engineering (e.g., Thompson et al., 1999; Volterra, 1930), and human social trait perception (Jack & Schyns, 2017). Figure 1 illustrates the approach.
On each experimental trial, we generated a novel 3D face identity using a high-fidelity generative model of the human face that is based on high-resolution real-world 3D captures (see Yu et al., 2012; Zhan, Garrod, et al., 2019 for more details). Specifically, the generative model randomly samples, for 3D face shape and 2D complexion separately, weights for 402 principal components that capture and control the natural facial feature variance associated with individual identities (henceforth called ‘identity components’). For example, in Figure 1A, red color-coding shows the 3D face shape features that deviate outward from the generative model average (e.g., a more prominent brow) and blue color-coding shows features that deviate inward from the average (e.g., a smaller nose). In the generative model, these randomly weighted identity components are then added to the average face for a given ethnicity, sex, and age—in Figure 1A, the three faces below show the results of adding the same identity components to the average face for a Black African (BA), East Asian (EA), or White European (WE) male aged 25 years (see Methods—Stimulus generation). Thus, the randomly weighted identity components precisely control how the facial features of the stimulus change on each trial.
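The core operation of such a generative model can be sketched as follows. This is an illustrative simplification, not the authors' implementation: the function names, array shapes, and standard-normal sampling are assumptions, and the real model handles shape and complexion components separately with their own scalings.

```python
import numpy as np

N_COMPONENTS = 402  # identity components (per the generative model described above)

def sample_identity(rng, n_components=N_COMPONENTS):
    """Randomly sample weights for the identity components.

    Standard-normal sampling is an assumption for illustration; the actual
    model samples within the natural variance captured by each component.
    """
    return rng.standard_normal(n_components)

def render_face(base_face, components, weights):
    """Add randomly weighted identity components to an average base face.

    base_face:  (n_vertices, 3) average 3D shape for a given ethnicity/sex/age
    components: (n_components, n_vertices, 3) principal components of shape
    weights:    (n_components,) sampled identity-component weights
    """
    # Weighted sum of components, contracted over the component axis
    return base_face + np.tensordot(weights, components, axes=1)
```

Because the identity components are added to an ethnicity-specific average face, applying the same weights to two different base faces yields stimuli that differ only by the difference between the base faces—the property the design below exploits.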
In a between-subjects design, observers (N = 60, White Western European, sex-balanced; see Methods—Observers) rated each resulting face stimulus according to perceived trustworthiness or dominance using a 7-point bipolar scale (e.g., ‘very untrustworthy’ to ‘very trustworthy’ with ‘neutral’ as the mid-point) in separate blocks randomly ordered across the experiment (see Methods—Task procedure)—in Figure 1B, the observer rated this face as ‘somewhat trustworthy’ (see red box). Each observer completed 2,400 trials per social trait rating task (n = 20 observers per face ethnicity), with stimulus sex blocked and randomized across the experiment for each observer. Importantly, to directly compare whether and how face ethnicity changes the facial features that drive social trait perception, we used identical facial feature variations across all experimental conditions. That is, we used the exact same 2,400 randomly generated identity components (1,200 per stimulus sex) in each of the three face ethnicity conditions, for each observer and each rating task. Thus, across all face ethnicity conditions, the faces had the same age, sex, and random identity components, and differed only according to the ethnicity of the average base face (see also SM—Expressivity of generative face model).
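The key design constraint—identical identity components across all three face ethnicity conditions—amounts to sampling the trial weights once and reusing them in every condition. A minimal sketch, assuming a fixed seed and standard-normal weights (both hypothetical):

```python
import numpy as np

N_TRIALS = 2400       # per observer, per rating task
N_PER_SEX = 1200      # trials per stimulus sex
N_COMPONENTS = 402    # identity components

# Sample the identity-component weights ONCE (seed is a hypothetical choice)
rng = np.random.default_rng(seed=123)
trial_weights = rng.standard_normal((N_TRIALS, N_COMPONENTS))

# The SAME weight matrix is reused in every face ethnicity condition;
# only the average base face (BA, EA, or WE) changes between conditions.
conditions = {ethnicity: trial_weights for ethnicity in ("BA", "EA", "WE")}
```

This guarantees that any difference in ratings between conditions is attributable to the base face (ethnicity), not to the random feature variations.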
Following the experiment, we modelled the specific facial features that drive the perception of trustworthiness and dominance in each individual observer in each face ethnicity condition. Specifically, we measured the statistical relationship between the randomly generated identity components used on each trial and the observer’s corresponding social trait ratings, using linear regression (see Methods—Modelling procedure). Figure 1C illustrates this procedure with example trustworthiness ratings from one observer viewing Black African male faces. This analysis produced a quantitative 3D face model for each individual observer, face ethnicity, social trait, and stimulus sex—in Figure 1C, the color-coded faces show the results for 3D shape (i.e., deviations in Cartesian space) and 2D complexion (i.e., differences in each of the three L*a*b color channels; e.g., see Weatherall & Coombs, 1992) separately (see SM—Model visualization). We thus produced a total of 240 face models (20 observers × 3 face ethnicities × 2 social traits × 2 stimulus sexes), which we validated using leave-one-out cross-validation prior to further analyses (see Methods—Model validation). Figure 1D shows examples of the resulting validated 3D face models for trustworthy, untrustworthy, dominant, and submissive male faces, from one representative observer in each face ethnicity condition.
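The per-observer modelling step—reverse correlation via linear regression—can be sketched as an ordinary least-squares fit of trial ratings on trial identity-component weights. This is a simplified illustration of the general technique, not the authors' exact pipeline (which fits shape and complexion separately and validates each model):

```python
import numpy as np

def fit_face_model(weights, ratings):
    """Reverse correlation as linear regression.

    weights: (n_trials, n_components) identity-component weights shown per trial
    ratings: (n_trials,) one observer's trait ratings (e.g., on the 7-point scale)

    Returns (n_components,) regression coefficients: how strongly each identity
    component pushed this observer's judgment, i.e., the observer's 'face model'.
    """
    # Prepend an intercept column, then solve by least squares
    X = np.column_stack([np.ones(len(ratings)), weights])
    coef, *_ = np.linalg.lstsq(X, ratings, rcond=None)
    return coef[1:]  # drop the intercept; one coefficient per identity component
```

Projecting the fitted coefficients back through the generative model then renders the trait-diagnostic face (e.g., the 'trustworthy' end of the scale) for that observer and condition.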
Our data-driven approach provides several advantages. First, by agnostically generating facial features from a high-fidelity model of the human face, we can model those that drive social trait perception in individual observers without constraints or biases imposed by prior assumptions (see Jack & Schyns, 2017 for further discussion). Second, by using the exact same randomly generated identity components in each face ethnicity condition, we can isolate how face ethnicity influences the specific facial features that observers use to make each social trait judgment. Third, our per-observer analyses preserve individual variation rather than erasing it, as traditional averaging approaches can. This in turn enables effects to be replicated across the N observers in the tested sample (Ince et al., 2022) and thus provides an estimate of the prevalence of these effects in the sampled population (Donhauser et al., 2018; Ince et al., 2021; see also Methods—Population prevalence).
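The prevalence logic can be illustrated with the maximum-likelihood point estimate used in this literature: if k of n observers show a significant within-observer effect at false-positive rate alpha, the estimated proportion of the population with a true effect corrects k/n for the expected false positives. A minimal sketch (the cited work also derives Bayesian credible intervals, omitted here):

```python
def prevalence_mle(k, n, alpha=0.05):
    """Maximum-likelihood estimate of population prevalence.

    k:     number of observers showing a significant within-observer effect
    n:     total number of observers tested
    alpha: within-observer false-positive rate of the significance test

    Corrects the observed proportion k/n for the alpha*100% of observers
    expected to be significant by chance even with no true effect.
    """
    theta = k / n
    return min(1.0, max(0.0, (theta - alpha) / (1.0 - alpha)))
```

For example, 20 of 20 significant observers gives an estimated prevalence of 1.0, whereas 1 of 20 at alpha = 0.05 is exactly the chance rate and gives 0.0.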