- Proceedings of the National Academy of Sciences of the United States of America
- Published almost 4 years ago
Using human evaluation of 100,000 words spread across 24 corpora in 10 languages diverse in origin and culture, we present evidence of a deep imprint of human sociality in language, observing that (i) the words of natural human language possess a universal positivity bias, (ii) the estimated emotional content of words is consistent between languages under translation, and (iii) this positivity bias is strongly independent of frequency of word use. Alongside these general regularities, we describe interlanguage variations in the emotional spectrum of languages that allow us to rank corpora. We also show how our word evaluations can be used to construct physical-like instruments for both real-time and offline measurement of the emotional content of large-scale texts.
Traditional fact checking by expert journalists cannot keep up with the enormous volume of information that is now generated online. Computational fact checking may significantly enhance our ability to evaluate the veracity of dubious information. Here we show that the complexities of human fact checking can be approximated quite well by finding the shortest path between concept nodes under properly defined semantic proximity metrics on knowledge graphs. Framed as a network problem this approach is feasible with efficient computational techniques. We evaluate this approach by examining tens of thousands of claims related to history, entertainment, geography, and biographical information using a public knowledge graph extracted from Wikipedia. Statements independently known to be true consistently receive higher support via our method than do false ones. These findings represent a significant step toward scalable computational fact-checking methods that may one day mitigate the spread of harmful misinformation.
Cardiac vagal tone (indexed via resting heart rate variability [HRV]) has been previously associated with superior executive functioning. Is HRV related to wiser reasoning and less biased judgments? Here we hypothesize that this will be the case when adopting a self-distanced (as opposed to a self-immersed) perspective, with self-distancing enabling individuals with higher HRV to overcome bias-promoting egocentric impulses and to reason wisely. However, higher HRV may not be associated with greater wisdom when adopting a self-immersed perspective. Participants were randomly assigned to reflect on societal issues from a self-distanced- or self-immersed perspective, with responses coded for reasoning quality. In a separate task, participants read about and evaluated a person performing morally ambiguous actions, with responses coded for dispositional vs. situational attributions. We simultaneously assessed resting cardiac recordings, obtaining six HRV indicators. As hypothesized, in the self-distanced condition, each HRV indicator was positively related to prevalence of wisdom-related reasoning (e.g., prevalence of recognition of limits of one’s knowledge, recognition that the world is in flux/change, consideration of others' opinions and search for an integration of these opinions) and to balanced vs. biased attributions (recognition of situational and dispositional factors vs. focus on dispositional factors alone). In contrast, there was no relationship between these variables in the self-immersed condition. We discuss implications for research on psychophysiology, cognition, and wisdom.
There would be little adaptive value in a complex communication system like human language if there were no ways to detect and correct problems. A systematic comparison of conversation in a broad sample of the world’s languages reveals a universal system for the real-time resolution of frequent breakdowns in communication. In a sample of 12 languages of 8 language families of varied typological profiles we find a system of ‘other-initiated repair’, where the recipient of an unclear message can signal trouble and the sender can repair the original message. We find that this system is frequently used (on average about once per 1.4 minutes in any language), and that it has detailed common properties, contrary to assumptions of radical cultural variation. Unrelated languages share the same three functionally distinct types of repair initiator for signalling problems and use them in the same kinds of contexts. People prefer to choose the type that is the most specific possible, a principle that minimizes cost both for the sender being asked to fix the problem and for the dyad as a social unit. Disruption to the conversation is kept to a minimum, with the two-utterance repair sequence being on average no longer that the single utterance which is being fixed. The findings, controlled for historical relationships, situation types and other dependencies, reveal the fundamentally cooperative nature of human communication and offer support for the pragmatic universals hypothesis: while languages may vary in the organization of grammar and meaning, key systems of language use may be largely similar across cultural groups. They also provide a fresh perspective on controversies about the core properties of language, by revealing a common infrastructure for social interaction which may be the universal bedrock upon which linguistic diversity rests.
To maximize survival and reproductive success, primates evolved the tendency to tell lies and the ability to accurately detect them. Despite the obvious advantage of detecting lies accurately, conscious judgments of veracity are only slightly more accurate than chance. However, findings in forensic psychology, neuroscience, and primatology suggest that lies can be accurately detected when less-conscious mental processes (as opposed to more-conscious mental processes) are used. We predicted that observing someone tell a lie would automatically activate cognitive concepts associated with deception, and observing someone tell the truth would activate concepts associated with truth. In two experiments, we demonstrated that indirect measures of deception detection are significantly more accurate than direct measures. These findings provide a new lens through which to reconsider old questions and approach new investigations of human lie detection.
The claim that Eskimo languages have words for different types of snow is well-known among the public, but has been greatly exaggerated through popularization and is therefore viewed with skepticism by many scholars of language. Despite the prominence of this claim, to our knowledge the line of reasoning behind it has not been tested broadly across languages. Here, we note that this reasoning is a special case of the more general view that language is shaped by the need for efficient communication, and we empirically test a variant of it against multiple sources of data, including library reference works, Twitter, and large digital collections of linguistic and meteorological data. Consistent with the hypothesis of efficient communication, we find that languages that use the same linguistic form for snow and ice tend to be spoken in warmer climates, and that this association appears to be mediated by lower communicative need to talk about snow and ice. Our results confirm that variation in semantic categories across languages may be traceable in part to local communicative needs. They suggest moreover that despite its awkward history, the topic of “words for snow” may play a useful role as an accessible instance of the principle that language supports efficient communication.
- Proceedings of the National Academy of Sciences of the United States of America
- Published about 4 years ago
Languages vary enormously in global importance because of historical, demographic, political, and technological forces. However, beyond simple measures of population and economic power, there has been no rigorous quantitative way to define the global influence of languages. Here we use the structure of the networks connecting multilingual speakers and translated texts, as expressed in book translations, multiple language editions of Wikipedia, and Twitter, to provide a concept of language importance that goes beyond simple economic or demographic measures. We find that the structure of these three global language networks (GLNs) is centered on English as a global hub and around a handful of intermediate hub languages, which include Spanish, German, French, Russian, Portuguese, and Chinese. We validate the measure of a language’s centrality in the three GLNs by showing that it exhibits a strong correlation with two independent measures of the number of famous people born in the countries associated with that language. These results suggest that the position of a language in the GLN contributes to the visibility of its speakers and the global popularity of the cultural content they produce.
The challenges in the development of a prophylactic HIV-1 vaccine are unprecedented in the history of vaccinology. But there are reasons for optimism, and indeed, several promising vaccine concepts are currently moving toward clinical efficacy trials.
In daily life, we frequently encounter false claims in the form of consumer advertisements, political propaganda, and rumors. Repetition may be one way that insidious misconceptions, such as the belief that vitamin C prevents the common cold, enter our knowledge base. Research on the illusory truth effect demonstrates that repeated statements are easier to process, and subsequently perceived to be more truthful, than new statements. The prevailing assumption in the literature has been that knowledge constrains this effect (i.e., repeating the statement “The Atlantic Ocean is the largest ocean on Earth” will not make you believe it). We tested this assumption using both normed estimates of knowledge and individuals' demonstrated knowledge on a postexperimental knowledge check (Experiment 1). Contrary to prior suppositions, illusory truth effects occurred even when participants knew better. Multinomial modeling demonstrated that participants sometimes rely on fluency even if knowledge is also available to them (Experiment 2). Thus, participants demonstrated knowledge neglect, or the failure to rely on stored knowledge, in the face of fluent processing experiences. (PsycINFO Database Record
How human language arose is a mystery in the evolution of Homo sapiens. Miyagawa et al. (2013) put forward a proposal, which we will call the Integration Hypothesis of human language evolution, that holds that human language is composed of two components, E for expressive, and L for lexical. Each component has an antecedent in nature: E as found, for example, in birdsong, and L in, for example, the alarm calls of monkeys. E and L integrated uniquely in humans to give rise to language. A challenge to the Integration Hypothesis is that while these non-human systems are finite-state in nature, human language is known to require characterization by a non-finite state grammar. Our claim is that E and L, taken separately, are in fact finite-state; when a grammatical process crosses the boundary between E and L, it gives rise to the non-finite state character of human language. We provide empirical evidence for the Integration Hypothesis by showing that certain processes found in contemporary languages that have been characterized as non-finite state in nature can in fact be shown to be finite-state. We also speculate on how human language actually arose in evolution through the lens of the Integration Hypothesis.