<<

Downloaded by guest on October 4, 2021 Osnabr language about uptake. reasoning listener’s probabilistic the encodes complemented that are module pragmatic semantics a the by the as when well as truth- model, just the data prototype-based production that the show explains model comparison conditional model empirical representation. and numerosity account and Statistical into salience as take such also factors, models two cognitive These the meaning. on linguistic We based study on models prototypes. views production computational via compare large-scale understood and are a formulate quantity words from quantity of that Data lan- use suggest human thought. the to and fundamental consider are guage which We “all”), “some,” use. (e.g., words linguistic language connecting function and link meaning a as models Hence, prob- pragmatic applying use. by abilistic empirically language theories semantic of these compare aspects we direct makes quantitative theories about semantic predictions abstract pro- these around of revolves true. and Neither rigid is gradient totypes. too inherently is it is view the which meaning truth-conditional that this under equates and that conditions argued 2020) been the 23, has March to and it review But sentence for philosophy (received a 2021 in 22, of January meaning approved view and MA, influential Amherst, Amherst, An at Massachusetts of University Partee, H. Barbara by Edited and Germany; Berlin, 10117 Sprachwissenschaft, Allgemeine Leibniz-Zentrum Pragmatics, a Tiel van quantification Bob language natural in focality and gradience explains pragmatics Probabilistic hs rbblsi rgai oest etsmni theories PNAS semantic test leverage to we models Here, pragmatic (9–13). probabilistic use these language about of predictions precise aspects make quantitative that models Amsterdam”) probabilistic within in is “He (8). (e.g., disjunction disjunct the true than that is rather disjunct utter which knows the simply who In speaker would rational goals. a hand, conversational at certain case reaching are toward speakers that tributions idea the on of based theory Grice “He , a (e.g., such outlined explain true (7) To is Berlin”). disjunct or which Amsterdam in unsure is is speaker to the tend that also “or” imply containing logi- utterances expresses However, “or” disjunction. that cal hold semanticists For formal perspective. many truth-conditional example, the from especially use, guage This prototypes. to similarity on called is depends approach a and itself—is degree truth of evidence pres- matter consequently as exemplar The construed membership—and an prototypes. sometimes category is of to that structure similarity typicality prototype on The of depends ence (6). and category gradient a is of features tant cate- that around shows built that are research gories psychological upon drawing 5), (4, truth- the meanings how the from out (3). up spell constituents built its is formally of sentence to a of theory meaning meaning set conditional of to and field approach logic the truth-conditional from to This circumstances rise false. which given in or has know true to is is means it sentence a what know A odr nttt o ri,CgiinadBhvor abu nvriyNjee,62 J imgn h Netherlands; The Nijmegen, AJ, 6525 Nijmegen, University Radboud Behaviour, and Cognition Brain, for Institute Donders eety hstp fGienraoighsbe formalized been has reasoning Gricean of type this Recently, lan- and meaning linguistic between gap important an is There approach truth-conditional the criticized have authors Other nuniltaiini hlspyeutstemeaning its the to equates sentence philosophy a in of tradition influential n 01Vl 1 o e2005453118 9 No. 118 Vol. 2021 c,Germany uck, ¨ | quantifiers a,b,1 rttp Theory Prototype ihe Franke Michael , | semantics prototypes pragmatics rt conditions truth hc sstools uses which semantics, formal | pragmatics hteepiytems impor- most the exemplify that oncigmaigaduse and meaning connecting (PT). .. ertercon- their gear i.e., rational, c n l Sauerland Uli and , | 1 ) codnl,to Accordingly, 2). (1, rbblsi reasoning probabilistic b ulse eray2,2021. 22, February Published doi:10.1073/pnas.2005453118/-/DCSupplemental at online supporting contains article This 1 the under Published Submission.y Direct PNAS the a is wrote article This U.S. interest.y and competing M.F., no declare B.v.T., authors and The data; per- analyzed U.S. M.F. and M.F., paper. and y B.v.T., B.v.T. research; designed research; U.S. formed and M.F., B.v.T., contributions: Author i.e., • common, are: in definitions element GQ-theoretic of one examples least at have sets the that this here—express that tion with ones to the concerned of are According all we words—and terms. quantity many specifically, express set-theoretic More statements in quantified words approach, quantity of GQT. the of develop- centerpieces the the to of leading of ment meaning, one to the that approach is Indeed, truth-conditional cognition 15). quantification of (14, of human number communication treatment and of the frequency about underpins aspect information and crucial generalization a supports is Quantification Words Quantity the on focus we mind, in of goal case this specific With data. empirical on based • • hoeia ucs,epann aiu tews puzzling otherwise various explaining success, theoretical size • owo orsodnemyb drse.Eal [email protected] Email: addressed. be may correspondence whom To ela oe htecdspooye notemaigof meaning the into strict prototypes words. encodes a quantity as that just embeds model data a that the as well explains model meaning of pragmatic notion truth-conditional probabilistic show a we But comparison, that model prototypes. empirical via and sug- statistical understood on to are based seem words study quantity “some,” production that (e.g., large-scale gest words a from quantity Data of “all”). the case through consider We important theories modeling. historically pragmatic semantic probabilistic abstract of lens test pro- the we empirically Here, apparent. to always not pose is empirical use and language pre- from ideas data theoretical the However, abstract between language. relation of cise aspects suc- key that explain structures cessfully abstract postulates linguistics Theoretical Significance “all”: “no”: all”: “not “most”: o ovnec,w ilrfrto refer will we convenience, For n to and ewe w es(8.Freape Sm r ”means P” are S “Some example, For (18). sets two between Q omlzsa nutv praht h meaning the to approach intuitive an formalizes GQT |S |S eeaie unie Theory Quantifier Generalized c nttt fCgiieSine Osnabr Science, Cognitive of Institute |S ∩ ∩ | |S S P P ∩ S | | ≥ | ∩ ≤ | NSlicense.y PNAS P uha sm”ad“all.” and “some” as such words, quantity sthe as and P | 0 > | S < | |S P | Q a noe immense enjoyed has GQT size. set total S eoe ytesbetadpredicate, and subject the by denoted , − https://doi.org/10.1073/pnas.2005453118 | P | . y |S b eateto eatc and Semantics of Department https://www.pnas.org/lookup/suppl/ GT 1,17). (16, (GQT) ∩ thresholds P c nvriy 49069 University, uck ¨ eain ewe sets. between relations | sthe as |S ∩ nthe on nescinset intersection P ≥ | Other 1. intersec- | f6 of 1

PSYCHOLOGICAL AND COGNITIVE SCIENCES observations about, e.g., the cognitive complexity of quantity vals of their GQT meanings. It has been argued that this type of words (19), their order of acquisition (20), the aptitude of gradience and focality is fundamentally incongruous with GQT people’s reasoning with quantified sentences (21), and the dis- (25–27)—or even with a bivalent truth-conditional approach to tribution of negative polarity items like “any” and “ever” (22). At meaning more generally (4, 5). Instead, it has been argued that the same , however, it has been observed that there is no quantity words have a prototype-based semantics (25, 27); e.g., straightforward connection between the set-theoretic meanings the prototype for “most” may be situated at about 75% of the postulated by GQT and the way people use quantity words, as total set size, with the typicality of a situation decreasing with the we will show presently. distance from that prototype.

Using Quantity Words. We conducted a large-scale study on the Outline. The goal of this paper is twofold. First, we show how production of quantity words in English ( [Exp.] 1a). abstract semantic theories can be compared empirically with Each of 600 participants described 10 displays showing 432 cir- data-driven statistical modeling. Second, we show that GQT’s cles, which were either red or black. To describe these displays, truth-conditional account of quantity-word meaning offers an participants freely completed the sentence frame “— of the cir- equally compelling account of quantity-word production as a cles are red.” Each display varied the intersection set size, i.e., semantics based on prototypes, but only when it is complemented the number of red circles. To elicit quantity words that are by a probabilistic theory of language use. potentially vague and thus relevant to our purposes, the dis- The next section formalizes GQ-theoretic and PT-based plays had a large total set size, thereby likely triggering only semantics of quantity words. We then describe four speaker approximate representations of the true intersection set size. models that make probabilistic predictions about quantity-word Additionally, experimental instructions discouraged the use of choices as a function of the underlying semantic theory. numerical expressions like “fifty-two.” Fig. 1 A and B visualizes the production probabilities of the Semantics of Quantity Words 15 most frequently produced quantity words, all of which were A lexical meaning function L: M × T → [0, 1] maps each pair of produced 50 or more, as well as “all” and “none.” “All” quantity word m and state t to a truth value in the unit interval. and “none” were included because they are crucial for commu- Participants in Exp. 1 were presented with displays consisting of nication and historically salient, even though they were not very 432 circles, which were either red or black. There were thus 433 frequently produced in the experiment for the obvious reason possible intersection set sizes T = {t0, ... , t432}, i.e., relevant that displays with uniformly colored dots occurred infrequently. states of the world. Taken together, these 17 quantity words made up 87% of the production data. GQT Semantics. According to GQT, many quantity words—and In line with earlier interpretation studies (23, 24), the results all of the ones that we are interested in—express thresholds of Exp. 1 suggest that the ranges in which quantity words are on the intersection set size |S ∩ P| (18). Whether a quantity produced lack clear boundaries and peak in restricted subinter- word expresses a lower bound, an upper bound, or both can be

(other) A B Training data (Exp. 1a) Test data (Exp. 1b) a few 1.00 a lot about half all all almost all 0.75 almost all few most half majority 0.50 hardly any less than half a lot majority many more than half 0.25 more than half

many Production probability most about half none 0.00 several 0 100 200 300 400 0 100 200 300 400 half some several Number of red circles very few some C GQ−lit PT−lit GQ−prag PT−prag less than half 1.00

few 0.75 a few 0.50 very few 0.25 hardly any none 0.00 0 100 200 300 400 0 100 200 300 400 0 100 200 300 400 0 100 200 300 400 0 100 200 300 400 Production probability Number of red circles Number of red circles

Fig. 1. (A) Density plot showing the distribution of each quantity word over intersection set sizes (Exp. 1a). The height of the distribution indicates the production frequency. (B) Bar plot showing the production probabilities in each bin of intersection set sizes. Each bar shows which quantity words were produced in that bin and their production probability. (C) Bar plot showing the predicted production probabilities for each of the four computational models.

2 of 6 | PNAS van Tiel et al. https://doi.org/10.1073/pnas.2005453118 Probabilistic pragmatics explains gradience and focality in natural language quantification Downloaded by guest on October 4, 2021 Downloaded by guest on October 4, 2021 sueta ere ftuhdces xoetal ihthe with We exponentially truth. decrease of truth degrees of on degrees prototype is that the word from assume quantity factor distance the scaling of which a effect in to and size sizes true, set word maximally intersection set the quantity intersection is Each which from truth. functions of degrees with words quantity value any take may fuzzy sentence false, a of or value true in truth either the be that holds must logic sentences that using assumes quantification is logic on meaning PT- a linguistic view formalize that gra- based we holds is Consequently, it prototypes. truth around and that organized binary, assumes than It rather respects: dient two in meaning linguistic Semantics. PT over distributions prior meanings (See informative (30). likely weakly from thresholds the as expec- inferred about words priori be theory quantity a of linguistic to encode by to are inference informed which them Bayesian tations model, used treated We our but fix data. in advance, not the or variables in did imprecise words we free an Therefore, quantity as where (29). of (28), applied thresholds “all” be the of could def- case reading textbook the the loose from in the differ even about often initions, intuitions words people’s quantity of that meanings shown has research Previous threshold size set section L few” “very increasing. words and monotone quantity as other “none,” classified all were half,” decreasing; monotone than as classified “less were any,” criteria, these increas- “hardly Following (monotone “few,” decreasing). “all” (monotone with “none” classified or clustered were ing) words they quantity whether more otherwise, on accepted time; based was the argument of neither 50% than if clas- nonmonotone were words as Quantity above. sified inference shown of those validity to the similar judged patterns participants 120 2): empiri- 17 (Exp. in data the assumptions cal these of ground We properties words. monotonicity quantity relevant the about quantity assumptions make the nonmonotone. of as none perceived was However, sample our straightforward. in words quantity less monotone- relation nonmonotone is The for bound; words bound. meaning lower upper and an monotonicity a between place Monotone- words place size. quantity set words decreasing intersection quantity the on increasing places it that threshold smntn eraig ssonb h neec patterns • inference the • by shown “no” as and neither below. increasing, decreasing, license monotone monotone is words “all” is sets Thus, quantity inference. from nonmonotone of type words and quantity subsets; to monotone-decreasing supersets; sets to from inferences license words quantity Monotone-increasing oeincreasing, tone its i.e., predicate, its in monotonicity potential inferential the from judged rbblsi rgaisepan rdec n oaiyi aua agaequantification language natural in focality and gradience explains pragmatics Probabilistic al. et Tiel van GQ ogetaefish. ate guest No salmon. ate guests All etu nrdc rttp lexicon prototype a introduce thus We lexicon GQ-theoretic a introduce we results, these on Based to need we 1, Fig. in shown patterns production the model To of type the determines word quantity a of monotonicity The uniywrsi aua agaeaeeither are language natural in words Quantity 0 1] [0, L (m GQ (m , 2,31). (26, t hc sin ahpi fqatt word quantity of pair each assigns which ), θ , m t = ) hne ipy “monotonicity”). simply, (hence, eoe yta uniyword: quantity that by denoted Tdfesfo h rt-odtoa prahto approach truth-conditional the from differs PT t      h au n tu)o eo(as) ae nthe on based (false), zero or (true) one value the or decreasing, monotone 0 1 1 → IAppendix SI otherwise. if if ogetaesalmon. ate guest No m m → smntn eraig& decreasing monotone is & increasing monotone is l ussaefish. ate guests All o oedetails.) more for hr classical Where logic. fuzzy d m m hc ouae the modulates which , a prototype a has L nonmonotone PT htassociates that m t t n inter- and ≥ ≤ θ θ mono- m m (16). right p m , udai itnefo h rttp,s httetuhvalue truth the that so word quantity prototype, a the of from distance quadratic omn iheules,w nld the had experiment include the in we Participants factor. ease, additional an equal as words with mind false to over true those for which over of preference value descriptions a for truth prototypical. i.e., preference higher a value, or a truth descriptions when lower with that, a assume descriptions to with give is to size 1 set Exp. fer intersection like with task picture a a describing in participants of ior Speakers. ceteris Literal produced, be to in likely factor numerosity. more also therefore We paribus. are and semantics. easily more quan- some PT-based are that words possibility and tity the pragmatic includes GQ-theoretic additionally and model literal with Each both models combining by speaker models four both compare are that utterances produce to are that The utterances produces functions. speaker probabilistic behavioral observed the in 35). generating (34, in behavior choices role under- observable causal adequate their most the on the based from infer to semantics able be lying on will Based we function. link models, empirical these extendable 10) an (9, such models as pragmatic serve probabilistic can recent that here assumptions. argue semantic of We of theo- likelihood sets semantic the different under about various choices pragmatic predictions production embed a probabilistic can produce is that to then, 33). ries use needed, (32, language is directly of What observed theory use. users’ be language cannot language people’s meanings in of these part usually, seman- However, are by postulated theories meanings the tic that assumed commonly is It Words Quantity of Production the Modeling lexica. of types two the in word 2A weakly Fig. on priors. based inference informative Bayesian data-driven in variables free prototypes before, As oeicesn)qatt odi Q n PT. false. and or true and GQT is ing, word in quantity the word when indicating quantity increasing) tone 2. Fig. nefcso prxmt ubrrpeetto ntepouto fthe of production the in on visualized words representation quantity number approximate of effects in A B 0 0 oas aero o h data o l ecitoscome descriptions all not that idea the for room make also To of kinds two introduce therefore will we following, the In iulzto ftehpteie eia ennso (mono- a of meanings lexical hypothesized the of Visualization (A) d GQT +ANS stesra around spread the is GQT L PT aua trigpitfrmdln h behav- the modeling for point starting natural A m (m salient Intersection setsize o nescinstsize set intersection for p , t m = ) A. n cln factors scaling and iulzto fteefc ffactoring of effect the of Visualization (B) p. hnohr,ie,cm omn more mind to come i.e., others, than imprecision iulzstemaigo quantity a of meaning the visualizes exp |S| |S| The true.

https://doi.org/10.1073/pnas.2005453118 p|S|0p |S|0p −  true t literal pragmatic − d d nterpeetto of representation the in and m p m PT +ANS p salience θ d ewill We informative.  stepooyemean- prototype the is pae preferably speaker t m t 2 satrsodvalue threshold a is atcpnspre- participants , PT is: ! ilb rae as treated be will pae prefers speaker . PNAS fquantity of t smore is | f6 of 3

PSYCHOLOGICAL AND COGNITIVE SCIENCES marked preferences for some quantity words over others. For The evaluatively neutral nature of our experimental task example, “most,” “the majority,” and “more than half” are often makes it intuitively unlikely that factors such as argumentativity viewed as truth-conditionally equivalent, despite having different played an important role in our production study. conditions of use. Nonetheless, participants used “most” much more often than “more than half” or “the majority.” We assume Imprecise Number Representation. Participants in Exp. 1 were not that this difference in frequency is due to the fact that “most” told how many circles were red, but, rather, had to estimate the was more salient, i.e., accessible in the task at hand. (See SI intersection set size based on the displays. The cognitive module Appendix for further discussion.) To capture effects of differen- used to estimate the numerosity of large sets is the Approxi- tial salience, we paired each quantity word with a salience value mate Number System (ANS) (38). The accuracy of ANS estimates PSal(m), which is treated as a free variable. decreases as the numerosity to be estimated increases. This effect Combining truth-based and salience-based production prefer- is a corollary of Weber’s law, which states that the probability of ences, we consider a simple model of a literal speaker, which confusing two stimuli increases with their relative, rather than serves as a baseline for later comparison: absolute, difference (39). In order to model the accuracy of participants’ estimates of PS (m | t, L) ∝ PSal(m) L(m, t). the intersection set size t, we consider the following simple def- lit 0 inition of the confusion probability PCf(t | t) of representing 0 Pragmatic Speakers. One of the central insights of modern prag- the true intersection set size t as t . Since the visual displays in 0 matics is that speakers’ behavior can be understood, at least Exp. 1 were upper-bounded, PCf(t | t) is defined as the prod- 0 in substantial part, as optimal or near-optimal goal-directed uct of the probability PANS(t | t) of maintaining an approximate 0 action for the purpose of communicating information (7). On representation of the number t as t and the inverse proba- 0 this assumption, speakers are predicted to preferably produce bility PANS(432 − t | 432 − t ). These probabilities, in turn, are utterances not just based on how true they are, but also based on specified following common assumptions in the literature on how likely they are to receive the intended interpretation (9, 10). imprecise representation of number: Consequently, a pragmatic speaker produces utterances propor- 0 0 0 tionally to how likely they are to convey the intended meaning PCf(t | t) ∝ PANS(t | t) PANS(432 − t | 432 − t) to a literally interpreting listener (L ), while also factoring in Z t0+0.5 lit 0 salience as before: PANS(t | t) = Gaussian(x, µ = t, σ = w t)dx. t0−0.5 P (m | t, L) ∝ P (m) P (t | m, L)α, Sprag Sal Llit where The parameter w stands for Weber’s fraction, which represents participants’ sensitivity to relative differences between intersec- PLlit (t | m, L) ∝ L(t, m). tion set sizes. To prevent the possibility that different production This model implies that speakers prefer, all else equal, 1) true models leverage the Weber fraction’s power as a free parameter descriptions over false ones, 2) informative descriptions over unduly, we collected independent data to estimate w based on less informative ones, and 3) more salient descriptions over the same kind of stimuli we used in Exp. 1. To this end, Exp. 3 less salient ones. The higher the “rationality parameter” α, asked 20 participants to estimate the proportion of red circles in the more pronounced is the preference for the optimization of the displays used in Exp. 1. Each participant provided estimates communicative success. (See SI Appendix for more details.) for 24 displays using a continuous slider. Based on the results The pragmatic speaker rule is rooted in standard Gricean of this experiment, we determined the maximum-likelihood esti- pragmatic theory, but also goes beyond. The preference for mate of the Weber fraction, resulting in wˆ = 0.576, and used this informative utterances is reminiscent of how scalar inferences, value for all production models reported here. (See SI Appendix such as the inference from “some” to “some but not all,” are for more information.) usually explained (36). The central idea underlying scalar infer- Adding imprecise number representation to the previously ences is that words that stand in an entailment relation may defined speaker models helps explain gradience in language use. form lexical scales. For example, “all” unilaterally entails “some,” There are certainly other factors that generate gradience (29), which delivers the lexical scale hsome, alli. Since efficient speak- but adding imprecise number representation is motivated given ers are expected to provide as much information as relevant, an the stimuli used in Exp. 1 and sufficient to capture probabilis- utterance with the weaker scalar word “some” may imply that tic gradience in production behavior. If PS (m | t, L) is a speaker the speaker believes that the corresponding sentence with the production rule, either literal or pragmatic, the production prob- stronger scalar word “all” is false. abilities under an approximate representation of the true world The pragmatic speaker model formulated here generalizes this state are: type of reasoning by assuming that words may compete with each CF X 0 other in spontaneous language use, even if they do not stand PS (m | t, L) ∝ PCf(t | t) PS (m | t, L). in a proper entailment relation. For example, consider “some” t0∈T and “few.” These two quantity words do not stand in an entail- Fig. 2B illustrates the effect of factoring in approximate num- ment relation because both may be true when the other is false ber representation on the production of the quantity words. The (e.g., “few” is true, but “some” is false when t is zero; and ANS component asymmetrically increases the amount of gradi- “some” is, true but “few” is false when t equals the total set size). ence present in the semantics, reflecting the fact that estimates Nonetheless, we propose that pragmatic speakers may, for some of the intersection set size are less accurate in the middle range intersection set sizes t, consider both “few” and “some” as viable than around the two extremes. Note that, implemented in this utterances and that they prefer whichever utterance promises to way, speakers factor in effects of imprecise number represen- be more reliable in communicating t. tation in an essentially egocentric way, i.e., they do not take Here, we abstract away from other considerations that may into consideration potential biases in the listener’s estimates of influence speakers’ choice of quantity words. For example, numerosity. “some” and “few” also differ in their argumentative direction, as shown by the following felicity pattern (37): Model Comparison • Some people liked the food, which is {good / ∗bad}. All four models were implemented in Stan (40) to obtain sam- • Few people liked the food, which is {∗good / bad}. ples from the posterior distribution over free parameter values

4 of 6 | PNAS van Tiel et al. https://doi.org/10.1073/pnas.2005453118 Probabilistic pragmatics explains gradience and focality in natural language quantification Downloaded by guest on October 4, 2021 Downloaded by guest on October 4, 2021 oe o-i Ep b aig(x.4 5 CI 95% 4) (Exp. Rating 1b) (Exp. Log-lik Model results comparison Model inherently 1. Table is meaning that (24–26). prototypes fact around the criticized organized ignoring and been gradient and has rigid bivalence, approach, overly of truth-conditional as assumption this its However, specifically false. true and is is it it when when knowing that involves and argue means sentence linguists a and what of knowing philosophers nature Many the about meaning. debate linguistic long-standing a addresses paper This more Discussion General a as words. focality compelling quantity and of gradience as semantics the encodes least into directly at that a competitor is flexible with model combined GQT-based when prag- the matic words Indeed, communication. quantity rational of of theory probabilistic production the focal- and in gradience of ity patterns of account compelling a provides as rated details.) not for were (β data predictions GQ-prag the −1.81, whose The than one rated. worse quantity only were the significantly the lower models much was four from how the model words shows by quantity 1 the produced Table for words 74.6. scale) was 100 to data 0 the a (on for rating distribution for predictive average 1a posterior Exp. models’ of four data the production from the dots from size the sampled set of either intersection “Q were given form 4). a the for slider of (Exp. red” a descriptions are adjusted of model and adequacy 1 each the Exp. judge in of to used predictions displays the the saw Participants evaluate PT-lit, to than fit ticipants better best order. that substantially in the GQ-lit, data, ultimately, offered and, production (PT-prag), (GQ-prag) PT-pragmatic new GQ-pragmatic the that to find We rbblsi rgaisepan rdec n oaiyi aua agaequantification language natural in focality and gradience explains pragmatics Probabilistic al. et Tiel van with (Rating) models trained observed by predicted words CI). (95% words quantity CIs quantity for bootstrapped and ratings data between the difference in of mean and lik) PT-prag GQ-prag PT-lit GQ-lit the D reports 1 Table ticipants. (see 1a Exp. from data on conditioned oprsn elo thwwl ahmdl fe en trained data being are after model, models data each model on the well statistical how once at proper look For disappear we module. comparison, to pragmatic seems a dif- with fit the enriched model However, than worse in model. substantially (PT-lit) ference is but PT-literal 1a, corresponding Exp. the approx- from fair data a the offers of (GQ-lit) imation GQ-literal that suggests figure The (See used. models we Bayesian specification mod- the prior Appendix speaking, the informative strictly Therefore, the are, well. including models) here as PT compared for different els factors slightly scaling necessarily and are prototypes for and (thresholds models parameters semantic GQ the on semantic priors between the differences structures, conceptual the given that, note We test ae oehr hs eut hwta h Q-ae model GQT-based the that show results these together, Taken par- 200 asked we comparison, model empirical additional For otro xetdlglklho ftets aafo x.1 (Log- 1b Exp. from data test the of log-likelihood expected Posterior i.1C Fig. o model for D test P Z D .7 l te models: other all 0.07; = o oediscussion). more for hw the shows P rmEp b hc elctdEp awt 0 par- 200 with 1a Exp. replicated which 1b, Exp. from train (Θ rmEp a sal opeitidpnettest independent predict to able is 1a, Exp. from M M −1,675 −1,625 −1,660 −1,717 | ihisfe parameters free its with D otro rdciedistribution predictive posterior train , M log ) otro xetdlog-likelihood expected posterior P (D P .1(−2.37, 0.14) (−1.82, 0.00) (−1.97, (−3.30, −1.41 −0.77 −0.99 −2.25 = test < IAppendix SI t −0.82, .2.(See 0.02). | h uniywrsQ words quantity The . Θ Θ M M , M hge sbetter): is (higher SE ) ftemodels. the of dΘ 0.45, = IAppendix SI o details). for M . t −0.41) −1.30) The . t t or SI of = eevdehclapoa rmteIBo h emnLnusisSociety Linguistics German the experiments of other IRB All the Non- (10833). from Stanford the approval (IRB) from ethical Board approval received Review ethical Institutional received 1a Medical Exp. modeling. and Appendix SI Methods and Materials theory. linguistic established to directly of speak bridging models that cognitive strong use integrated language a build to be work future can in conclude instrument modeling We pragmatic ANS. non- the probabilistic include as to that such enriched components, flexibly and cognitive be work can that linguistic on models builds such paper how This shows (9–12). use gradience language of such eling explain nonlinguistic to in representation. number limitations possible imprecise However, cognitive case, words. is this of domains—in vague by-product it of natural that use a approach as the shown latter in the have gradience of we is favor there one in that and zero seemingly is between argument in values One truth (49). have can statements words that argue vague Others with (48). simpliciter false or true always Both salient. currently ism phi- are in debate theories of Several topic classical losophy. a is Vagueness “many.” and “few” biases. perceptual and similar cognitive that as between a well but competition In of messages—as meanings, consequence viable crisp (47). a have as situation emerge words may given quantity gradience that a show in we certainty apply way, of rules a degrees which within varying for about disappears allows appar- approaches that the two framework that Bayesian these shown between been has tension similarity it gradient ent recently, by or More rules prototypes. all-or-none to of basis are the categories on whether defined asked have theorists (46). Category words gorization. color of meaning the optimal about runs debate argument achieve the current to to the parallel respect, messages latter the competing In the communication. contrast of epiphenomenon to modular an this is tendency structure Within and prototype approach. perception and in PT-based rationality, limitations a from emerges as gradience well approach, intended as the produc- just in receives focalization tion and utterance reasons gradience the explain that can that module interpretation, pragmatic probability a truth-conditional the and the about utterance, calculates an that con- of module production meaning have semantic language We a whereby meaning. of view, sists linguistic modular about into a facts language that encode shown use to attempt people approaches way defended lin- these the still that of and are All degree its prototypes) 45). of wane, (44, around matter to revolves a is PT meaning 43). truth for guistic (42, that support PT (i.e., caused of tenets have central ideas concerns foundational these the While about voiced been semantics. have prototype-based flexible more quantity truth- a of as production the well the as to that in just focality words able showed and GQT—is gradience we for quantification—i.e., so, account to doing approach By conditional “all”). “some,” (e.g., from meaning of theories abstract cogni- 13). of the (12, study linguistics in the modeling for probabilistic sciences of tive from field techniques growing and of the tools set that showed a We pragmatics with mean- use. combined connect and that when ing assumptions use linking eval- justified language to independently attempting explain how from on than based they language rather them evaluated well divorce Hence, we (41). directly, to theories used impossible, semantic uate is not it if way difficult, the is it that eetyashv enarsn neeti rbblsi mod- probabilistic in interest rising a seen have years Recent as such words, quantity vague of number a tested study Our cate- on research in precursor important an has endeavor Our that concerns theoretical more with in ties conclusion This words quantity of domain the to approach this applied We is theories semantic such test to challenging it makes What and epistemicism a rvd uhln ucin,teeyleveraging thereby functions, link such provide can rvdsmr nomto bu h xeiet,analyses, experiments, the about information more provides ru htsaeet ihvgewrsare words vague with statements that argue https://doi.org/10.1073/pnas.2005453118 PNAS probabilistic contextual- | f6 of 5

PSYCHOLOGICAL AND COGNITIVE SCIENCES (DGfS August 1, 2014). Participants received information about their rights Exp. 3: ANS. For Exp. 3, a total of 20 participants were drafted on AMT. and about the purpose of the study. Afterward, they gave informed consent. Displays were as in Exp. 1. Participants saw a random display from each of 24 bins, each including 18 intersection set sizes, and estimated the proportion Exp. 1: Production. For Exp. 1, there were 600 (Exp. 1a) and 200 (Exp. 1b) of red circles by moving a slider. participants drafted on Amazon’s Mechanical Turk (AMT). Only workers with an internet protocol address from the United States were eligible for partic- Exp. 4: Evaluation. For Exp. 4, a total of 200 participants were drafted on AMT. ipation. In addition, participants were asked in which language they usually Each saw 20 random displays from Exp. 1. Displays were described by sen- counted and were excluded if they did not answer English. (These constraints tences of the form “Q of the circles are red,” where Q was obtained by first hold for all experiments.) Items showed displays containing 432 black or red sampling an arbitrary number 0, ... , 432 and then sampling from the poste- circles, which were randomly scattered across a 25×36 grid. Participants saw rior predictive distribution of each model, as well as from the data observed a random selection of 10 displays. At the bottom of each display, participants in Exp. 1. We only tested items where at least two of the sampled expressions were asked: “How many of the circles are red?” Participants had to complete differed. Participants rated the adequacy of descriptions by adjusting a slider a sentence of the form “—are red” by typing freely into the blank. For the bar (with endpoints labeled “bad” and “good”). Ratings were recalibrated as analyses, we grouped together synonyms. For the purpose of data visualiza- differences from the ratings for the quantity words sampled from the data. tion (Fig. 1 B and C), we binned the data into bins consisting of 10 intersection A mixed-effects linear regression model predicting differential rating on the set sizes, except for the first and last bins, which only contained data for inter- basis of the source of the quantity word (using the data as reference category) section set sizes 0 and 432. Moreover, the second-to-last bin contained 11 with random intercepts for participants, quantity words, and intersection set intersection set sizes, also including data for intersection set size 431. sizes was the maximal converging model.

Exp. 2: Monotonicity. For Exp. 2, a total of 120 participants were drafted on Data Availability. Anonymized experimental data and model code have AMT. Based on a pretest (Exp. 2a), 17 predicate pairs hP1, P2i were selected, been deposited at Open Science Framework (DOI: 10.17605/OSF.IO/HSYTK) such that participants agreed that P1 entailed P2, e.g., hplay poker, play (50). cardsi. Using these predicate pairs, two types of arguments were randomly generated for each quantity word: ACKNOWLEDGMENTS. We thank R. Ba˚ ath˚ for contributing to the design of Exp. 1 and giving permission to make use of the data. We thank B. Geurts, • Q of the people P1. → Q of the people P2. G. Jager,¨ G. Scontras, and K. Syrett for valuable feedback. This research was • Q of the people P2. → Q of the people P1. supported by German Research Council Grants FR 3482/2-1, KR 951/14-1, SA 925/4-1, SA 925/11-1, and SA 925/17-1, in part within Grant SPP 1727 Participants indicated whether these arguments were valid. Two anno- (Xprag.de); and by Dutch Science Organisation Gravitation Grant “Language tated examples illustrated the notion of validity. in Interaction” 024.001.006.

1. G. Frege, Grundgesetze der Arithmetik (Georg Olms, Hildesheim, Germany, 1962). 26. L. Zadeh, Precisiated natural language (PNL). AI Mag. 25, 74–92 (2004). 2. L. Wittgenstein, Tractatus Logico-Philosophicus (Harcourt, New York, NY, 1922). 27. B. van Tiel, Embedded scalars and typicality. J. Semant. 31, 147–177 (2014). 3. I. Heim, A. Kratzer, Semantics in Generative Grammar (Blackwell, Malden, MA, 1998). 28. S. E. Newstead, R. A. Griggs, Fuzzy quantifiers as an explanation of set inclusion 4. G. Lakoff, Women, Fire, and Dangerous Things: What Categories Reveal About the performance. Psychol. Res. 46, 377–388 (1984). Mind (Chicago University Press, Chicago, IL, 1987). 29. M. Krifka, “Be brief and vague! And how bidirectional optimality theory allows for 5. R. Langacker, Theoretical Prerequisites (Foundations of Cognitive Grammar, Stanford verbosity and precision” in Sounds and Systems. Studies in Structure and Change: A University Press, Stanford, CA, 1987), vol. 1. Festschrift for Theo Vennemann, D. Restle, D. Zaefferer, Eds. (Mouton de Gruyter, 6. E. Rosch, C. B. Mervis, Family resemblances: Studies in the internal structure of Berlin, Germany, 2002), pp. 439–458. categories. Cognit. Psychol. 7, 573–605 (1975). 30. A. Gelman, J. B. Carlin, H. S. Stern, D. B. Rubin, Bayesian Data Analysis (Texts in 7. H. P. Grice, “Logic and conversation” in Speech Acts, P. Cole, J. L. Morgan, Eds. (Syntax Statistical Science, CRC Press, Boca Raton, FL, ed. 3, 2014). and Semantics, Academic Press, New York, NY, 1975), vol. 3, pp. 41–58. 31. L. A. Zadeh, A computational approach to fuzzy quantifiers in natural languages. 8. U. Sauerland, Scalar implicatures in complex sentences. Ling. Philos. 27, 367–391 (2004). Comput. Math. Appl. 9, 149–184 (1983). 9. M. C. Frank, N. D. Goodman, Predicting pragmatic reasoning in language games. 32. D. Lewis, General semantics. Synthese 22, 18–67 (1970). Science 336, 998 (2012). 33. B. Partee, “Montague grammar” in International Encyclopedia of the Social & Behav- 10. M. Franke, G. Jager,¨ Probabilistic pragmatics, or why Bayes’ rule is probably important ioral Sciences, N. J. Smelser, P. B. Baltes, Eds. (Pergamon, Oxford, UK, 2001), pp. for pragmatics. Z. Sprachwiss. 35, 3–44 (2016). 9995–9999. 11. B. Russell, “Probabilistic reasoning and the computation of scalar implicatures,” PhD 34. M. C. Frank, N. D. Goodman, Inferring word meanings by assuming that speakers are thesis, Brown University, Providence, RI (2012). informative. Cognit. Psychol. 75, 80–96 (2014). 12. C. Potts, D. Lassiter, R. Levy, M. C. Frank, Embedded implicatures as pragmatic 35. A. Scholler,¨ M. Franke, Semantic values as latent parameters: Testing a fixed threshold inferences under compositional lexical uncertainty. J. Semant. 33, 755–802 (2016). hypothesis for cardinal readings of few & many. Linguistics Vanguard 1, 20160072 13. M. Franke, L. Bergen, Theory-driven statistical modeling for semantics and pragmat- (2017). ics: A case study on grammatically generated implicature readings. Language 96, 36. Horn L. R. (1972) “On the semantic properties of logical operators in English,” PhD 77–96 (2020). thesis, University of California, Los Angeles, CA. 14. K. van Deemter, Not Exactly: In Praise of Vagueness (Oxford University Press, Oxford, 37. O. Ducrot, Le preuve et le dire (Maison Mame, Paris, France, 1973). UK, 2010). 38. S. Dehaene, The Number Sense: How the Mind Creates (Oxford 15. S. Peters, D. Westerstahl,˚ Quantifiers in Language and Logic (Oxford University Press, University Press, Oxford, UK, 1997). Oxford, UK, 2006). 39. G. T. Fechner, Elemente der Psychophysik (Breitkopf & Hartel, Leipzig, Germany, 16. J. Barwise, R. Cooper, Generalized quantifiers and natural language. Ling. Philos. 4, 1860). 159–219 (1981). 40. Stan Development Team, Stan Modeling Language Users Guide and Reference 17. R. Montague, “The proper treatment of quantification in ordinary English” in Manual, Version 2.18.00. https://mc-stan.org. Accessed 16 February 2021. Approaches to Natural Language, K. J. J. Hintikka, J. M. E. Moravcsik, P. Suppes, Eds. 41. H. H. Clark, Using Language (Cambridge University Press, Cambridge, UK, 1996). (Reidel, Dordrecht, The Netherlands, 1973), pp. 221–242. 42. D. N. Osherson, E. E. Smith, On the adequacy of prototype theory as a theory of 18. D. Bonevac, “A history of quantification” in Logic: A History of its Central Concepts, concepts. Cognition 9, 35–58 (1981). D. M. Gabbay, F. Pelletier, J. Woods, Eds. (Handbook of the History of Logic, Elsevier, 43. H. Kamp, B. Partee, Prototype theory and compositionality. Cognition 57, 129–191 Amsterdam, Netherlands, 2012), vol. 11, pp. 63–126. (1995). 19. J. Szymanik, M. Zajenkowski, Comprehension of simple quantifiers: Empirical 44. J. Hampton, Typicality, graded membership, and vagueness. Cognit. Sci. 31, 355–384 evaluation of a computational model. Cognit. Sci. 34, 521–532 (2010). (2007). 20. N. Katsos et al., Cross-linguistic patterns in the acquisition of quantifiers. Proc. Natl. 45. V. Novak,´ A formal theory of intermediate quantifiers. Fuzzy Set Syst. 159, 1229–1246 Acad. Sci. U.S.A. 113:9244–9249 ( 2016). (2008). 21. B. Geurts, Reasoning with quantifiers. Cognition 86, 223–251 (2003). 46. E. Gibson et al., Color naming across languages reflect color use. Proc. Natl. Acad. Sci. 22. B. Ladusaw, “Polarity sensitivity as inherent scope relation,” PhD thesis, University of U.S.A. 114, 10785–10790 (2016). Texas, Austin, TX (1979). 47. J. Tenenbaum, “A Bayesian framework for concept learning,” PhD thesis, Mas- 23. S. E. Newstead, P. Pollard, D. Riezebos, The effect of set size on the interpretation of sachusetts Institute of Technology, Cambridge, MA (1999). quantifiers used in rating scales. Appl. Ergon. 18, 178–182 (1987). 48. H. Kamp, “The paradox of the heap” in Aspects of Philosophical Logic, U. Monnich,¨ 24. T. S. Wallsten, D. V. Budescu, A. Rapoport, R. Zwick, B. Forsyth, Measuring the vague Ed. (Springer, Dordrecht, Netherlands, 1981), pp. 225–277. meanings of probability terms. J. Exp. Psychol. Gen. 115, 348–365 (1986). 49. J. A. Goguen, The logic of inexact concepts. Synthese 19, 325–373 (1969). 25. S. E. Newstead, “Quantifiers as fuzzy concepts” in Fuzzy Sets in Psychology, T. Zetenyi, 50. B. van Tiel, M. Franke, U. Sauerland, Materials for “Probabilistic pragmatics explains Ed. (Advances in Psychology, Elsevier, Amsterdam, Netherlands, 1988), vol. 56, pp. gradience and focality in natural language quantification”. Open Science Framework. 51–72. https://osf.io/hsytk/. Deposited 9 February 2021.

6 of 6 | PNAS van Tiel et al. https://doi.org/10.1073/pnas.2005453118 Probabilistic pragmatics explains gradience and focality in natural language quantification Downloaded by guest on October 4, 2021