On an effective and efficient method for exploiting the wisdom of the inner crowd

doi:10.21203/rs.3.rs-1958619/v1

Article

This work is licensed under a CC BY 4.0 License

Journal publication: published 03 Mar, 2023 in Scientific Reports.

Researchers have shown that even an individual can produce the wisdom of the crowds, called ‘the wisdom of the inner crowd’. However, the previous methods leave room for improvements in terms of efficacy and convenience. This paper proposes a more efficient method with low cognitive cost, based on findings from cognitive and social psychology. The procedure is to ask participants to give two answers to the same question: first, their own estimate and, second, their estimate of public opinion. Experiments using this method showed that the averages of the two estimates were more accurate than the participants’ first estimates. That is, the wisdom of the inner crowd emerged. In addition, we found that the method could be superior to other methods in terms of efficacy and convenience. Moreover, we identified the conditions where our method worked better. We further clarify the limitations of using the wisdom of the inner crowd, including people’s tendency to fall into overconfidence. Overall, this paper proposes an effective and convenient method for harvesting the wisdom of the inner crowd.

In daily life, people often need to estimate factual problems (e.g., the number of people who will attend a meeting or the price of a gift), and doing so may be either easy or difficult depending on the particular case. For difficult cases, researchers have sought ways to arrive at accurate estimations. One promising approach is to harness ‘the wisdom of the crowds’^1–11: that is, collecting estimations from many people and aggregating them (especially averaging) can yield surprisingly accurate estimates. For over 100 years, various studies have investigated this phenomenon.

However, the wisdom of the crowds also has a fundamental problem in that it is difficult to gather estimates from many people^12,13. To solve this problem, previous studies^14,15 proposed methods that exploited the wisdom of the crowds within-person (hereafter, ‘the wisdom of the inner crowd’^16–19). In such approaches, an individual is asked to produce two different estimates in response to a single question, in place of two people’s estimates. Research has shown that the average of an individual’s two estimates can be more accurate than their first estimate. For instance, in Fig. 1, the true answer to the question is 40. In Fig. 1A, the participant gives a first estimate of 25 and a second estimate of 35. These average to 30, which is closer to the true value than the first estimate of 25. In Fig. 1B, the first estimate is 25 and the second is 75, so the two estimates bracket the true value of 40. Their average is 50, which is again closer to the true value than 25.
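The arithmetic of the Fig. 1 example can be checked with a minimal sketch. The function names below are illustrative assumptions and are not taken from the authors' R code; only the numbers (25, 35, 75, and the true value 40) come from the text.

```python
def inner_crowd_average(first, second):
    """Return the simple average of two estimates given by the same person."""
    return (first + second) / 2.0


def abs_error(estimate, truth):
    """Absolute distance between an estimate and the true value."""
    return abs(estimate - truth)


truth = 40.0

# Fig. 1A: both estimates fall below the true value.
avg_a = inner_crowd_average(25.0, 35.0)

# Fig. 1B: the two estimates bracket the true value.
avg_b = inner_crowd_average(25.0, 75.0)

first_error = abs_error(25.0, truth)   # error of the first estimate alone
error_a = abs_error(avg_a, truth)      # error of the average in Fig. 1A
error_b = abs_error(avg_b, truth)      # error of the average in Fig. 1B
```

In both panels the average lies 10 points from the true value, whereas the first estimate lies 15 points away.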

For collecting the wisdom of the inner crowd, two promising methods have been proposed thus far. The first is to utilise the power of forgetting¹⁴. Some studies^14,17,20 have indicated that by providing a timespan between two estimates (e.g., two weeks¹⁴), people can exploit the wisdom of the inner crowd. The second is to utilise the power of the dialectic¹⁵. For instance, previous studies^15,19 have shown the effectiveness of making people ‘consider the opposite’ when making second estimates (called Dialectical bootstrapping; see Table 1 for more details).

Based on these two methods, many studies over the last decade^16–27 have examined the wisdom of the inner crowd (see the ‘Discussion’ section for more details). However, to the best of our knowledge, nothing other than the above two methods has been proposed. As we discuss below, these existing methods leave room for improvements in efficacy and convenience. Therefore, this paper proposes a new method that efficiently exploits the wisdom of the inner crowd at low cognitive cost and ‘boosts’^28–30 judgments.

For the development of the new method, cognitive and social psychology studies provide important insights. In these fields, many studies have examined how people can think in ways different from their own perspectives. In particular, it is well-known that an individual can access various forms of cognition by considering others’ perspectives, called ‘perspective-taking’^31,32. Perspective-taking enables a person to decrease stereotypic biases³³, change preferential values³⁴, and reduce egocentric thinking³⁵. Therefore, we decided to follow the perspective-taking paradigm. Among the perspectives a person could take, we chose the general crowd’s viewpoint. People believe the general crowd is different from themselves in several ways, including degree of intelligence^36–38 and risk attitude^39,40. Accordingly, we hypothesised that by considering the general crowd’s perspectives, participants could make estimates different from their own previous estimates in response to the same question.

We developed a new method as follows. Like the two existing methods described above, the method proposed here requires people to give two estimates for a single question. However, unlike the other methods, our procedure first asks the participants for their own opinion and then asks them to estimate the opinion of the general crowd or average people (i.e., public opinion; for the full instructions, see Table 1). Subsequently, the two opinions are averaged.

In the sections below, we confirm that the averaged estimates are more accurate than the participants’ original estimates based on their own view. Subsequently, we compare our method with the existing ones in terms of efficacy and convenience, using response time as an index of convenience. The results highlight the room for improvement in the previous methods. Moreover, we identify the conditions in which our method works better (or worse) and point out the limitations involved in utilising the wisdom of the inner crowd. Through these analyses and discussions, we demonstrate that the new approach can aid research on the wisdom of the inner crowd.

-----Figure 1 about here-----

Table 1

Full instructions for each method (called ‘condition’ in our experiments). Note that in second estimates, the Dialectical method required the first estimate to be displayed while the Other’s perspective and Repeated methods did not. All instructions were translated into Japanese.
Method	Instructions for the second estimates
Other’s perspective	How do you think people in general estimate the following question? Make a second estimate after considering fully how people in general estimate this. (The computer display does not present the first estimate.)
Dialectical	First, assume that your first estimate is off the mark. Second, think about a few reasons why that could be. Which assumptions and considerations could have been wrong? Third, what do these new considerations imply? Was the first estimate too high or too low? Fourth, based on this new perspective, make a second, alternative estimate. (The computer display shows the first estimate.)
Repeated	No instructions (The display does not present the first estimate.)

An overview of the behavioural data.

Experiment 1 tested the efficacy of the proposed method, including a comparison with other methods. Therefore, we set three conditions: Other’s perspective, Dialectical, and Repeated conditions. All participants produced two estimates for each question. In the Other’s perspective condition, participants used our method. In the Dialectical condition, they did dialectical bootstrapping¹⁵. In the Repeated condition, they produced two estimates for a question without instructions (Table 1). Participants were randomly assigned to one of the three conditions. The stimuli were general-knowledge questions (for example, ‘What percent of the world’s airports are in the United States?’; Table 2).

In Experiment 2, we also set the three conditions (that is, the Other’s perspective, Dialectical, and Repeated conditions). To conduct further analysis, we performed Experiment 2 with the following modifications from Experiment 1. We recorded response times, asked participants for a third (i.e., final) estimate, and asked them to rate their level of confidence (see ‘Methods’ for more details).

In Experiment 3, we tested the method in an additional framework to examine whether its efficacy increased when the number of estimates increased. In this experiment, we set a single condition: all participants made five estimates for each question, namely one estimate of their own and four estimates of public opinion.

Efficacy of our method

Based on the behavioural data from the Other’s perspective condition in Experiments 1 and 2, we analysed the efficacy of our method. First, we compared accuracy among the three estimates, i.e., the first, second, and their averaged estimates, only in the Other’s perspective condition. As an index of the accuracy of the estimates, we used the mean squared error (MSE), where a lower MSE indicates higher accuracy. We calculated the MSEs of the three estimates for each participant.
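As an illustration of this accuracy index, the following sketch computes the three per-participant MSEs. The true values are the answers to Questions 1–3 in Table 2; the participant's estimates are hypothetical, and the function names are assumptions rather than the authors' implementation.

```python
def mse(estimates, truths):
    """Mean squared error of a list of estimates against the true values."""
    if len(estimates) != len(truths):
        raise ValueError("estimates and truths must have the same length")
    return sum((e - t) ** 2 for e, t in zip(estimates, truths)) / len(truths)


def participant_mses(first, second, truths):
    """MSE of the first, second, and averaged estimates for one participant."""
    averaged = [(a + b) / 2.0 for a, b in zip(first, second)]
    return {
        "first": mse(first, truths),
        "second": mse(second, truths),
        "averaged": mse(averaged, truths),
    }


truths = [6.32, 41.29, 32.31]   # answers to Q1-Q3 in Table 2 (in %)
first = [20.0, 50.0, 10.0]      # hypothetical own estimates
second = [10.0, 30.0, 40.0]     # hypothetical estimates of public opinion

result = participant_mses(first, second, truths)
```

For this hypothetical participant the averaged estimates have a lower MSE than either the first or the second estimates, which is the pattern the analysis tests for.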

Figure 2 shows the results of the analysis. Notably, the averaged estimates had a lower MSE than the participants’ own estimates (Estimate 1) across the two experiments (Experiment 1: t(149) = 4.39, p < .01, Cohen’s d = 0.15; Experiment 2: t(29) = 4.23, p < .01, Cohen’s d = 0.50). Thus, we confirmed that the new method elicited the wisdom of the inner crowd.

Note that we also compared the averaged estimates with two people’s own estimates (see Section S1 of the Supplementary Information) and found they did not exceed the two people’s estimates. However, we also found that the averaged estimates could be more accurate than 1.5 people’s estimates (1.59 people in Experiment 2; 1.26 people in Experiment 1). To the best of our knowledge, this is the best efficacy for an approach to harvesting the wisdom of the inner crowd^14,17,20–22.

As previous studies^14,15 showed, the second estimate is not necessarily accurate. Accordingly, in our study, Estimate 2 had a higher MSE than the average across the two experiments (Experiment 1: t(149) = 6.50, p < .01, Cohen’s d = 0.32; Experiment 2: t(29) = 3.49, p < .01, Cohen’s d = 0.32). In addition, we did not find a significant difference in MSE between Estimate 1 and Estimate 2 (ps > .1).

-----Figure 2 about here-----

Table 2

Questions and correct answers used in the experiments (in %). We checked all the answers on 2022/08/05. We used the answers in The world factbook (52), as in the previous studies (14,15). Experiment 1 used Questions 1–8 (14), and Experiments 2 and 3 used all the questions (15). Note that since we could not confirm the answer for Q10, we used the data from World Bank Data (53). In addition, for Q5, we used the latest data from the World Population Review (54) because fertility rates change frequently. All questions were translated into Japanese.
Number	Question	Answer
1	The area of the USA is what percent of the area of the Pacific Ocean?	6.32
2	What percent of the world’s population lives in either China, India, or the European Union?	41.29
3	What percent of the world’s airports are in the United States?	32.31
4	What percent of the world’s roads are in India?	7.30
5	What percent of the world’s countries have a higher fertility rate than the United States?	69.84
6	What percent of the world’s telephone lines are in China, USA, or the European Union?	52.09
7	Saudi Arabia consumes what percentage of the oil it produces?	26.62
8	What percentage of the world’s countries have a higher life expectancy than the United States?	22.56
9	What percent of the earth’s surface is covered by water?	70.90
10	What percent of the worldwide land mass is not used for agriculture?	63.10
11	What percent of the world’s population is between 15 and 64 years old?	65.18
12	What percent of the world’s population is Christian?	31.40
13	What percent of the world’s population speaks Mandarin Chinese as a first language?	12.30
14	What percent of the world’s population aged 15 years or older can read and write?	86.30
15	What percent of the worldwide gross domestic product (GDP) comes from the service sector?	63.00
16	What percent of the worldwide labor force works in the agricultural sector?	31.00
17	What percent of the worldwide income does the richest 10% of households earn?	30.20
18	What percent of the worldwide gross domestic product (GDP) is re-invested (‘gross fixed investment’)?	25.70
19	What percent of the goods exported worldwide are mineral fuels (including oil, coal, gas, and refined products)?	14.40
20	What percent of the worldwide gross domestic product (GDP) is used for the military (military expenditure)?	2.14

Comparison of the methods

We compared the efficacy among conditions, based on the data from Experiments 1 and 2. The reduction of MSE was calculated as shown in Eq. 1.

$\text{Reduction of MSE} = \mathrm{MSE}_{\text{first estimates}} - \mathrm{MSE}_{\text{averaged two estimates}}$ (Eq. 1)

A higher reduction of MSE indicates higher effectiveness of a method. As Fig. 3 shows, the results confirm the advantage of our method. In Experiment 1, the Other’s perspective condition had a larger reduction of MSE than the Repeated condition (pairwise Wilcoxon rank-sum test: p < .05, Cliff’s delta = 0.16; Dialectical and Repeated conditions did not follow the normal distribution, Kolmogorov–Smirnov test: ps < .05; note that all pairwise tests were performed using Bonferroni correction). In Experiment 2, the Other’s perspective condition had a larger reduction of MSE than the Dialectical condition (pairwise Wilcoxon rank-sum test: p < .05, Cliff’s delta = 0.40; Other’s perspective and Repeated conditions did not follow the normal distribution, Kolmogorov–Smirnov test: ps < .05).

No such effects were found between the Other’s perspective condition and Dialectical condition in Experiment 1 (p = 1.00) or between the Other’s perspective condition and Repeated condition in Experiment 2 (p = 0.22). However, the results showed the benefit of our method: the Other’s perspective condition had the largest reduction of MSE among all conditions across the two experiments (Experiment 1: Other’s perspective = 51.80; Dialectical = 42.01; Repeated = 15.69; Experiment 2: Other’s perspective = 85.56; Dialectical = 17.38; Repeated = 38.46). In summary, these results suggest that the new method is superior to the previous ones in terms of efficacy (for raw data, see S5 of the Supplementary Information).
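The reduction of MSE and the effect size reported for the condition comparisons can be sketched as follows. This is a plain-Python illustration of the standard definition of Cliff’s delta (the proportion of cross-group pairs in which the first group is larger, minus the proportion in which it is smaller). The per-participant reductions are hypothetical, and the function names are assumptions rather than the authors' code.

```python
def reduction_of_mse(mse_first, mse_averaged):
    """Reduction of MSE: MSE of first estimates minus MSE of averaged estimates."""
    return mse_first - mse_averaged


def cliffs_delta(xs, ys):
    """Cliff's delta for two independent samples.

    Counts, over all (x, y) pairs, how often x > y and how often x < y,
    and returns the difference divided by the number of pairs.
    """
    greater = sum(1 for x in xs for y in ys if x > y)
    less = sum(1 for x in xs for y in ys if x < y)
    return (greater - less) / (len(xs) * len(ys))


# Hypothetical per-participant reductions of MSE in two conditions.
condition_a = [60.0, 40.0, 55.0, 10.0]
condition_b = [20.0, 45.0, 5.0, 15.0]

delta = cliffs_delta(condition_a, condition_b)
```

A positive delta means that values in the first condition tend to exceed those in the second; a value of 0 means complete overlap, and 1 means every value in the first condition exceeds every value in the second.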

-----Figure 3 about here-----

Analysis of cognitive load

Methods for collecting the wisdom of the inner crowd can be used on a daily basis. From this perspective, it is important that the method is convenient to use. Therefore, we compared the cognitive load among all conditions.

As an index of the cognitive load, we utilised response time: in Experiment 2, the laboratory computer recorded the response time. In particular, we examined the response time for the second estimates because, for this estimate, the participants were instructed differently depending on their assigned condition (Table 1). It should be noted that for the first estimates, we did not find any significant differences among the three conditions (pairwise t-test: ps > .1).

Figure 4 shows the results of the analysis. Most importantly, the Other’s perspective condition had a significantly shorter response time than the Dialectical condition (pairwise t-test: p < .01, Cohen’s d = 0.96). The results indicate that our method is relatively convenient to use. It should be added that the Repeated condition had a shorter response time than the Other’s perspective and Dialectical conditions since the Repeated condition had no specific instructions (pairwise t-test: the Other’s perspective condition, p < .05, Cohen’s d = 0.84; the Dialectical condition, p < .01, Cohen’s d = 1.72, respectively).

When considered along with the results presented in the last section, our method could be superior to other methods in terms of efficacy and cognitive load.

-----Figure 4 about here-----

When the proposed method worked better (or worse)

For further analysis, we investigated the conditions under which the methods worked better or worse. In Experiment 2, all participants reported their level of confidence in their first estimates. Subsequently, we analysed the influence of confidence on the efficacy of each method. We conducted mixed-effects analyses⁴¹ for each condition with the reduction of the MSE as a dependent variable and confidence as an independent variable, as well as the participants and questions as random variables.
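One way to write the model described above is given below. This is a sketch under the assumption that the model contains only a fixed effect of confidence and random intercepts for participants and questions, as the ‘Methods’ section suggests; the symbols are ours and are not taken from the authors' R code.

$\text{Reduction}_{ij} = \beta_0 + \beta_1\,\text{Confidence}_{ij} + u_i + v_j + \varepsilon_{ij}, \quad u_i \sim \mathcal{N}(0, \sigma_u^2), \; v_j \sim \mathcal{N}(0, \sigma_v^2), \; \varepsilon_{ij} \sim \mathcal{N}(0, \sigma^2)$

Here $i$ indexes participants and $j$ indexes questions, $u_i$ and $v_j$ are the random intercepts, and a positive $\beta_1$ corresponds to a greater reduction of error at higher confidence.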

The results showed that in the Other’s perspective condition, higher confidence corresponded to a greater reduction of MSE (F(1, 531.62) = 10.30, p < .01; see also Fig. 5). In other words, the proposed method worked better when participants were confident in their own estimates. Accordingly, it would be better for people to use the method when their confidence is high. For other conditions, we did not find such effects (Dialectical condition: p = .96; Repeated condition: p = .73). Thus, hereafter, we shall discuss the Other’s perspective condition.

How did these results emerge? In Experiment 2, confidence did not correlate with accuracy in the first estimate (p > .1), meaning there is room for improving estimates even when the participants feel confident. Importantly, confidence in the first estimate was related to accuracy in the second estimate. We conducted an additional mixed-effects analysis that included the MSE in the second estimate as a dependent variable, with confidence as an independent variable. The results showed that, although only marginally, the higher the confidence was, the lower the MSE was in the second estimate (F(1, 544.9) = 2.80, p = 0.095). Consequently, when confidence was high, the average could be closer to the true value, which would produce the greater reduction of MSE described above. We shall remark on this issue in the ‘Discussion’ section.

-----Figure 5 about here-----

Overconfidence in the final estimate

Thus far, we have shed light on the positive aspects of the wisdom of the inner crowd. Here, we point out its negative aspects and limitations.

Previous studies^18,19 have pointed out the possibility that people cannot ‘utilise’ the wisdom of the inner crowd. As we have discussed, the average of the two estimates was accurate in the proposed method for harvesting the wisdom of the inner crowd. However, people might not naturally utilise averages as their final estimates. For example, some people might adopt their first estimate as their final one. We address this problem based on the results of Experiment 2 since all the participants produced final answers based on their own thinking.

Figure 6A shows the results of the analysis. The analysis compared the MSE of the first and final estimates. As Fig. 6A shows, only in the Repeated condition was the final estimate more accurate than the first estimate (t(30) = 2.20, p < .05). In the Other’s perspective and Dialectical conditions, we did not find such effects (ps > .1). Thus, as previous studies pointed out, people do not always utilise the wisdom of the inner crowd naturally (as for how the results emerged, see Section S2 of the Supplementary Information).

Moreover, in Experiment 2, the participants also reported their confidence in their final estimates. Comparing this with their confidence in their first estimates, we found that the participants became more confident in the final estimates than in the first estimates across all conditions (Fig. 6B, Other’s perspective: t(27) = 3.29, p < .01; Dialectical: t(26) = 3.15, p < .01; Repeated: t(29) = 4.70, p < .01). Taken together with the above results, methods for eliciting the wisdom of the inner crowd could, overall, lead participants into ‘overconfidence’^42–45. We shall remark on this issue in the Discussion.

-----Figure 6 about here-----

When the number of estimates increased

Thus far, we have asked the participants to provide a single public opinion in the Other’s perspective condition. Can the efficacy of our method increase if the number of estimated public opinions increases? Previous research on the wisdom of the inner crowd^17,22 has often discussed the effect of the number of estimates. For instance, one study²² examined a case where participants gave five estimates in response to a single question (note that this study provided no specific instructions). They compared this case with when participants gave two estimates for a question, and the results revealed that increasing the number of estimates did not enhance the wisdom of the inner crowd effect. Thus, to determine the potential of our method, it is important to address whether the number of estimates can enhance the wisdom of the inner crowd effect.

In Experiment 3, all the participants gave five estimates in response to each question: the participants answered their own estimate once and estimated public opinions four times (see more details in the ‘Methods’ section). In the analysis, we calculated the reduction of MSE. In this context, we computed how much the error in the first estimate decreased by averaging all five estimates. Thus, the reduction of MSE was calculated as shown in Eq. 2.

$\text{Reduction of MSE} = \mathrm{MSE}_{\text{first estimates}} - \mathrm{MSE}_{\text{averaged five estimates}}$ (Eq. 2)

Subsequently, we compared the results of this analysis with those of the Other’s perspective condition in Experiment 2. As Fig. 7A shows, the reduction of MSE in Experiment 3 was positive (95% CI = [47.20, 126.91]; we conducted bootstrapping based on 10,000 resamples with replacement). In other words, the error in the first estimate decreased to some degree when the participants gave five estimates.
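A bootstrap confidence interval of this kind can be sketched as follows. The text specifies only that 10,000 resamples with replacement were drawn; the percentile method, the seed, the function name, and the input values below are assumptions made for illustration.

```python
import random


def bootstrap_mean_ci(values, n_resamples=10_000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the mean of `values`.

    Each resample draws len(values) items with replacement; the interval is
    read from the sorted resample means at the alpha/2 and 1 - alpha/2 points.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(values)
    means = []
    for _ in range(n_resamples):
        sample = [values[rng.randrange(n)] for _ in range(n)]
        means.append(sum(sample) / n)
    means.sort()
    lower = means[int(round((alpha / 2) * n_resamples))]
    upper = means[int(round((1 - alpha / 2) * n_resamples)) - 1]
    return lower, upper


# Hypothetical per-participant reductions of MSE.
reductions = [120.0, 30.0, 95.0, 60.0, 150.0, 10.0, 80.0, 110.0]
ci_lower, ci_upper = bootstrap_mean_ci(reductions)
```

If the lower bound of such an interval is above zero, the reduction of MSE can be described as positive at the chosen confidence level.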

However, most importantly, we did not find a significant difference between Experiment 3 and the Other’s perspective condition in Experiment 2 (Welch t-test: t(59) = 0.34, p = .73). This indicates that increasing the number of public opinion estimates did not necessarily enhance the efficacy of our method (see the ‘Discussion’ for speculation on how to overcome this limitation).

How did the results emerge? As mentioned in the ‘Introduction’, the wisdom of the inner crowd paradigm aims to make participants produce opinions different from their own. Accordingly, we calculated ‘distance’ in both experiments. Here, distance means the absolute difference between a participant’s own estimate and their guessed public opinion. For Experiment 3, we first averaged the four public opinions and then computed the distance. The results show that Experiment 3 had a smaller distance than Experiment 2 (see Fig. 7B; t(59) = 5.08, p < .01). That is, compared to Experiment 2, Experiment 3 failed to make participants produce opinions that differed from their own.
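The distance measure can be illustrated with a short sketch. The function names and the numbers are hypothetical; only the definition (absolute difference, with the four public opinions averaged first in Experiment 3) follows the text.

```python
def distance_two_estimates(own, public):
    """Distance for Experiment 2: |own estimate - estimated public opinion|."""
    return abs(own - public)


def distance_five_estimates(own, publics):
    """Distance for Experiment 3: |own estimate - mean of the public opinions|."""
    return abs(own - sum(publics) / len(publics))


own = 30.0
publics = [20.0, 40.0, 25.0, 45.0]  # hypothetical estimates of public opinion

single_distances = [distance_two_estimates(own, p) for p in publics]
averaged_distance = distance_five_estimates(own, publics)
```

In this hypothetical case each public opinion lies 5 to 15 points from the participant's own estimate, yet their mean lies only 2.5 points away, because public opinions placed on both sides of the own estimate cancel out when averaged.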

Table 3 shows more detailed results for Experiment 3. In this table, we categorised participants’ own estimates according to their size relative to the four public opinions: the smallest, second smallest, medium, second largest, and the largest. We first counted the number of estimates falling into each category for each participant and then added up the numbers for all participants. As a result, the ‘medium’ and ‘second smallest’ categories appeared more frequently than the other categories (95% CI). In the medium category in particular, participants placed two of the four public opinions above their own estimate and the other two below it. In other words, the participants’ own first estimates appear to have worked as anchors^46–48. We speculate that, as a result, the average of the four public opinions did not differ greatly from the participants’ own estimates.
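The categorisation behind Table 3 can be sketched as follows. The handling of ties is not described in the text, so the rule used here (a public opinion equal to the own estimate is not counted as smaller) is an assumption, as are the function name and the example values.

```python
from collections import Counter

CATEGORIES = ["smallest", "second smallest", "medium", "second largest", "largest"]


def categorise_own_estimate(own, publics):
    """Rank the own estimate among itself and four public-opinion estimates.

    The rank is the number of public opinions strictly smaller than the own
    estimate (assumed tie rule: equal values are not counted as smaller).
    """
    if len(publics) != 4:
        raise ValueError("expected exactly four public-opinion estimates")
    rank = sum(1 for p in publics if p < own)
    return CATEGORIES[rank]


# Hypothetical responses of one participant to three questions:
# (own estimate, four estimates of public opinion).
responses = [
    (30.0, [20.0, 40.0, 25.0, 45.0]),
    (10.0, [15.0, 20.0, 12.0, 30.0]),
    (50.0, [40.0, 60.0, 55.0, 70.0]),
]

counts = Counter(categorise_own_estimate(own, publics) for own, publics in responses)
```

Summing such per-participant counts over the 20 questions gives the frequencies whose confidence intervals are reported in Table 3.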

-----Figure 7 about here-----

Table 3

Analysis of the first estimates. Estimates were categorised by size in comparison to the size of all public opinions. We first counted the number of estimates falling into each category for each participant and then added up the number for all participants.
Category	Frequency in the 20 questions (95% CI)
Smallest	[2.10, 3.93]
Second smallest	[5.68, 7.29]
Medium	[5.74, 7.71]
Second largest	[2.19, 3.51]
Largest	[0.61, 1.35]

This study proposes a new method for exploiting the wisdom of the inner crowd. Our method asks participants to give two estimates in response to a question: their own estimate and their estimate of public opinion. It then averages the estimates (for optimal weighting, see Section S4 of the Supplementary Information). Across Experiments 1 and 2, we confirmed that the proposed method produced the wisdom of the inner crowd effect. Moreover, we found that it could be more effective and convenient than other methods.

Moreover, this study makes two substantial contributions. First, we identified the conditions under which the new method works better or worse. These conditions have been given little attention in the existing literature, to the best of our knowledge. However, it is important to understand efficacy in context in order to appropriately utilise a method for eliciting the wisdom of the inner crowd for informing decisions encountered in daily life. The analysis showed that our method worked better when participants had high confidence in their own estimates.

As mentioned above, we found that the accuracy of the second estimate was high when the confidence in the first estimate was high. We speculate that the cause is as follows. When participants had low confidence in their response, it might have been difficult for them even to estimate public opinion, resulting in estimates that were off the mark (e.g., the answer was 10 and an estimate was 90). In contrast, when they had a higher degree of confidence, it seemed to be relatively easy to produce a plausible estimate of public opinion^49,50. In a future study, we plan to examine this speculation directly: put simply, when participants estimate public opinions, we will ask them how confident they feel in taking different perspectives.

Second, the results point to the limitations of utilising the wisdom of the inner crowd. Participants tended to fall into overconfidence: specifically, they became more confident about the third or final estimate compared to the first estimate. However, accuracy did not increase as a whole. We assume that this is an inherent defect of methods that elicit the wisdom of the inner crowd. As a coping strategy, we propose that such methods forcibly average people’s estimates rather than rely on people to do so naturally.

Additionally, although the number of estimates (that is, simulated public opinions) increased, the efficacy of our method did not increase. However, a previous study²² also showed that theoretically, the efficacy of the methods could increase as the number of estimates increased. Thus, we consider our results as showing the need for a method that is better structured. For example, the wisdom of the inner crowd effect may become larger if our method is blended⁵¹ with those of previous studies^14,15.

Overall, with regard to both its potential and its limitations, this paper highlights room for further analysis of methods for eliciting the wisdom of the inner crowd. Note that previous research can be classified into two directions. First, some studies have attempted to extend the wisdom of the inner crowd to other tasks^24–27. For example, a previous study²⁶ showed that by using this method, an individual could improve performance evaluations of matters for which there is no objective truth (for other examples, see social projection²⁴ and self-deception²⁵). Second, other studies^21–23 focused on its theoretical mechanism. For example, a previous study²² presented a mathematical framework indicating two mechanisms through which the wisdom of the inner crowd works.

In contrast, few new methods have been proposed. This may stem from the difficulty of developing such methods¹⁵: the second estimate should differ from the first estimate but, at the same time, remain plausible. If the second estimate only adds noise^50,51, the averaged estimate will not be more accurate than the first one. Nevertheless, this study demonstrates that developing an alternative method to those already proposed can facilitate research on the wisdom of the inner crowd.

In all three experiments, the participants provided informed consent prior to joining the study. The experimental protocol was approved by the University of Tokyo Research Ethics Committee and conducted in accordance with the latest version of the Declaration of Helsinki.

Experiment 1: Participants and Procedure

The participants were 452 Japanese adults who participated in the experiment through a web research company. To recruit the participants for the web-based survey, we contracted with Rakuten Insight (https://member.insight.rakuten.co.jp/), a well-known survey company. Rakuten Insight has the largest panel in Japan, consisting of more than 220,000 people. After completing the study, the participants received cash-equivalent points as an incentive that could be used for online shopping.

The stimuli were eight questions about general knowledge (e.g., ‘What percent of the world’s airports are in the United States?’; Table 2), the same as in the previous study¹⁴. The participants answered the question set twice (Sets 1 and 2), yielding a total of two estimates in response to each question. Across the two sets, the order of questions remained constant. Further, we randomised the order of questions for each participant.

There were three conditions: Other’s perspective, Dialectical, and Repeated. The participants were randomly assigned to one of the three conditions (Other’s perspective: n = 150, 98 female and 52 male, M_age = 45.5 and SD_age = 8.0; Dialectical: n = 151, 95 female and 56 male, M_age = 43.6 and SD_age = 8.0; Repeated: n = 151, 94 female and 57 male, M_age = 43.6 and SD_age = 8.1). In the Other’s perspective condition, they used our method of answering Set 1 with their own opinion and Set 2 with their estimate of public opinion. In the Dialectical condition, they did dialectical bootstrapping¹⁵, and in the Repeated condition, they answered both sets without instructions on what to think about (Table 1).

Experiment 2: Participants and Procedure

The participants were 90 Japanese undergraduate and graduate students. They received a flat fee of 1,000 Japanese yen (approximately $9.17 at the currency rate at the time) for their participation.

The stimulus consisted of 20 questions about general knowledge (Table 2), the same as in the previous study¹⁹. The participants answered the question set three times (Sets 1–3), yielding a total of three estimates in response to each question. Across the three sets, the order of questions remained constant. Further, we randomised the order of questions for each participant.

As in Experiment 1, the participants were randomly assigned to one of the three conditions (Other’s perspective: n = 30, 7 female and 21 male, M_age = 21.2 and SD_age = 2.3; Dialectical: n = 29, 8 female and 20 male, M_age = 21.1 and SD_age = 3.1; Repeated: n = 31, 8 female and 23 male, M_age = 20.6 and SD_age = 2.5). We dropped three participants from the demographic data (two in the Other’s perspective condition and one in the Dialectical condition) and five participants from the data on confidence in the first and third estimates (two in the Other’s perspective condition, two in the Dialectical condition, and one in the Repeated condition). Data from these participants are excluded from the corresponding analyses.

In Set 1, they gave their own estimates across all conditions. They also evaluated their level of confidence in each response on a scale ranging from 0 (I am not confident in my answer at all) to 100 (I am very confident in my answer). In Set 2, they gave estimates depending on their assigned condition, as in Experiment 1. Subsequently, in Set 3, they were shown their two previous estimates on the computer display and asked to provide a final answer along with their level of confidence in it.

Experiment 3: Participants and Procedure

The participants were 33 Japanese undergraduate and graduate students (14 female and 19 male, M_age = 20.0 and SD_age = 1.4). They received a flat fee of 1,000 Japanese yen (approximately $9.17 at the currency rate at the time) for their participation.

The stimulus consisted of the same 20 questions used in Experiment 2. We set only one condition in this experiment: All the participants gave five estimates for each question (Sets 1–5). First, they answered with their own estimate, and then they gave estimates of public opinion four times.
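With five estimates per question, one natural way to examine the effect of adding estimates is to average the first k estimates for k = 1 to 5. The sketch below shows that computation only; the function name and the example values are hypothetical, and the authors' actual analysis may differ.

```python
from itertools import accumulate


def running_means(estimates):
    """Return the mean of the first k estimates for k = 1..len(estimates)."""
    totals = accumulate(estimates)  # cumulative sums: e1, e1+e2, ...
    return [total / k for k, total in enumerate(totals, start=1)]


# Hypothetical answers from one participant to one question:
# Set 1 is the own estimate, Sets 2-5 are estimates of public opinion.
estimates = [30.0, 60.0, 55.0, 65.0, 40.0]

for k, mean in enumerate(running_means(estimates), start=1):
    print(f"mean of first {k} estimate(s): {mean:.2f}")
```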

Mixed-effect analysis

We performed all mixed-effects analyses using the R packages lme4 and lmerTest⁴¹. Starting from the full model with random intercepts for participants and stimuli, we selected the best model and computed all statistical values using the step() function.
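For orientation, a generic linear mixed model with crossed random intercepts for participants and stimuli can be written as below. The symbols are illustrative assumptions: this section does not list the predictors of the full model, so the fixed-effects part is left as a generic vector.

```latex
% y_{ij}: response of participant i to stimulus (question) j
% x_{ij}: vector of fixed-effect predictors (not specified in this section)
% u_i:    random intercept for participant i
% v_j:    random intercept for stimulus j
\begin{align}
  y_{ij} &= \beta_0 + \mathbf{x}_{ij}^{\top}\boldsymbol{\beta} + u_i + v_j + \varepsilon_{ij}, \\
  u_i &\sim \mathcal{N}(0, \sigma_u^2), \qquad
  v_j \sim \mathcal{N}(0, \sigma_v^2), \qquad
  \varepsilon_{ij} \sim \mathcal{N}(0, \sigma^2).
\end{align}
```

Under this formulation, model selection with step() amounts to testing which terms of the model can be dropped.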

Data Availability

The R code and the three datasets analysed in this study are available in Mendeley Data: https://data.mendeley.com/datasets/p29rkjmvjp/1

Conflicts of Interest

The authors declare no competing interests.

Funding

This research was supported by JSPS KAKENHI Grant Number JP22H03911 and JST CREST Grant Number JPMJCR19A1.

Surowiecki, J. The wisdom of crowds. Anchor (2004).
Lorenz, J., Rauhut, H., Schweitzer, F. & Helbing, D. How social influence can undermine the wisdom of crowd effect. Proc. Natl. Acad. Sci. 108, 9020–9025 (2011). (doi:10.1073/pnas.1008636108)
Hertwig, R. Tapping into the wisdom of the crowd–with confidence. Science 336, 303–304 (2012). (doi:10.1126/science.1221403)
Jayles, B., Kim, H., Escobedo, R., Cezera, S., Blanchet, A., Kameda, T., et al. How social information can improve estimation accuracy in human groups. Proc. Natl. Acad. Sci. 114, 12620–12625 (2017). (doi:10.1073/pnas.1703695114)
Fujisaki, I., Honda, H. & Ueda, K. Diversity of inference strategies can enhance the ‘wisdom-of-crowds’ effect. Humanit. Soc. Sci. Commun. 4, 107 (2018). (doi:10.1057/s41599-018-0161-1)
Prelec, D., Seung, H. S. & McCoy, J. A solution to the single-question crowd wisdom problem. Nature 541, 532–535 (2017). (doi:10.1038/nature21054)
Moussaïd, M., Herzog, S. M., Kämmer, J. E. & Hertwig, R. Reach and speed of judgment propagation in the laboratory. Proc. Natl. Acad. Sci. 114, 4117–4122 (2017). (doi:10.1073/pnas.1611998114)
Jayles, B., Escobedo, R., Cezera, S., Blanchet, A., Kameda, T., Sire, C., et al. The impact of incorrect social information on collective wisdom in human groups. J. R. Soc. Interface 17, 170 (2020). (doi:10.1098/rsif.2020.0496)
Herzog, S. M. & Hertwig, R. The wisdom of ignorant crowds: Predicting sport outcomes by mere recognition. Judgm. Decis. Mak. 6, 58–72 (2011).
Becker, J., Brackbill, D. & Centola, D. Network dynamics of social influence in the wisdom of crowds. Proc. Natl. Acad. Sci. 114, E5070–E5076 (2017). (doi:10.1073/pnas.1615978114)
Tump, A. N., Pleskac, T. J. & Kurvers, R. H. J. M. Wise or mad crowds? The cognitive mechanisms underlying information cascades. Sci. Adv. 6, eabb0266 (2020). (doi:10.1126/sciadv.abb0266)
Analytis, P. P., Barkoczi, D. & Herzog, S. M. You’re special, but it doesn’t matter if you’re a greenhorn: Social recommender strategies for mere mortals. Proc. 37th Annu. Conf. Cogn. Sci. Soc. 1799–1804 (2015).
Analytis, P. P., Barkoczi, D. & Herzog, S. M. Social learning strategies for matters of taste. Nat. Hum. Behav. 2, 415–424 (2018). (doi:10.1038/s41562-018-0343-2)
Vul, E. & Pashler, H. Measuring the crowd within. Psychol. Sci. 19, 645–647 (2008). (doi:10.1111/j.1467-9280.2008.02136.x)
Herzog, S. M. & Hertwig, R. The wisdom of many in one mind. Psychol. Sci. 20, 231–237 (2009). (doi:10.1111/j.1467-9280.2009.02271.x)
Herzog, S. M. & Hertwig, R. Harnessing the wisdom of the inner crowd. Trends Cogn. Sci. 18, 504–506 (2014). (doi:10.1016/j.tics.2014.06.009)
Van Dolder, D. & Van Den Assem, M. J. The wisdom of the inner crowd in three large natural experiments. Nat. Hum. Behav. 2, 21–26 (2018). (doi:10.1038/s41562-017-0247-6)
Müller-Trede, J. Repeated judgment sampling: Boundaries. Judgm. Decis. Mak. 6, 283–294 (2011).
Herzog, S. M. & Hertwig, R. Think twice and then: combining or choosing in dialectical bootstrapping? J. Exp. Psychol. Learn. Mem. Cogn. 40, 218–232 (2014). (doi:10.1037/a0034054)
Hourihan, K. L. & Benjamin, A. S. Smaller is better (when sampling from the crowd within): Low memory-span individuals benefit more from multiple opportunities for estimation. J. Exp. Psychol. Learn. Mem. Cogn. 36, 1068–1074 (2010). (doi:10.1037/a0019694)
Gaertig, C. & Simmons, J. P. The psychology of second guesses: Implications for the wisdom of the inner crowd. Manag. Sci. 67, 5921–5942 (2021). (doi:10.1287/mnsc.2020.3781)
Rauhut, H. & Lorenz, J. The wisdom of crowds in one mind: How individuals can simulate the knowledge of diverse societies to reach better decisions. J. Math. Psychol. 55, 191–197 (2011). (doi:10.1016/j.jmp.2010.10.002)
Steegen, S., Dewitte, L., Tuerlinckx, F. & Vanpaemel, W. Measuring the crowd within again: a pre-registered replication study. Front. Psychol. 5, 786 (2014). (doi:10.3389/fpsyg.2014.00786)
Krueger, J. I. & Chen, L. J. The first cut is the deepest: effects of social projection and dialectical bootstrapping on judgmental accuracy. Soc. Cogn. 32, 315–336 (2014). (doi:10.1521/soco.2014.32.4.315)
Van der Leer, L. & McKay, R. The optimist within? Selective sampling and self-deception. Conscious. Cogn. 50, 23–29 (2016). (doi:10.1016/j.concog.2016.07.005)
Barneron, M., Allalouf, A. & Yaniv, I. Rate it again: Using the wisdom of many to improve performance evaluations. J. Behav. Decis. Mak. 32, 485–492 (2019). (doi:10.1002/bdm.2127)
Fiechter, J. L. & Kornell, N. How the wisdom of crowds, and of the crowd within, are affected by expertise. Cogn. Res. Princ. Implic. 6, 5 (2021). (doi:10.1186/s41235-021-00273-6)
Lorenz-Spreen, P., Geers, M., Pachur, T., Hertwig, R., Lewandowsky, S. & Herzog, S. M. Boosting people’s ability to detect microtargeted advertising. Sci. Rep. 11, 15541 (2021). (doi:10.1038/s41598-021-94796-z)
Grüne-Yanoff, T. & Hertwig, R. Nudge versus boost: How coherent are policy and theory? Minds Mach. 26, 149–183 (2016). (doi:10.1007/s11023-015-9367-9)
Hertwig, R. & Grüne-Yanoff, T. Nudging and boosting: steering or empowering good decisions. Perspect. Psychol. Sci. 12, 973–986 (2017). (doi:10.1177/1745691617702496)
Epley, N., Keysar, B., Van Boven, L. & Gilovich, T. Perspective taking as egocentric anchoring and adjustment. J. Pers. Soc. Psychol. 87, 327–339 (2004). (doi:10.1037/0022-3514.87.3.327)
Adida, C. L., Lo, A. & Platas, M. R. Perspective taking can promote short-term inclusionary behavior toward Syrian refugees. Proc. Natl. Acad. Sci. 115, 9521–9526 (2018). (doi:10.1073/pnas.1804002115)
Galinsky, A. D. & Moskowitz, G. B. Perspective-taking: Decreasing stereotype expression, stereotype accessibility, and in-group favoritism. J. Pers. Soc. Psychol. 78, 708–724 (2000). (doi:10.1037//0022-3514.78.4.708)
Fujisaki, I., Honda, H. & Ueda, K. A simple cognitive method to improve the prediction of matters of taste by exploiting the within-person wisdom-of-crowd effect. Sci. Rep. 12, 12413 (2022). (doi:10.1038/s41598-022-16584-7)
Yaniv, I. & Choshen-Hillel, S. When guessing what another person would say is better than giving your own opinion: Using perspective-taking to improve advice-taking. J. Exp. Soc. Psychol. 48, 1022–1028 (2012). (doi:10.1016/j.jesp.2012.03.016)
Krueger, J. & Mueller, R. A. Unskilled, unaware, or both? The better-than-average heuristic and statistical regression predict errors in estimates of own performance. J. Pers. Soc. Psychol. 82, 180–188 (2002). (doi:10.1037/0022-3514.82.2.180)
Moore, D. A. & Small, D. A. Error and bias in comparative judgment: On being both better and worse than we think we are. J. Pers. Soc. Psychol. 92, 972–989 (2007). (doi:10.1037/0022-3514.92.6.972)
Galesic, M., Olsson, H. & Rieskamp, J. Social sampling explains apparent biases in judgments of social environments. Psychol. Sci. 23, 1515–1523 (2012). (doi:10.1177/0956797612445313)
Svenson, O. Are we all less risky and more skillful than our fellow drivers? Acta Psychol. 47, 143–148 (1981). (doi:10.1016/0001-6918(81)90005-6)
Hsee, C. K. & Weber, E. U. A fundamental prediction error: Self-others discrepancies in risk preference. J. Exp. Psychol. Gen. 126, 45–53 (1997). (doi:10.1037/0096-3445.126.1.45)
Bates, D., Mächler, M., Bolker, B. M. & Walker, S. C. Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67, 1–48 (2015). (doi:10.18637/jss.v067.i01)
Koriat, A., Lichtenstein, S., Fischhoff, B. & Combs, B. Reasons for confidence. J. Exp. Psychol. Learn. Mem. Cogn. 6, 107–118 (1980). (doi:10.1037/0278-7393.6.2.107)
Soll, J. B. & Klayman, J. Overconfidence in interval estimates. J. Exp. Psychol. Learn. Mem. Cogn. 30, 299–314 (2004). (doi:10.1037/0278-7393.30.2.299)
Tsai, C.I., Klayman, J. & Hastie, R. Effects of amount of information on judgment accuracy and confidence. Organ. Behav. Hum. Decis. Process. 107, 97–105 (2008). (doi:10.1016/j.obhdp.2008.01.005)
Walters, D. J., Fernbach, P. M., Fox, C. R. & Sloman, S. A. Known unknowns: A critical determinant of confidence and calibration. Manag. Sci. 63, 4298–4307 (2017). (doi:10.1287/mnsc.2016.2580)
Strack, F. & Mussweiler, T. Explaining the enigmatic anchoring effect: Mechanisms of selective accessibility. J. Pers. Soc. Psychol. 73, 437–446 (1997).
Rader, C. A., Soll, J. B. & Larrick, R. P. Pushing away from representative advice: Advice taking, anchoring, and adjustment. Organ. Behav. Hum. Decis. Process. 130, 26–43 (2015). (doi:10.1016/j.obhdp.2015.05.004)
Epley, N. & Gilovich, T. The anchoring-and-adjustment heuristic: Why the adjustments are insufficient. Psychol. Sci. 17, 311–318 (2006). (doi:10.1111/j.1467-9280.2006.01704.x)
Hirt, E. R. & Markman, K. D. Multiple explanation: A consider-an-alternative strategy for debiasing judgments. J. Pers. Soc. Psychol. 69, 1069–1086 (1995). (doi:10.1037/0022-3514.69.6.1069)
Mussweiler, T., Strack, F. & Pfeiffer, T. Overcoming the inevitable anchoring effect: Considering the opposite compensates for selective accessibility. Pers. Soc. Psychol. Bull. 26, 1142–1150 (2000). (doi:10.1177/01461672002611010)
Herzog, S. M. & von Helversen, B. Strategy selection versus strategy blending: A predictive perspective on single- and multi-strategy accounts in multiple-cue estimation. J. Behav. Decis. Mak. 31, 233–249 (2016). (doi:10.1002/bdm.1958)
CIA. The World Factbook. Central Intelligence Agency (2020).
https://data.worldbank.org/indicator/AG.LND.AGRI.ZS
https://worldpopulationreview.com/country-rankings/total-fertility-rate

Editorial decision: Major revision
28 Nov, 2022
Reviews received at journal
13 Oct, 2022
Reviewers agreed at journal
27 Sep, 2022
Reviewers agreed at journal
14 Sep, 2022
Reviewers invited by journal
07 Sep, 2022
Editor assigned by journal
07 Sep, 2022
Editor invited by journal
07 Sep, 2022
Submission checks completed at journal
07 Sep, 2022
First submitted to journal
13 Aug, 2022
