1. Descriptive statistics
Table 3 presents the means and standard deviations for each of the six expected scores (three midterm and three final), the total difference scores, the actual midterm and final exam scores, and the strategy-use scores. The final exam difference scores indicated that students expected markedly lower scores on the final exam than they had expected on the midterm exam.
Table 4 shows the descriptive statistics for the learning strategy use scores and the rankings of learning strategy utility according to strategy use priority. Following the scoring pattern described above (5 points for low-utility strategies, 3 points for strategies of moderate utility, and 1 point for high-utility strategies), we calculated an average use score of 4.62 for the 105 learning strategies that respondents selected as their first priority; this average was near the 5 points assigned to low-utility strategies. The average score for the 105 strategies selected as the third priority was 3.88, which is relatively close to the 3 points assigned to strategies of moderate utility.
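For illustration, the following minimal Python sketch reproduces the utility-weighted averaging described above. The 5/3/1 point weights come from the text; the counts in the example are invented for demonstration only and are not the study's raw data.

```python
import numpy as np

# Utility weights from the text: 5 = low utility, 3 = moderate, 1 = high.
UTILITY_POINTS = {"low": 5, "moderate": 3, "high": 1}

def use_score(selected_utilities):
    """Average utility points over the strategies students selected
    at a given priority (e.g., every respondent's first-priority choice)."""
    return np.mean([UTILITY_POINTS[u] for u in selected_utilities])

# Hypothetical example: if 95 of 105 first-priority choices were low-utility
# strategies and 10 were moderate, the average lands near the 5-point anchor.
first_priority = ["low"] * 95 + ["moderate"] * 10
print(round(use_score(first_priority), 2))  # 4.81 with these made-up counts
```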
Expected and actual midterm and final exam scores were examined for above-average effects (also termed the better-than-average effect, comparative bias, positive illusions, or the self-enhancement effect; Davidai & Deri, 2019; Heck, Simons, & Chabris, 2018; Moore, 2007; Williams & Gilovich, 2008; Zell, Strickhouser, Sedikides, & Alicke, 2020) to determine whether the average expected scores were higher or lower than the actual scores. Table 5 shows that more than 99% of students expected higher midterm scores than they achieved, whereas for the final exam the ratio of over- to underestimation was 66:34. This narrowing likely reflects that the students' knowledge of their midterm scores tempered their predictions for the final exam.
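A minimal sketch of how such an over/underestimation ratio can be computed follows; the simulated scores are assumptions chosen only to mimic the reported pattern of overestimation, not the study's data.

```python
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical expected/actual scores for 105 students (illustrative only).
actual = rng.uniform(7, 73, 105)
expected = actual + rng.normal(35, 19, 105)  # built-in overestimation bias

# Share of students whose expected score exceeded their actual score.
over = np.mean(expected > actual) * 100
print(f"overestimated: {over:.0f}% / underestimated: {100 - over:.0f}%")
```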
2. Correlations between the difference score and the actual score
We calculated significant negative correlations between the difference score and the actual score of -0.685 for the midterm exam and -0.609 for the final exam (p < .01). Consistent with the Dunning-Kruger effect, students whose expectations most exceeded their performance generally scored poorly, whereas students with modest expectations relative to their performance earned relatively high scores. Table 6 presents the ranges of actual midterm and final exam scores by quartile. As the table shows, the midterm score range for the high-performing quartile was larger than the ranges for the other quartiles, and for the final exam the ranges for the high- and low-performing quartiles differed markedly from those of the middle two quartiles.
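The correlation and quartile-range computations can be sketched as follows. The synthetic data are assumptions; only the procedure (Pearson correlation, quartile splits, within-quartile score ranges as in Table 6) mirrors the analysis described above.

```python
import numpy as np
import pandas as pd
from scipy.stats import pearsonr

rng = np.random.default_rng(1)
# Illustrative data: diff is a (expected - actual) difference score.
actual = rng.uniform(7, 73, 105)
diff = 80 - actual + rng.normal(0, 10, 105)  # induces a negative correlation

r, p = pearsonr(diff, actual)
print(f"r = {r:.3f}, p = {p:.3g}")  # expect a strong negative r

# Quartile score ranges, as in Table 6: split actual scores into quartiles
# and report max - min within each quartile.
q = pd.qcut(actual, 4, labels=["low", "2nd", "3rd", "high"])
ranges = pd.Series(actual).groupby(q, observed=True).agg(lambda s: s.max() - s.min())
print(ranges)
```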
Table 6. Ranges of actual midterm and final exam scores by quartile.
| Quartile | Midterm max.–min. score | Midterm range | # of cases | Final max.–min. score | Final range | # of cases |
|---|---|---|---|---|---|---|
| High quartile | 73–46 | 27 | 26 | 89.0–64.5 | 24.5 | 26 |
| 3rd quartile | 46–34 | 12 | 26 | 64.5–49.5 | 15.0 | 26 |
| 2nd quartile | 33–21 | 12 | 26 | 49.0–33.5 | 15.5 | 26 |
| Low quartile | 20–7 | 13 | 27 | 33.5–8.0 | 25.5 | 27 |
Figure 2 presents the trends in the differences between expected and actual midterm and final exam scores by quartile. The lowest-performing quartile showed the greatest overestimation of expected test scores. The lower-achieving students also greatly overestimated their final exam scores, whereas the higher-performing students anticipated lower final exam scores than they earned. In other words, on both the midterm and final exams, the students in this study manifested the Dunning-Kruger effect.
After the actual midterm and final exam scores were divided into quartiles, the differences between each expected score and the actual score were averaged; Figures 3 to 5 present these trends. Figure 3 shows that the maximum midterm exam difference score was 56.78 points (low quartile, 1st difference score), and the minimum was 21.77 points (high quartile, 3rd difference score). Figure 4 reveals that the maximum final exam difference score was 22.89 points (low quartile, 5th difference score), and the minimum was 11.79 points (3rd quartile, 4th difference score). Figure 5 indicates that the average final exam difference score was roughly one-third of the midterm value (11.10 vs. 35.01 points).
Figure 6 shows the distributions of differences between expected and actual midterm and final exam scores. The figure demonstrates that the high-performing quartile exhibited relatively large difference scores for the final exam, i.e., many students in this quartile underestimated their expected scores.
3. Difference in estimated scores before and after the midterm exam
1) Correlation and mean difference test
Figure 5 shows that the students in this study generally predicted significantly higher scores for the midterm exam than for the final exam. Table 7 also reveals very high correlations among the three expected midterm exam scores and among the three expected final exam scores.
Table 7. Correlations between expected and actual midterm and final exam scores.
| | Final actual score | Expected 1st (midterm) | Expected 2nd (midterm) | Expected 3rd (midterm) | Expected 4th (final) | Expected 5th (final) | Expected 6th (final) |
|---|---|---|---|---|---|---|---|
| Actual midterm score | .831** | .181 | .224* | .251** | .416** | .350** | .304** |
| Actual final score | | .118 | .222* | .245* | .425** | .384** | .376** |
| Expected 1st (midterm) | | | .777** | .679** | .466** | .510** | .467** |
| Expected 2nd (midterm) | | | | .783** | .521** | .567** | .534** |
| Expected 3rd (midterm) | | | | | .554** | .584** | .503** |
| Expected 4th (final) | | | | | | .924** | .835** |
| Expected 5th (final) | | | | | | | .895** |

*p < .05, **p < .01.
Larger difference scores indicate less-accurate estimates of actual performance, so the difference score and the actual score should correlate negatively. Table 8 shows that this negative correlation was stronger for the midterm exam than for the final exam.
Table 8. Correlations between midterm and final exam difference and actual scores.
| | Final actual score | Difference 1st (midterm) | Difference 2nd (midterm) | Difference 3rd (midterm) | Difference 4th (final) | Difference 5th (final) | Difference 6th (final) |
|---|---|---|---|---|---|---|---|
| Actual midterm score | .831** | −.689** | −.679** | −.600** | −.190 | −.240* | −.194* |
| Actual final score | | −.595** | −.538** | −.470** | −.212* | −.229* | −.127 |
| Difference 1st (midterm) | | | .887** | .815** | .240* | .270** | .185 |
| Difference 2nd (midterm) | | | | .863** | .207* | .259** | .273** |
| Difference 3rd (midterm) | | | | | .287** | .365** | .339** |
| Difference 4th (final) | | | | | | .887** | .755** |
| Difference 5th (final) | | | | | | | .825** |

*p < .05, **p < .01.
When we divided the students into quartiles by actual score and calculated the correlations between the actual scores and each of the three expected scores, we identified high positive correlations among the three expected scores within each quartile. This finding indicated consistency among a student's score estimates, irrespective of actual achievement: both the lowest- and highest-performing students maintained high positive correlations among their three estimated scores.
Six separate mean difference scores were calculated for this study: the differences between each of a student's three estimated midterm scores and that student's actual midterm score, and between each of the three estimated final exam scores and the actual final exam score. The differences among the three midterm difference scores, and among the three final exam difference scores, were not statistically significant. Overall, however, the students expected lower scores on their final exams than they had predicted for their midterms, which may reasonably be ascribed to tempered expectations after they had seen their actual midterm scores. Table 9 shows the results of a repeated-measures analysis of variance across the six difference scores (F(5, 100) = 64.54, p < .001) and Scheffé's post hoc test; the midterm difference scores (a, b, c) were significantly larger than the final exam difference scores (d, e, f).
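The repeated-measures analysis could be reproduced along the lines of the sketch below. The simulated difference scores are assumptions seeded with the per-trial means from Table 9, and statsmodels' AnovaRM is one possible implementation; it does not provide Scheffé's test, which would require a separate post hoc procedure.

```python
import numpy as np
import pandas as pd
from statsmodels.stats.anova import AnovaRM

rng = np.random.default_rng(2)
n = 105
# Long-format data: six difference scores per student, with the midterm
# trials (1-3) centered well above the final exam trials (4-6).
means = [41.5, 41.9, 37.0, 15.8, 17.3, 17.5]  # per-trial means from Table 9
rows = []
for student in range(n):
    for trial, m in enumerate(means, start=1):
        rows.append({"student": student, "trial": trial,
                     "diff": rng.normal(m, 16)})
df = pd.DataFrame(rows)

# One-way repeated-measures ANOVA over the six difference scores.
res = AnovaRM(df, depvar="diff", subject="student", within=["trial"]).fit()
print(res)
```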
Table 9. Multiple mean differences by number of difference scores.
| Difference score | # of cases | Mean | SD | F | p | Post hoc |
|---|---|---|---|---|---|---|
| Predicted 1st − midterm actual (a) | 105 | 41.46 | 19.25 | 64.54 | .000*** | a, b, c > d, e, f |
| Predicted 2nd − midterm actual (b) | | 41.92 | 18.67 | | | |
| Predicted 3rd − midterm actual (c) | | 36.96 | 19.60 | | | |
| Predicted 4th − final actual (d) | | 15.83 | 12.80 | | | |
| Predicted 5th − final actual (e) | | 17.26 | 12.96 | | | |
| Predicted 6th − final actual (f) | | 17.46 | 13.50 | | | |

***p < .001.
2) Expected final exam score by midterm difference score quartile
When we compared the midterm difference scores by quartile with the final exam difference scores by quartile, students whose midterm difference scores were in the lowest quartile predicted their final exam performance relatively accurately, whereas students whose difference scores were in the highest quartile predicted their final exam scores the least accurately. For instance, of the 26 students in the lowest quartile for the midterm exam difference score, 8 had final exam scores in the lowest quartile; 1 of those students had estimated a higher than actual final exam score, and the other 7 had underestimated their final exam scores.
In contrast, students with the largest midterm difference scores still far overestimated their final exam scores, even when their final exam scores fell in the highest quartile. Specifically, among the 27 students whose midterm difference scores were in the highest quartile, 12 had final scores that remained in the high quartile, but all 27 had anticipated higher scores than their actual final scores. Figure 7 shows the students' final exam difference scores according to their midterm exam difference score quartiles. Overall, the lower the quartile of the difference score, the more accurate the score prediction.
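A cross-tabulation of midterm against final difference score quartiles, as compared above, might be computed as in the sketch below; the simulated data are assumptions, not the study's records, and serve only to show the procedure.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(3)
# Hypothetical difference scores: final differences are shrunken and lower,
# mimicking the post-midterm correction described in the text.
mid_diff = rng.normal(40, 19, 105)
fin_diff = 0.5 * mid_diff - 10 + rng.normal(0, 8, 105)

labels = ["low", "2nd", "3rd", "high"]
mid_q = pd.qcut(mid_diff, 4, labels=labels)
fin_q = pd.qcut(fin_diff, 4, labels=labels)
print(pd.crosstab(mid_q, fin_q, rownames=["midterm"], colnames=["final"]))
```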
Table 10 presents detailed support for the Dunning-Kruger effect in this study's results, evident in the differences between the students' actual test performance (competence) and what they believed their competence to be (their illusions). The 26 students with midterm exam difference scores in the lowest quartile showed the smallest differences between their expected and actual midterm exam scores and thus can be considered the most competent at assessing their own skills. Table 10 shows that 23 students in that quartile estimated lower midterm exam scores than they earned; the remaining 3, who had overestimated their midterm scores, underestimated their expected final exam scores. In contrast, the 27 students with midterm exam difference scores in the highest quartile, i.e., those with the largest differences between expected and actual midterm scores, can be considered incompetent at judging their own competence: only 2 underestimated their actual midterm exam scores, and 25 overestimated them. The findings shown in the table clearly demonstrate the bidirectional nature of the Dunning-Kruger effect.
Figure 8 presents the difference between the midterm and final exam difference scores for each student in each quartile. Overall, each line slopes downward to the right, indicating that the final exam difference scores are smaller than the midterm difference scores. This finding supports our proposal that students predicted their final exam scores more accurately because they had evidence of their midterm exam scores. Table 11 shows that students in the highest quartile for difference scores continued to overestimate their predicted scores, whereas students in the lowest difference score quartiles tended to underestimate theirs.
Figure 9 orders the students by the difference between the sum of their three expected midterm scores and their actual midterm score, from the student with the smallest midterm difference score to the student with the largest, and then displays each student's final exam difference score. As the figure shows, all midterm difference scores were positive, i.e., every student overestimated their competence, whereas the final exam difference scores split into positive and negative values. The larger a student's midterm difference score, the more the final exam score tended to be overestimated again; the smaller the midterm difference score, the more the final exam score tended to be underestimated, with most such cases falling below zero. Comparing the midterm and final exam difference scores shows that performance predictions for the midterm were highly overestimated, whereas the final exam estimates clearly exhibited the bidirectional nature of the Dunning-Kruger effect. Moreover, although the foresight bias that arose in the midterm estimates was substantially reduced in the final exam estimates through the educational prescription of score confirmation, the bidirectional pattern of the Dunning-Kruger effect intensified.
3) The change in accuracy of prediction: difference scores
The bidirectional nature of the Dunning-Kruger effect can also be confirmed through the changes in the differences between the actual midterm and final exam scores and their difference scores. Figure 10 shows the frequency of midterm difference scores by grade according to actual midterm score quartile. The difference scores ranged from 0 to 240, which we separated into 12 grades of 20 points each. Students with high actual midterm exam scores were distributed at the low end of the difference score spectrum, and students with low actual scores were distributed at the high end.
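The 12-grade binning could be implemented as in the sketch below. The bin edges follow the 0–240 range and 20-point grade width given above, while the simulated scores are assumptions for demonstration.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(4)
# Hypothetical actual scores and total difference scores (0-240 range).
actual = rng.uniform(7, 73, 105)
total_diff = np.clip(200 - 2.4 * actual + rng.normal(0, 20, 105), 0, 240)

# Twelve 20-point grades, then frequencies by actual-score quartile.
grade = pd.cut(total_diff, bins=np.arange(0, 241, 20), include_lowest=True)
quartile = pd.qcut(actual, 4, labels=["low", "2nd", "3rd", "high"])
print(pd.crosstab(grade, quartile))
```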
The distribution presented in Figure 10 changed significantly following the final exam. Figure 11 shows the frequencies of the final exam difference scores by grade and actual final exam score quartile. Overall, students who had low actual scores and high difference scores shifted to low difference scores on the final exam, regardless of their actual final exam score quartile.
To determine how the actual scores and difference scores changed between the midterm and final exams, we collapsed the 12 difference score grades into 4 and calculated the frequencies shown in Table 11. In the lowest quartile for the difference scores, only 16 of the 105 scores (15%) were midterm exam scores, whereas the scores of 73 students (70%) were final exam scores, regardless of actual score. In addition, all 17 students in the highest quartile (the largest differences between expected and actual midterm scores) and 19 of the 29 students (66%) in the 3rd quartile had among the lowest actual scores. These findings verify that the students' knowledge of their actual midterm scores made their estimates of their final exam scores more accurate.
4. Learning strategy use, actual scores, and difference scores
Table 12 presents the results of comparing the learning strategies that students used with their actual and difference scores for the midterm and final exams. The negative correlation between the total score (midterm exam score + final exam score) and the strategy-use score indicates that high-achieving students used strategies of higher utility than did students with lower scores (recall that higher use scores denote lower-utility strategies). This trend held for the midterm and final exams separately as well.
Table 12. Relationships between learning strategies used, difference scores, and actual scores.
| Examination | Comparison | Correlation |
|---|---|---|
| Midterm + final | Actual score vs. strategies used | −.386** |
| Midterm exam | Actual score vs. strategies used | −.368** |
| Final exam | Actual score vs. strategies used | −.352** |
| Midterm exam | Difference score vs. strategies used | .251** |
| Final exam | Difference score vs. strategies used | .238* |

*p < .05, **p < .01.
In contrast with the actual scores, the difference scores had positive correlations with the strategy-use scores for both the midterm and final exams. In other words, students with high difference scores tended to have high strategy-use scores, i.e., to rely on lower-utility strategies, and students with higher difference scores predicted their performance less accurately. The use of lower-utility strategies thus reflects that students who lack the ability to judge their competence objectively also chose less-useful learning strategies. Figures 12 and 13 show the relationships between the learning strategies used and the actual and difference scores for the midterm and final exams, respectively: as actual scores increased, both the strategy-use scores and the difference scores decreased, especially at the top grades.
5. Effects of difference scores and learning strategies used on actual scores
The results of the multiple regression analyses of the effects of the difference scores and learning strategies used on actual midterm and final exam scores are as follows. First, each regression model was statistically significant (midterm exam: F(2, 102) = 58.422, p < .001; final exam: F(2, 102) = 7.262, p < .001). The models explained 53.4% of the variance in midterm exam scores and 12.5% in final exam scores. The Durbin-Watson statistic was 2.068 for the midterm model and 1.571 for the final model, both near 2, indicating that the assumption of independent residuals was satisfied. The multiple correlation coefficients were R = .731 for the midterm exam and R = .353 for the final exam, and tolerance values were above 0.1 with VIFs below 10, so neither model showed a multicollinearity problem.
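A sketch of such a regression and its diagnostics follows, assuming hypothetical data generated to echo the reported coefficients. The statsmodels calls (OLS, durbin_watson, variance_inflation_factor) illustrate one standard way to obtain the statistics reported above.

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm
from statsmodels.stats.stattools import durbin_watson
from statsmodels.stats.outliers_influence import variance_inflation_factor

rng = np.random.default_rng(5)
# Hypothetical predictors: difference score (X1) and strategy-use score (X2).
X1 = rng.normal(40, 19, 105)
X2 = rng.integers(1, 6, 105).astype(float)
y = 60 - 0.185 * X1 - 0.880 * X2 + rng.normal(0, 8, 105)

# Fit OLS with an intercept, then compute the diagnostics.
X = sm.add_constant(pd.DataFrame({"diff": X1, "strategy": X2}))
model = sm.OLS(y, X).fit()

print(model.summary())                           # R^2, B, p-values
print("Durbin-Watson:", durbin_watson(model.resid))
vif = [variance_inflation_factor(X.values, i) for i in range(1, X.shape[1])]
print("VIF:", vif, "tolerance:", [1 / v for v in vif])
```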
Regarding the significance of the regression coefficients, with the exception of the final exam difference score (p = .123), the midterm exam difference score (p < .001) and the learning strategies used on both exams (p < .01) had significant negative effects on the corresponding actual scores: actual scores decreased as the difference score and the strategy-use score increased. The unstandardized coefficients were all negative: for the midterm exam, difference score B = −.185 and learning strategies used B = −.880; for the final exam, difference score B = −.079 and learning strategies used B = −1.315. The standardized coefficients were β = −.639 for the midterm difference score, β = −.247 for the midterm learning strategies used, β = −.147 for the final exam difference score, and β = −.296 for the final exam learning strategies used. Thus, for the midterm exam, the difference score influenced the actual score more strongly than the learning strategies used, whereas for the final exam only the learning strategies used significantly affected the actual score, because the final exam difference score did not reach significance. Writing X1 for the difference score, X2 for the learning strategies used, and b0 for the intercepts (which were not reported), the estimated regression equations were Ŷ(midterm) = b0 − 0.185X1 − 0.880X2 and Ŷ(final) = b0 − 0.079X1 − 1.315X2.
In terms of the learning strategies that the students reported using, each student selected the three strategies used most frequently, in order of priority, and we assigned points by utility (5 points for strategies of low utility, 3 points for strategies of moderate utility, and 1 point for strategies of high utility). The first- and second-priority strategies averaged 4.62 and 4.49 points, respectively, close to the 5 points for low-utility strategies; the third-priority strategies averaged 3.88 points, between the low- and moderate-utility anchors. Overall, the students overwhelmingly chose learning strategies of low utility (76.8%), followed by strategies of moderate (12.7%) and high (10.5%) utility, in accordance with findings from the relevant literature (Kang, 2017; Karpicke, 2017; Seok & Kang, 2019). In addition, the correlations between the strategy-use score and the total score and the midterm and final exam scores were all significantly negative (p < .01; Table 12). These negative correlations indicate that the strategy-use score, in which higher values denote lower-utility strategies, had an inverse relationship with actual exam scores. This result reflects that students with high academic achievement used learning strategies of higher utility, which supports previous findings that high-utility learning strategies have greater learning effects than do strategies of moderate or low utility.