所有类别 - Psych Tutor

The ‘crisis’ in Social Psychology - why research methods matter

4/10/2014

When I was at university studying Psychology, I became very bored with the constant focus on research methods. I was young and fascinated by the subject, but I wanted to discuss ideas and conclusions, and I became petulantly frustrated by the insistence that the methods section was the first place that had to be analysed in detail: before we even knew what the study was about we were picking apart the sampling technique! It took a few years of teaching research methods myself, and a number of subsequent scandals in different areas of Psychology, to bring me round to understanding just how crucial research methods are for a thorough understanding of Psychology. Far from being that annoying and boring extra bit that gets tagged onto most Year 12 courses, it is really the central recurring idea at the heart of everything you do as a psychologist. With the aid of a few famous examples, allow me to explain.

In 1996 John Bargh and colleagues published a famous study into ‘social priming’, the idea that our behaviour can be unconsciously affected by clues in our environment. In the study, participants had to create sentences from lists of words that they were given. Half of the participants had words associated with age and the elderly in them (such as old, lonely, grey, elderly and wise). The other half had neutral words, with no age related connotations. After thanking them for making the sentences, the researcher told the participants that the elevator was down the hallway and let them leave... at which point the real test started. The basic procedure was replicated by the BBC in the video on the right.

The replication problem...

What Bargh et al found was that the group ‘primed’ with stereotypes about the elderly walked more slowly down the corridor than the control group. They weren’t aware of moving more slowly when asked about it afterwards, but the times of the two groups showed a significant difference.

As you can imagine, such fascinating and thought-provoking results very quickly became famous within Psychology. Rather more surprisingly, however, was the fact that they seemed to also be pretty much accepted immediately. As all good Psychology students know, one key aspect of Psychology research is that the results should be reliable, and the way to show this is to repeat the experiment and to try to replicate the results. Replication is not perfect (Milgram’s experiment is a great example of a very reliable procedure where people still don’t agree on exactly what the results mean - more on that later), but it does at least indicate to us that we are dealing with a real phenomenon. The more unusual or surprising the findings, the more important it becomes to replicate them before we can have real confidence in the conclusions. It seems especially odd, then, that no one really tried to replicate Bargh et al’s experiment immediately after it was published. Two studies were done which semi-replicated the findings, but both of them changed significant aspects of the method, so they weren’t really true replications. It wasn’t until 2012 that real attempts were made to check Bargh et al’s findings, but before we get onto this study; a slight digression on why it might take 18 years for anyone to check that a famous experiment actually worked.

... but if they're negative results we won't see them

File drawers and quiet failuers

One of the most interesting aspects of looking at research methods in Psychology is that they reveal to us a lot about the psychology of psychologists (and scientists in general)! Imagine that you are a researcher, in your first proper position at a university and ready to do your first piece of real research. Would you rather a) Try to repeat an experiment that someone has already done to check their reliability... or b) try to find out something totally new (which will be far more likely to get published in a research journal than a replication study). In addition, would you a) like to produce a positive finding (which is also much more likely to get published and to raise publicity)... or b) a negative finding, where you have to report that there isn’t actually anything interesting going on.

Like almost all people, you probably answered “a” for both of those options, and science researchers are no different (it is their career and livelihood after all, so they can be forgiven for trying to do the thing which will benefit them the most). The problem is that this leads to huge bias in the way that research is done. Answering ‘a’ to the first scenario means that replication studies are far less likely to be done, and answering ‘a’ in the second means that even when they are done, negative results are often not published. They are just filed away at the bottom of a drawer somewhere and no-one ever knows about them - this is the famous ‘File-Drawer Problem’, which leads to unsuccessful research in all sciences being criminally underreported. Perhaps then... it isn’t that surprising that it took so long for anyone to check Bargh et al’s results replicated, or to raise their voice when they didn’t.

In 2012 Doyen et al replicated the study as closely as they could (though with a larger sample), but instead of experimenters timing the walk, they used infrared sensors. They found no difference in the times of the two groups. They also used experimenters who were ‘blind’ to the experimental condition (they didn’t know whether to expect slow of fast walks); again no difference. Eventually, in the words of science writer Ed Yong:

“they found that the volunteers moved more slowly only when they were tested by experimenters who expected them to move slowly… Let that sink in: the only way Doyen could repeat Bargh’s results was to deliberately tell the experimenters to expect those results.”

Maze-bright rats

We have known for a long time that the expectation of the experimenter can have a powerful effect on the results of an experiment. Either he experimenter unconsciously conveys to participants how they should behave, or they measure the results slightly differently for different groups. The experimenter may be totally unaware of the influence which s/he is exerting and the cues may be very subtle indeed but they have an influence nevertheless.

Rosenthal (1966) is famous for demonstrating how powerful these experimenter effects can be. He used several hundred of his students as experimenters and told one group that they would be studying a strain of maze-bright' rats (bred from intelligent stock) and the other half that they would be studying 'maze-dull' rats. In fact there was no great difference between the rats and they had been randomly assigned to the groups. Nevertheless, the supposedly brainy rats did learn to run the maze more quickly, according to the students’ results! In another of his studies Rosenthal found that male researchers were far more likely to smile at female participants than at male ones. Since this is likely to elicit a smile from the female participant, it means that any study on sex differences in co-operation or friendliness is liable to be spoilt.

In order to reduce these effects, carefully designed studies are often ‘blinded’. In a ‘single-blind’ condition, the participants do not know under which condition they are being tested. This prevents biased responding from participants (so-called ‘demand characteristics’), but doesn’t rule out bias from experimenters. Even better, then, to use a ‘double-blind’ design, where the experimenter does not know the conditions under which the participants are being tested. When Doyen et al double-blinded the Bargh et al design, the differences between the groups seemed to disappear. Sadly, the problems for Social Psychology don’t seem to end there.

In the last few years, with the increased focus on the methods used and the replicability of results from famous findings, more and more Social Psychology studies have come under fire. It started in fields similar to Bargh’s social priming - such as the finding that Dijksterhuis et al (1998)’s ‘intelligence priming’ experiment (where thinking about either a professor or a football hooligan before taking an IQ test sent people’s results up or down) didn’t replicate. It wasn’t helped by the sad case of Diederik Stapel, who was found to have invented or manipulated data for at least 55 of his published research papers, without anyone noticing. Slowly, though the net widened. In the last year, this has come to include the two most famous Psychology studies of all time, the Stanford Prison Experiment and Milgram’s shock experiments. Zimbardo’s experimenter bias was far more obvious than Bargh et al’s - he actively encouraged the behaviour he expected of the guards (this in addition to a host of other problems with the methods)! Milgram, for so long the other staple diet of first-year psychology students has been accused of finding out...well... nothing at all, due to his lack of controls, experimenter effect and flawed data analysis. The list goes on too... Asch’s ‘conformity studies’, Little Albert, the Hawthorne studies, the bystander effect - yet more sacred cows of Psychology being mercilessly slain on the alter of research methodology. One begins to wonder when it will all end, and indeed if we will ever be able to have confidence in any Psychology findings ever again!

Yet if a little existential doubt about our subject is the price we have to pay to move forward, then so be it. Much better to have a big spring clean and to clear out all of the old, cobwebbed research results than to spend another twenty years tripping over them. Although it moves slowly, the increased focus on replication studies and methodological checking of procedures in the last few years - including appeals from Nobel Prize-winning Psychology-celebrities (if such a thing exists) and the formation of the 'Many Labs Replication Project' - offers real hope of a new dawn of Social Psychology findings that we can have confidence in. It’s hardly that methodological problems are totally unique to Social Psych either (fMRI scans have famously taken a bashing for their statistical methods recently), and the mathematician John Ioannidis has even gone so far as to suggest that ‘most published research findings are false’, due to the faulty statistics used in the analysis of many papers.

Should we give up now then? Is there any point in even following Psychology research any further, given how flawed many of the results increasingly seem? Counter-intuitively perhaps, I see this as more of a cause for celebration than despair. It may seem pessimistic and depressing to be constantly re-evaluating research findings, but this process is exactly what science is supposed to go through! Far better this than that we credulously accept every theory that is put before us, swallowing whatever ideas are currently fashionable without ever checking the facts (the ‘paleo diet’, acupuncture or homeopathy spring to mind here). This is precisely the process which makes scientific results, whilst far from perfect, the best means that we have for generating knowledge about the world. As I’ve mentioned before on this blog, human beings are unbelievably complex organisms, who are therefore unbelievably difficult to study. All the more important then, that when we do try to study them we do it in the best way that we possibly can, and check our results as carefully as possible. And that, of course, all comes down eventually to our research methods - the most important thing in science!

32 Comments

The psychological guide to Psychology revision

25/4/2014

33 Comments

Let's start with an admission. Revising for exams is a difficult and tedious business. Hard as we might try to make the process a little more interesting and creative, using colour-coding, mindmaps, infographics or cartoon strips, sooner or later there has to come a period of sustained, hard work. Once you accept that, you're actually over one of the largest hurdles!

Students tend to do poorly in revision for one of two reasons:
1. Not enough time spent in purposeful activity (i.e. getting started and keeping going)
2. The time that is spent was used inefficiently (i.e. doing the most efficient activities)

Fortunately... psychology can help. Studies of learning, motivation, attention, concentration and many others have given us a detailed understanding of which techniques work, and which ones don't, and you can use their wisdom to improve your own revision time

1. Getting started and keeping going

In the 1920s, Bluma Zeigarnik was a psychology student in Lithuania when she noticed that waiters who were able to remember long and complex orders would forget them as soon as their deliveries were completed. She began to investigate what became known as the Zeigarnik effect, which has two main observations. Firstly, that incompleted tasks cause us anxiety and stick in our minds and secondly, that we are less easily distracted once a job is underway than before it has started.

The lessons here for your revision are clear:

1. Finishing stuff feels good and not getting stuff done feels bad! Procrastination causes anxiety, and anxiety is not fun.

2. JUST GET STARTED! Even if you are only writing the titles or organising your notes. However small the task, the fact is that if you've started, you're more likely to continue. Half an hour, or even ten minutes at a time to start making your notes may be all the time you have on some days before study leave... that doesn't matter - just do what you can. JUST START!

2. Doing the right things with your precious time

Once you've used the Zeigarnik effect to help you get started with revision tasks, it's important that you're doing the right things in that time. Here, Psychology has a huge amount to say, and a wealth of useful experimental results to make you learn more efficiently. These are summarised in more detail on the 'Revision Tips and Tricks' page of the site, but the most important ideas are:

Test yourself! Self testing is a hugely powerful tool in learning.
Do the same tasks as you have to do in the exam. Mindmaps etc can be great, but they won't be in your Psychology exam. In psychological terms... you need to make your preparation ecologically valid! Do a learning activity (mindmap, infographic etc), then do an exam question on that topic.
Follow the three golden rules of practice. Practice should be distributed (spaced out over time... so NO CRAMMING!), varied (using a vairety of activities and tasks) and interleaved (mixing subjects and topics regularly - so don't try to have whole days on one subject. You'll remember it better if you switch)

Finally, remember to also look after yourself. Rest and sleep normally (or even a bit more than normally). You need to be rested to remember things effectively. Get exercise (which has a big impact on memory). Eat well.

If you arrive at the exam fit, healthy, rested and having revised efficiently, then there'll be no stopping you. Good luck and do yourself proud!

33 Comments

What does 'normal' sleep look like for different age groups?

29/3/2014

35 Comments

An extension study in relation to Dement and Kleitman.

Almost everyone I know, whether a psychologist or not (and I do have one or two friends who aren't!) is fascinated by sleep. Be it dreaming, sleep-walking (or other forms of parasomnia) or just comparing how much they think they need each night, it's a frequent source of amusement and debate. How strange, then, to think that despite decades of research, we still know relatively little about the function of sleep, or even what 'normal' sleep looks like. Compare what you know about sleep to what you know about diet (arguably something of comparable importance to our health) for example. Many people now analyze their diets in great detail, but do we have any idea how we might do have same for our sleep?

One recent attempt to help rectify this, published in the journal PLOS One by Kevin Peters and his team, examined if age plays a role in what normal sleep looks like and the phases that make it up (termed 'sleep architecture), looking for similarities and differences between the sleep patterns of young and older adults.

Stages of sleep, including REM and sleep spindles

We already know that as people age, they tend to sleep less efficiently (sleeping for less time and spending more time awake in the night) and spend less time in REM sleep (rapid eye movement sleep, a stage of sleep where eyes move rapidly in different directions). Aging is, of course, associated with a general cognitive decline in many areas. Whether sleep changes could be a cause or an effect of these changes, however, is not yet known. We also do not yet know whether changes to certain aspects of sleep might be more important than others in accelerating or preventing cognitive decline.

Peters et al looked at two particular aspects of sleep in groups of 24 young and old adults (mean ages 20.75 and 71.17 respectively), REMs and sleep spindles (bursts of brain activity that occur in phase two of sleep, usually immediately after an outbreak of muscle twitching). Both of these have been separately linked to cognitive function in previous studies, but none of these studies have looked at the relationship between the two across different age groups. To try to make the sleep as 'natural' as possible, Peters et al excluded people with signs of depression or sleep disorders (both of which can lead to abnormal sleep patterns) and all participants had an 'acclimatization night' sleeping in their normal beds with the measurement electrodes attached but not recording.

Having said how poorly understood sleep is, it is perhaps not surprising that the results turned out to be quite complicated! Young adults had greater spindle density than older adults, but the two age groups did not seem to vary significantly in the density of their REM. Perhaps this shows that age affects sleep spindles more than REM, but this contrasts with the findings of some other studies so need to be examined in more detail. In addition, another important finding was the huge amount of variability between individuals' sleep patterns. There seem to be large individual differences between the way we all sleep and Peters et al found that in some areas (such as wakefulness in the night) these differences may actually increase as we age.

What conclusions can we draw then, from this contribution to the study of such a familiar yet mystifying topic? Given that sleep is so crucial to all of us, I find it amazing to think that it can vary so much from person to person. Finding areas of sleep that may be more important than others (for example, a very early possible link here between sleep spindles and ageing) may be exciting, but given all this variability, generalizing any findings from future experiments to all of us will surely prove a huge challenge. The baffling and intriguing study of sleep seems set to continue to give us sleepless nights for a good while yet.

35 Comments

Need directions? Ask a pilot!

14/3/2014

30 Comments

A post on new research relevant to the Maguire key study.

There are few things more frustrating or guaranteed to cause domestic strife than getting lost, so it is perhaps unsurprising that some psychologists have dedicated considerable effort to discovering which sorts of people are best suited to learning ‘cognitive maps’, our mental representations of spaces. Much of this evidence has involved taxi drivers, who continually impress in such tests (though engineering students also do well) whilst dental students seem to struggle, despite having to learn detailed spatial maps of the cross-sections of teeth.

Jennifer Sutton and her team reported results from a group of 18 aviation students on an undergraduate program which included flying time (although they had hugely varying flight experience of between 1 and 259 hours), compared against 18 controls matched for age and video-game usage (video games have previously been shown to influence spatial cognition). Participants spent five minutes navigating a virtual town (adapted from the online game ‘Counter-Strike’), into which 6 key locations had been inserted. Later they were asked to imagine travelling between two of the locations and to indicate the direction of a third using a joystick. The researchers measured the number of degrees that the estimates were out from the true bearing. Participants were also given an ‘Object Perspective Test’, a similar test of spatial processing but without the memory element of the direction judgement test.

Counter-Strike 'Italy' map... as adapted for the study

The researchers found that the pilots significantly outperformed the controls on the judgement of direction task (being on average over 20 degrees more accurate with each guess), though not on the object perception test. This would not have come as a surprise to the pilots, who rated their own spatial abilities significantly higher than the self-ratings of the controls! Sutton et al conclude that the experience of learning to fly (especially in the early stages when visual contact with the ground has to be maintained and so ‘cognitive maps’ are more likely to form) leads to superior processing of space in trainee pilots even when translated into new, more grounded scenarios such as new towns.

Sutton et al concede that there may be some other possible explanations for their findings. We don’t know, for instance, whether being a pilot helps make cognitive maps or whether people who naturally make good cognitive maps are more likely to become pilots. Also, although the researchers controlled for video game usage, one of the major variables here was the use of flight simulator games, which the trainee pilots admitted to playing far more than the controls (at least showing that their job was also their hobby). This extra flight-like experience could have contributed to the results more than their actual flight training, especially given the small amount of time some of the group had spent in the air. The only way to overcome some of these problems would be with a longitudinal study, following the development of spatial abilities in trainee pilots and controls over a number of years and beginning before their training had started. In the meantime though, if you have a pilot handy, maybe take them along on your next family holiday, just in case.

Navigation Experience and Mental Representations of the Environment: Do Pilots Build Better Cognitive Maps?

Sutton JE, Buset M, Keller M (2014) Navigation Experience and Mental Representations of the Environment: Do Pilots Build Better Cognitive Maps? PLoS ONE9(3): e90058. doi: 10.1371/journal.pone.0090058

30 Comments

Life is an idendendent measures experiment

8/3/2014

26 Comments

Picture of identical twins Martin Schoeller’s 'Identical: Portraits of Twins'

Human beings are fascinating and infuriatingly difficult objects to study. At the same time, we manage to be both unnervingly similar to each other and totally different, and working out what we can and can’t generalise to people beyond our immediate sample is one of the major challenges of the subject. This is something of a unique problem amongst sciences. The physicist never has to worry about making generalizations beyond the small crop of particles that have been studied. There is no nagging doubt in their mind that perhaps electrons who have had a different history or that are found in other countries might somehow behave differently to the electrons in their experiment. No physicist ever found themselves worrying about trying to take a stratified sample to ensure that their selection of electrons was representative of the whole population. What goes for one seems to go for all, so results can be merrily generalised without a moment’s hesitation.

In contrast, there are times in Psychology when it seems amazing that we ever feel confident enough to generalise any findings at all, so numerous are the things that can vary between the participants. Even the most tightly controlled laboratory experiments cannot even begin to account for the differences in the personalities, motivations, beliefs, aspirations, intelligence, family backgrounds, or genetic inheritance of their participants. Even in cases of identical twins, natures most perfect controlled experiment, we know that relatively minor changes in environments (such as having slightly different groups of friends in school) can lead to big differences in the individual. The more we find out about epigenetics, where the environment plays a role in gene expression, which in turn affects how we react to the environment and so on, the more complicated we realise that each individual’s life is, and the more hopeless it seems that we can ever fully understand human life in all its complexity (e.g. see here).

Independent measures design

I was thinking about this problem of generalisations the other day, and I realised that it can be summarised in quite neat research methods terminology. The problem with studying human beings is that each human life is essentially an independent measures experiment. We are all in our own little experimental group, being subjected to our own unique set of independent variables which have never been replicated before. There is no control group, only a host of other people all with their own unique social and biological factors, each in their own personal experimental group, just as unique as yours. This is precisely the problem which psychology researchers must wrestle with every time they conduct experiments, and as good Psychology students you know exactly why this is a problem. In independent measures experiments, differences between participants can confound the results. In other words, it is very hard for me to be confident that what worked for someone else will work for me, as we’re each in our own personal, totally different life-experiment, where certain variables may have completely different effects on the two of us.

What are the consequences of realising that life is an independent measures experiment? I think that there are both optimistic and pessimistic possibilities. Pessimistically, if taken too far this thought leads us into relativism and the worry that we can never learn anything from anyone else, or even that it is impossible to make informed decisions about our own lives. It certainly suggests that studying Psychology is a bit of a waste of time; what’s the point in psychological theories if we’re all too unique to be analysed by them? Personally, however, I’m more on the positive side. Optimistically, realising our ‘independent measures-ness’ means we can celebrate our uniqueness and avoid sloppy over-generalising and the assumption that simple solutions which worked for one person can be applied to everyone (I see self-help books and ‘How I made my millions’ type books in this category. See 'survivorship bias' for more on these sorts of cognitive mistakes.) Also, the fact that there are no easy answers or universal solutions to life’s problems, rather than suggesting relativism, simply makes it even more important that we base all our decisions on the best evidence available to us. No explanation or theory works for everyone all of the time, but there are some ideas which work for more people than others, more of the time than others, and the role of Psychology is to find this out. We may never be able to predict human behaviour with the precision that a physicist can an electron, but using psychological evidence we can make a decent guess. Whether it works in your own little independent measures experiment can’t be guaranteed, but it’s definitely the best place to start.

26 Comments

Meehl patterns, predicted grades and Oxbridge interviews

5/10/2013

28 Comments

The second of my posts inspired by Daniel Kahneman’s ‘Thinking, Fast and Slow’

As a student, you will slowly be getting used to teachers making predictions about you (you may never grow to like it, but you will at least get used to it). You have predicted grades for your GCSEs and predicted grades for your AS and A-Levels. The education system to some extent relies upon the idea that teachers can predict with reasonable accuracy how students will do in their exams, before they have even taken them. But how good are teachers’ predictive abilities really?

In the 1950s a psychologist called Paul Meehl reported an example where trained school councillors were asked to predict the grades of students at the end of the year. They were allowed to interview the students for 45 minutes and had access to large amounts of other data such as a personal statement, previous grades and a number of aptitude tests. At the end of the year, the predictions of the councillors were compared to another, much simpler, prediction method, a statistical equation that only looked at previous results and one aptitude test. Who do you think was more accurate? In 11 of 14 cases, the simple statistics did a better job than the professionals. Before you decide that teaching professionals are useless, however, be aware that over the next 30 years similar patterns have been found across numerous areas: from predicting success in pilot training, the price of bottles of wine, cancer survival rates, prospects of success for new businesses and many more.

These cases are called ‘Meehl patterns’ and they occur when an expert in a field tries to predict something complicated, but has a lower success rate doing so than a simple statistical formula (or ‘algorithm’). Meehl thought that the patterns illustrated a potential problem with knowing a lot about a certain subject; it makes us too confident in our judgements and more likely to try to be clever or unexpected in our predictions, rather than just sticking to the simple data. Meehl patterns work in so called ‘low-validity’ environments, ones that involve a significant degree of uncertainty and which are very hard to predict. In such difficult situations, it is often better to use simple statistics to predict performance, rather than trusting an ‘expert’.

This got me thinking about two potential examples of ‘Meehl patterns’ in school settings. The first, as I mentioned at the start, is predicted grades. I would actually argue that predicting exam performance is likely to be less of a Meehl pattern now than it may have been in the 1950s, as teachers today are far more used to using data to inform their decisions, especially data from aptitude tests and so on. What this means, of course, is not that we are able to predict grades with any great accuracy (these are still ‘low-validity’ environments, so even the best guess of an aptitude test is a pretty poor prediction), it just means that teachers might be a bit less bad at it than they used to be! Every teacher will be able to name individuals who have far exceeded their expectations (and predictions) in exam situations, and probably just as many who have gone the other way. This will often be especially pronounced in the more subjective subjects such as English, History or Psychology, where one examiner may grade an answer very differently to another. Such environments (where individual performance, question topic and marker judgements can all vary greatly) are very low-validity indeed, in fact it makes me think that it’s amazing that we ever get any predictions right at all. The next time, however, that I am tempted to give a prediction that is completely at odds with the data before me, just because I have a feeling that I know the student better, I might have to stop and reassess my own biases and the illusion of my own expertise!

The second area of school life in which I see a clear example of a Meehl pattern shows itself every October and November as students prepare their university applications and, in particular for those applying for Oxford or Cambridge, begin to practise their interview technique. Oxbridge interviews are of course the stuff of folklore; subject to countless column inches each year and the source of no end of student angst and public bemusement. They are defended to the hilt, of course, by academics who maintain that such questioning and face-to-face interactions with the applicants provide crucial insight which cannot be ascertained from the pile of glowing school references and string of A and A* grades that each student will arrive with. I can’t help but wondering, however, if they are falling prey to the illusion of their own expertise, over and above any real ability of theirs to truly pick out the most talented. Nervous students, intimidating environments, a random battery of questions which may or may not by chance have relevance to the wider reading that they have been desperately trying to do over the previous weeks; it’s hard to envisage a more ‘low-validity’ environment by which to predict future academic success.

After they’ve made their decisions, of course, the professors will have their judgements vindicated by confirmation bias, another cognitive error. Some students who the professors remember giving particularly impressive answers to interview questions will go on to do important and noteworthy things at university and beyond. Given the number of people from these two institutions who go on to high-profile and important jobs, this is not actually a very surprising fact, but it will create the impression in the minds of the professor that they were right all along, and that interviews remain the way forward.

In actual fact, I suspect that there would be a much simpler way to predict the likelihood of success at university, a method that Meehl would no doubt have approved of: AS level scores, plus the result in a standardized admissions test given by the university. The top performers across these two measures are admitted. Simpler, cheaper, very possibly fairer (especially on students from poorer backgrounds who may be far more intimidated by the atmosphere of the Oxford college than other applicants) and perhaps, if the lessons of Paul Meehl are anything to go by, a fair bit more accurate as well.

28 Comments

Does the Milgram experiment tell us that teaching Psychology is a waste of time?

21/9/2013

29 Comments

The first of a series of three posts inspired by Daniel Kahneman’s ‘Thinking, Fast and Slow’

Milgram’s (in)famous experiment is always great fun to teach. People quickly see the relevance of the study to their everyday lives and to some of the lessons of history and, let’s face it, they love how deeply unethical the whole thing is. However, another reason that students find the study so interesting is that the results are (for them) always extremely surprising. Whenever a new cohort of students learns the basic procedure for the experiment I always ask them the same question:

“If you had been a participant in Milgram’s experiment, would you have gone all the way to 450v?”

The answers are pretty consistent; it is rare that more than 10% of a given class think that they would have carried on to the bitter end.

So far this is hardly a surprising result, after all the survey of Psychology students and professionals that Milgram took before his original study predicted that only one in 1000 people would go that far! If anything, my classes seem to be a bit more realistic, although they still greatly underestimate the true obedience figure of around 65%, which Milgram found and which has been closely replicated in many studies all over the world since.

What is interesting is what comes after we’ve finished covering the study. This is when they get the same question again...

“Given what you know about the Milgram experiment, if you had been a participant, would you have gone all the way to 450v?”

The students know by this stage, of course, that in any given sample, roughly two thirds of people will obey up to 450v. They know that this result has been widely replicated. Presumably then, we would expect them to use that information to come up with a more informed judgement about whether or not they would have obeyed. Indeed the results do tend to show this... a bit! Usually in this second poll around 20-30% of students say that they might have showed full obedience, but that still leaves 70-80% of people who think they would be in the 35% of defiant participants! I’m no mathematician, but something clearly doesn’t seem to add up.

We are faced with something of a psychological problem. How do we explain this difference between what the students know are the results for everyone else and what they assume will be the results for them? Fortunately there are some clear psychological principles that can help us to explain this pattern. Rather more unfortunately, they do seem to have the worrying implication that teaching Psychology at all might be a waste of time!

Explanation 1 - The Fundamental Attribution Error (Ross, 1977)

The fundamental attribution error is a complicated name for a very simple thing. It is the tendency that we have to explain other people’s behaviour by reference to their character (he’s a jerk, she’s arrogant etc), whilst explaining our own behaviour with reference to the situation we’re in (the room was too cold, it was dark and I couldn’t see etc). In psychological terms, we overestimate the effect of dispositions and underestimate the effect of situations in explaining other people’s behaviour (and the opposite for ourselves). For example if we see the person next to us receive a bad mark in a test, we’re likely to jump to dispositional conclusions (“she’s stupid”). However, if a minute later we get out test back and we have scored the same mark, we are far more likely to find situational explanations for the result (“There was too much noise in the room and I didn’t sleep properly the night before).

The fundamental attribution error - as illustrated by Charles Schultz's 'Peanuts' cartoons

Explanation 2 - Neglect of base rates

In 1975 Psychologists Richard Nisbett and Eugene Borgida conducted an experiment in which participants were told the results of a famous ‘helping experiment’ (Darley and Latane, 1968). The helping experiment found that only 4 out of 15 people (27%) went to help another participant (actually a confederate) who they thought was having a seizure. Nisbett and Borgida’s test was very simple; they told participants the results of this study and then asked them if they would have helped. The assumption was that, having been given the normal frequency of helping behaviour (the base rate), the participants would use this in their own decisions, probably coming up with a lower estimate than they might have done otherwise. Nisbett and Borgida found that... they didn’t. In fact the predictions of the experimental group were no different to those of a control group who hadn’t been given the base rates at all; well over half of participants said they’d help. The extra information made no difference! People simply didn’t use it when coming to their decision.

These two phenomena are very closely related; indeed, one may cause the other (base rate neglect may cause the fundamental attribution error, or vice versa), but they are both very useful tools for looking again at the results in my Milgram surveys. In my classes on Milgram, just as for Nisbett and Borgida, the base rate of ‘what most people do’ seems to be completely ignored when people have to decide ‘what would I do?’ Perhaps this is the result of the fundamental attribution error, where we still can’t help but think of the obedient subjects as ‘cruel’ or ‘submissive’ or even just ‘weird’ (even though we know deep down that they were just normal people like us). Either way, the end result seems to be that every time we are confronted with the question of what we would do as individuals, we ignore any evidence from what other people normally do and assume that we are different.

As a psychologist, this is an interesting finding, but it is also a slightly depressing one. Why? Because it seems to imply that learning about psychological facts has absolutely no impact at all upon our actions or decision making! This is a deeply worrying thought. As teachers, although we spend more of our lives than we would like banging on about exams, in truth most of us are actually hoping to give you something more than good grades and a lifelong habit of underlining the date and title. We hope, perhaps naively, that people might leave out classes with ‘life skills’, ideas for different ways of thinking and acting that could benefit you long after you leave school. As Psychology teachers, this is especially true, given that so many of the topics we cover are of direct relevance to people’s lives.

What the consistent surprise at the Milgram experiment’s results, as well as studies like Nisbett and Borgida’s, seem to show, however, is that we are wasting our time. You can write as many essays and learn as many details about the Milgram experiment as you want, but when it comes down to it in real life, when you actually test whether someone’s way of thinking or behaving has been altered by the information we’ve passed on, nothing’s changed!

So there you have it. The reasons why Milgram’s results are still so surprising today. As a psychologist, fascinating; as a teacher, terrifying! Do you agree? Has a lesson you’ve had, teacher you’ve liked or a topic you’ve studied ever changed the way you think? Comment below and stop me from getting depressed!