Item response theory (IRT) is arguably one of the most influential developments in educational and psychological measurement. IRT provides a foundation for statistical methods used in test development, item analysis, equating, item banking, and computerized adaptive testing. Its applications also extend to the measurement of many latent constructs across a range of disciplines. The advantage of IRT over its predecessor, classical test theory, comes primarily from its focus on the individual components that make up a measure; that is, on the test items themselves. By modeling outcomes at the item level, rather than at the test level as in classical test theory, IRT is more complex but also more comprehensive in the information it provides about test performance. The purpose of this article is to give a broad overview of the published research on and applications of IRT since its origins in the 1950s. The first part of this article introduces some General Overviews and more Specific Overviews of IRT. Later sections present journal articles that introduce IRT Model Formulations, topics in Estimation and Fit and Related Issues, and, finally, some common Applications of IRT.
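As a minimal illustration of what modeling at the item level means, the sketch below computes the probability of a correct response to each item as a function of a person's latent trait. It assumes a two-parameter logistic form, which is one common choice and is not specified in this article; the function names and the item parameter values are hypothetical and chosen only for illustration.

```python
import math


def item_probability(theta, a, b):
    """Probability of a correct response under an assumed 2PL logistic form.

    theta: person's latent trait level
    a:     item discrimination (assumed parameter)
    b:     item difficulty (assumed parameter)
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def expected_test_score(theta, items):
    """Sum the item-level probabilities to get an expected number-correct score."""
    return sum(item_probability(theta, a, b) for a, b in items)


# Hypothetical (discrimination, difficulty) pairs for a three-item test.
ITEMS = [(1.0, -1.0), (1.5, 0.0), (0.8, 1.2)]

if __name__ == "__main__":
    for theta in (-2.0, 0.0, 2.0):
        per_item = [round(item_probability(theta, a, b), 3) for a, b in ITEMS]
        print(theta, per_item, round(expected_test_score(theta, ITEMS), 3))
```

Each item contributes its own curve, so two people with the same expected total score can still differ in which items they are likely to answer correctly. A single test-level score does not carry that information.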
The general overviews listed here are a subset of the many book-length treatments of IRT published since the 1960s. They were chosen for their comprehensiveness in dealing broadly with the essential theory and applications of IRT. They all provide a thorough introduction to the traditional IRT models; however, they differ in how much they emphasize the theory and statistical underpinnings on which IRT is built versus the testing applications that IRT supports. De Ayala 2009 provides a good balance of theory, statistical foundations, and application, making it most suitable as a graduate-level textbook or general reference book. Embretson and Reise 2000, van der Linden and Hambleton 1997, and Lord 1980 are also balanced and serve as good general references, but they are less comprehensive and somewhat outdated. Baker 2001 and Hambleton, et al. 1991 both emphasize conceptual explanations and applications over equations and derivations, making them less authoritative but also simpler and more accessible to a broader audience of beginning users.
Baker, F. B. 2001. The basics of item response theory. 2d ed. College Park, MD: ERIC Clearinghouse on Assessment and Evaluation.
Second edition of a simple conceptual introduction to IRT with limited emphasis on the underlying mathematics. Includes a list of recommended readings, online resources, and exercises that can be completed in companion software. Content covers fundamentals of IRT, statistical theory, numerical methods, and the mechanics of computer programs for estimation.
de Ayala, R. J. 2009. The theory and practice of item response theory. New York: Guilford.
A comprehensive and balanced presentation of the theory and application of IRT, written for professionals and advanced graduate students. Covers the traditional unidimensional models with additional chapters on estimation approaches, multidimensional IRT, and differential item functioning. Demonstrations are provided throughout.
Embretson, S. E., and S. P. Reise. 2000. Item response theory for psychologists. Mahwah, NJ: Lawrence Erlbaum.
A readable introduction to IRT for psychologists and others having some background in testing. Traditional dichotomous and polytomous models are introduced, with details on applications in test construction (especially for noncognitive measures), person measurement, item calibration, and computerized adaptive testing, as well as cognitive, developmental, personality, and attitude measurement.
Hambleton, R. K., H. Swaminathan, and H. J. Rogers. 1991. Fundamentals of item response theory. Newbury Park, CA: SAGE.
Classic introduction to IRT. Less reliance on mathematical formulae makes it more accessible, but less authoritative. Topics include the application of IRT methods to problems in test construction, identification of differentially functioning items, test equating, and adaptive testing.
Lord, F. M. 1980. Applications of item response theory to practical testing problems. Mahwah, NJ: Lawrence Erlbaum.
A complete introduction to IRT from a leading figure in the field. Provides a thorough overview of IRT, while covering important testing topics, including analysis of multiple-choice items, the optimal number of options in such items, flexilevel and multilevel tests, tailored testing, criterion-referenced testing, equating, and item bias.
van der Linden, W. J., and R. K. Hambleton. 1997. Handbook of modern item response theory. New York: Springer.
An authoritative handbook providing a broad overview of IRT while also stressing its application to educational and psychological testing. At least as of the 1990s, it was the comprehensive reference volume for practitioners and researchers. It has six sections, each considering a class of models.