SURVEY DESIGN AND QUESTIONNAIRE DATA ANALYSIS
Academic Year 2024/2025 - Docente: Luca MARTINORisultati di apprendimento attesi
General Objectives.
• Know and understand the main elements and concepts of the Bayesian inference (likelihood function, prior densities etc.).
• Know and understand the main concepts and techniques for the classical questionnaires analysis and item response theory (IRT).
• Know and understand Bayesian versions of IRT.
Synthetic General Description.
The course aims at introducing the methodology and practical techniques of
- Main elements of the Bayesian inference
- The design and (frequentist and Bayesian) analysis of questionnaires
This knowledge is relevant in several applications. Bayesian inference has found application in a wide range of activities, including machine learning, engineering, philosophy, medicine, etc. Moreover, questionnaires are frequently used by researchers from various fields to gather opinions, attitudes, and behaviors, and to identify trends or correlations. Researchers use questionnaire analysis methods to interpret the collected responses. It involves extracting valuable insights and patterns from the data gathered, enabling researchers or analysts to draw conclusions and make informed decisions based on the results Questionnaires analysis and item response theory (IRT) are nice example of application since extends the knowldge of the students from regression and classification, to another kind of application, which has several points of similarities to a classification problem handled by logistic-type regression but at the same time is completely different. Connection with parallel logistic regressors are also discussed. Bayesian versions of IRT are described.
Expected Learning Results
The course aims at introducing the methodology and practical techniques for the design of questionnaires
and Bayesian statistics. This knowledge is relevant in several areas. The course will give the main concepts and techniques for the design of questionnaires and data analysis of
collected data. On completion, students will acquire knowledge about:
- Elements of Bayesian inference;
- design of a statistical survey;
- techniques for questionnaire design;
- methods for statistical analysis of collected data and for providing statistical reports.
On completion, students will be able: a) to perform Bayesian statistics; b) to design a statistical survey; c) to analyze collected data through suitable statistical methods and models; d) to provide a statistical report for summarizing the main results. Students will able how to choose a suitable statistical model, apply sound statistical methods, and perform the analyses using statistical software Matlab, and/or R.
Course Structure
Lectures and practical data modeling in Matlab/Octave (or in R or SAS).
Required Prerequisites
- Basic of mathematics and statistics;
- Basic elements of Data Analysis and Statistical Learning;
- More specifically, as examples: concept of probability density and likelihood function, properties of the estimators, unbiasness and consistency.
Attendance of Lessons
Detailed Course Content
Introduction to Bayesian inference. (3 CFU)
- Description of the main actors: likelihood function, priors, marginal likelihood and posterior density.
- Examples of application.
- Benefits and drawbacks – practical application: Monte Carlo methods (MCMC, importance sampling etc.
- Labs in Matlab/Octave (or R).
Statistical Analyses of Questionnaire Data. (3 CFU)
- Design of Questionnaires: types of measures and Questions. Types of error in surveys.
- Evaluating survey questions; methods of Data Collection; introduction to Survey Research Methods.
- Classical theory; factorial analysis, Latent Class Analysis.
- Modern theory: Item Response Theory (IRT); likelihood of IRT – connections with parallel logistic regressors and Bayesian versions.
- Labs in Matlab/Octave (or R).
Textbook Information
C. P. Robert and G. Casella. Monte Carlo Statis- tical Methods. Springer, 2004.
-
F. Liang, C. Liu, and R. Caroll. Advanced Markov Chain Monte Carlo Methods: Learning from Past Samples. Wiley Series in Computational Statistics, England, 2010.
- L. Martino, D. Luengo, J. Míguez, "Independent Random Sampling Methods", Springer, 2018. DOI: 10.1007/978-3-319-72634-2
- L. Martino, V. Elvira. "Metropolis Sampling", Wiley StatsRef: Statistics Reference Online, 2017. arXiv:1704.04629
- V. Elvira, L. Martino, "Advances in Importance Sampling", Wiley StatsRef: Statistics Reference Online, 2020. arXiv:2102.05407
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
- Falissard B. (2012), Analysis of Questionnaire Data with R, CRC Press, Boca Raton.
- Bartolucci F., Bacci S., Gnaldi M. (2016), Statistical Analysis of Questionnaires, CRC Press, Boca Raton.
- Fowler F. J. (2009), Survey Research Methods, SAGE Publications, Thousand Oaks, California.
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
- Lecture notes and slides
Course Planning
Subjects | Text References | |
---|---|---|
1 | Introduction to the Bayesian Inference | |
2 | Application of Bayesian Inference | |
3 | Statistical Analysis of Questionnaire Data | |
4 | Item Response Theory (IRT) |
Learning Assessment
Learning Assessment Procedures
Evaluation by a research project on a topic decided jointly by student and Professor.
Practical activities (data analysis and modeling) and, possibly, an oral exam for increasing the final mark. Learning assessment may also be carried out on line, should the conditions require it.
Scale marks:
- Failed: the student does not know the basic concept of the course and has completed less than 40% of the required assignemnts
- 18-20: the student has a basic knowledge of the topics of the course but he has great difficulties in applying them to practical exercises and problem solving pipelines.
- 21-24: the student has a basic knowledge of the topics of the course and he is able to solve simple problemns and exercises with some guidance from the teacher.
- 25-27: the student has a good knowledge of the topics of the course and can complete the assignemnt in autonomy with minor errors.
- 28-30 e lode: The student has full knowledge of the topics of the course and is able to complete in autonomy assignemnts making connections and with only very minimal occasional mistakes.