Introductory Quantitative Methods
| Robert D. Duval | Office: | Class: |
| Bob.Duval@mail.wvu.edu | 301A Woodburn | G15 Woodburn |
| Phone: 293-3811 x5299 or 599-8913 |
Hrs: MTWTh 11:30-12:30 |
Hrs: TTh 10:00-11:15 |
This course introduces the graduate student to basic methods of empirical inquiry in the social sciences. The overwhelming majority of studies that test hypotheses, empirically fit models, produce predictions, or estimate policy impacts are based upon some form of quantitative or statistical analysis. This course will provide a basic introduction to statistical methods for political scientists and policy analysts. The course will provide an introductory foundation in statistical inference, enabling the student to become a competent producer of basic statistical research. In addition the skills acquired will enable the student to become a somewhat more sophisticated consumer of more advanced research methodologies.
The level of mathematical treatment is somewhat more advanced than an undergraduate class, but will still be moderate (high school algebra). We will not dwell upon derivations. [While it is desirable to have had some prior coursework in statistical inference, this course does begin with the basics. Statistical inference seems to require several repetitions to sink in, so Statistics 211 (or its equivalent) is considered a prerequisite. The student taking PS601 without having had a prior stats class will find that the class will be somewhat more difficult, but can be survived. Please discuss this with me if you have not had any prior coursework in statistics.]
The basic organization of the course is: (1) a general introduction to research methodology, (2) descriptive statistics, their use and interpretation, (3) the essential elements of probability, (4) the foundations of statistical inference, (5) and finally an overview of selected hypothesis tests.
Course Requirements
The course requirements are two in-class open-book examinations and a moderate number of assignments (~12).
The 'midterm' will be late in the semester due to the desire to spread the more mathematical elements across two
tests rather than place it all in the final. No seminar paper is required, however, the assignments will contain
written work in which clarity and sophistication of presentation are critical elements of the assignments. In addition,
the assignments will be required to be turned in as Web pages. I will provided detailed instructions on how to
create web pages and how to accomplish this task. Don't worry...It's easy!
| Grading | |
| Midterm | 30% |
| Final | 35% |
| Assignments | 35% |
Also of some value, but not required are:
Additional Features
A few other articles and books will be placed on reserve in 316 Woodburn. In addition there are several Web assignments in the course outline.
Also, my class notes and presentation slides will be available via the Web. These notes are under development and will change constantly throughout the semester. I expect them to change just before each class. I strongly recommend against extensive printing of the entire file! [Instructions on saving paper when printing Powerpoint.] I recommend printing just before class. I will frequently revise these notes right up until 9:55.
A Word about Computers
All of the assignments for the course may be accomplished with either Microsoft Excel, Corel's Quattro Pro, or Lotus 123 if it involves spreadsheets, or SPSS, SAS or Stata if it uses statistical packages. In addition, all written assignments are required to be posted as or linked from web pages. Some assignments may be spreadsheets, while others will require the use of a Web page editor. Instruction will be provided on how to create web pages for those who have never done so.
If you wish, you may elect to use your own graphics, spreadsheet, or statistical software. I will be, however, be unable to provide support except for these three packages Excel and Stata. To facilitate the assignments, I will provide some basic instruction in both Excel and Stata. Students have generally found spreadsheet software to be relatively easy to use and it also provides a powerful graphics 'toolkit'. Supplementary instruction on its use will be provided. I encourage you to become comfortable with computers to whatever degree you can. . Basic computer literacy will be an increasingly critical component of job placement.
Free software used in course. Please download at your leidure:
Course Outline with Readings (* = Recommended)
This is an initial reading list. I may add more as the semester goes on.
| Week | Topic | Description | Readings |
| Course Overview | |||
| Week 1 1/15-1/17 |
Overview |
|
|
| Week 2 1/22-1/24 |
Logic: Symbolic and Digital |
|
|
| Week 3 1/29 - 1/31 |
Description and Measurement |
|
|
| Week 4 2/5-2/7 |
Frequency Distributions |
How to describe the distribution of data with pictures and equations.
|
|
| Week 5 2/12 - 2/14 |
Probability Distributions | The Normal Curve |
|
| Week 6 2/19 - 2/21 |
Probability | Probability Experiments |
|
| Week 7 2/26 - 2/2\8 |
Sampling Distributions |
The Central Limit Theorem |
|
| Week 8 3/4-3/6 |
Hypothesis Testing | Point Estimates and Confidence Intervals |
|
| Week 9 3/11 - 3/13 |
Hypothesis Testing | One-sample hypothesis tests: the z-test. | Homework Example: Converting Nominal dollars to real dollars |
| Week 10 3/18 - 3/20 |
The t-test |
One Sample t-tests Test#1: Thursday March 15th |
|
| Week 12 3/25 - 2/27 |
Spring Break |
|
|
| Week 11 4/1- 4/3 |
Two-sample t-tests |
The Difference of Means test |
|
| Week 13 4/8 - 4/10 |
Chi-square and Contingency Table Analysis |
|
|
| Week 14 4/15 - 4/17 |
Correlation and Bivariate Regression |
|
|
| Week 15 4/22-4/24 |
Multiple Regression |
|
|
| Week 16 4/29 - 5/1 |
Wrapping it all up |
|
|
| Final Exam | See the Sample Tests on the Samples Page Spring 2003 Final |
||
All of the statistics assignments for the course may be accomplished with either Excel and Stata 9.0. If you wish, you may elect to use your own graphics, spreadsheet, or statistical software. I will, however, be unable to provide support. To facilitate the assignments, I will provide some basic instruction in Excel. Spreadsheet packages are quite easy to use and also provides a powerful graphics 'toolkit'. Supplementary instruction on Excel will be provided. I encourage you to become comfortable with computers to whatever degree you can. They are becoming increasingly important and useful, and this trend will intensify substantially over the next few years. Basic computer literacy will be an increasingly critical component of job placement.
These assignments are still in draft. They will be modified before assignment.
| Assign # | Assignment Name | Points1 | Recommended Software | Description of Work Required | Due Date2 |
| 1 | Acquiring Data from the Web | 6/4 | Netscape & Excel | 5 spreadsheets | 2/15 |
| 2 | Measures of Central Tendency and Dispersion | 6/4 | Excel | spreadsheet | 2/22 |
| 3 | Stem-and-Leaf Plot and Histogram | 6/4 | Excel | Graph + Paragraph | 3/1 |
| 4 | Pie Graph | 6/4 | Excel | Graph + Paragraph | 3/8 |
| 5 | Bar Chart | 6/4 | Excel | Graph + Paragraph | 3/8 |
| 6 | Line Graph - Time Series Plot | 6/4 | Excel | Graph + Paragraph | 3/8 |
| 7 | Converting Nominal Dollars to Real Dollars (Controlling for Inflation) | 6/4 | Excel | Graph + Paragraph | 3/15 |
| 8 | T-Test | 8/2 | Excel | 2 Paragraph | 4/5 |
| 9 | Two-sample t-test | 8/2 | Stata | 2 Paragraphs | 4/12 |
| 10 | Regression | 16/4 | Stata | 1-2 Pages | 4/26 |
2. These dates are highly tentative.
Description of Assignments
The following is a description of the assignments for PS 601. Several of them require that you select your own data to make some point. The Statistical Abstract of the United States (SAUS) is available online to assist you in finding data. You might also try a few data sources available on the PolyCy Guide
Using MS Internet Explorer and Microsoft Excel (or your own favorite Web browser and Spreadsheet) find data for that you can use for the next several assignments. These "data sets" should be for a number of cases (e.g. 50 states,) and should measure some political variables of interest to you. I would like for you to acquire this data from the Internet, so I suggest that you start with the Statisitical Abstract of the United States.
Because you will need several different sets of data, you can make this a "Data Scavenger Hunt". You need to find the following general types of data:
Save these, and their web addresses to your disk/drive. You will need the data, and its reference, later!
One of these 50 state data sets you will need to turn into a dataset for a statistical package. In order to do this, you need to:
- Remove extra labeling
- Give data columns "variable names" of 8 characters or less using only letters, numbers, and the characters "-" and "_".
- Remove blank lines
- remove any extraneous text
- Save as Excel Spreadsheet
In order to facilitate this, I have set up some basic instructions for capturing data from a Web Source. Please follow them for this assignment. The end product will be a data set saved on disk for use with Assignment #2.
Goals:
For full Content points: At least one variable, with row labels (or row number for ID). Cell entries for data are numbers - not text. It must be 'cut-and-pastable' for Stata, or readable by StatTransfer.
For full Presentation points: No extraneous rows or labeling, no fancy formating. Nothing outside what is required to be read as a data set.
Assignment # 2 Modeling compound growth
In this excercise, you will develop a scenario or model that will teach you how to use a spreadsheet to model a process. The particular process I want you to be familiar with is compound interest. Additionally, you need to look at information on the existing social security program to estimate what you will collect in retirement from it. See SSA's Calculator (which is an online simulation similar to this one!).
You are to develop a retirement fund estimator for yourself, calculating several assumptions. In order to complete this exercise, you will need to use estimates of the following information. Note that some assumptions can be either a % per year or a specific/variable $ amount - your choice.
- Your starting salary ($)
- Your expected rate of salary increased (% or $/year)
- The number of years you plan to work (n) (or retirement date.)
- The starting date for contributions to your retirement fund (year)
- A second scenario with a different starting date. (Just a second column)
- The rate of interest on funds you put into retirement (% or $/year)
- The rate that you will withdraw funds from your retirement account after you retire (% or $/year)
- The annual or monthly $ amount you expect to receive from social security ($/year)
Use reasonable guesses, not just what you'd like. Perform the calculations rather than using the @Functions. Then make the spreadsheet presentable. The bottom line is your estimate of your annual retirement income. At this point, you do not need to worry about inflation. We will return to that later. For a similar example, look at this car loan calculator.
You may template your calculator from this example.Use labels. Explore the fonts. Make the page you write well formated for printing! This exercise is designed to get you used to Windows and a Spreadsheet as a calculator.
Goals:
Assignment_3 Frequency Histogram and Stem-and-Leaf Plot
Using Stata 8.0, produce a Stem-and-Leaf plot and a Frequency Histogram using the data that you collected in assignment 1.
Goals:
Using Excel and some new data that you select, produce a pie graph. I suggest that you provide data which makes some point. Provide a paragraph which articulates how the graph demonstrates your case.
Goals:
Using Excel and the 'raw magnitude' data you obtrained inAssignment #1, produce a bar (or column) graph. I suggest that you provide data in order to make some point. Provide a paragraph which articulates how the graph demonstrates your case.
Goals:
Assignment # 6 Line Graph - Time Series Plot
Using Excel and data tiome series data from Assignment #1, produce a line graph which shows data across time (Time series data is not strictly required for this assignment, however, it is usually what is needed for this type of graph to be preferable to a bar graph.). Provide a paragraph which articulates how the graph demonstrates your case.
Goals:
Assignment # 7 Converting Nominal Dollars to Real Dollars (Controlling for Inflation)
Using Excel and the time series data used in Assignment #6 that involves money over time (e.g. per capita income for the period 1960-1995) convert the raw (nominal) dollar values into real dollars. See the handout for this assignment.
Goals:
Develop a hypothesis about whether some sample is different from some population? Collect data for which you have a population mean, and then define a sub sample from it. For instance, using one of your 50 state data sets, test whether a subsample is different from the population mean.
Goals:
Assignment # 9 Two-sample t-test
Develop a hypothesis about the difference between two groups. Collect the data and perform the appropriate comparison
between them.
Goals:
For full Content points:
For full Presentation points:
Assignment #10 Regression, Correlation, & Multiple Regression
Using a data set which I will supply, test a number of interrelated hypotheses. (Use the
presidential popularity data. ??)
Goals:
For full Content points:
For full Presentation points: