Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

25
Dr. Michael R. Hyman, NMS U Data Preparation

Transcript of Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

Page 1: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

Dr. Michael R. Hyman, NMSU

Data Preparation

Page 2: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

2

File, Record, and Field

Page 3: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

3

Data Matrix

Page 4: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

4

Data Entry

Process of transforming data from research projects to computers

Page 5: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

5

(1) Validation(2) Editing(3) Coding(4) Data entry/transcription(5) Machine cleaning of data

Five Steps for Data Preparation

Page 6: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

6

Check that interviews conducted as specified• Ensure respondent qualified• Interviewer looked/acted professionally• Interview conducted in proper environment• All appropriate questions asked

Validation

Page 7: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

7

Check for:• Omissions• Ambiguities• Inconsistencies• Proper skip patterns • Properly recorded answers, especially

to open-ended questions

Editing: Personal Interviews

Page 8: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

8

Check for:• All questionnaire sections and key

questions answered• Respondents understood instructions

and took task seriously• No missing pages• Questionnaire returned before cutoff

date

Editing: Self-Administered Questionnaires

Page 9: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

9

Solutions for Editing Problems

• Re-contact respondent

• Discard questionnaire

• Use only good items

– Data analysis implications (beyond scope of class)

Page 10: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

10

Coding

• Process of grouping and assigning numeric codes to different question responses

• Closed-ended questions easier because pre-coded

Page 11: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

11

Pre-coding Example

Page 12: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

12

Coding an Open-Ended Question

• Generate list of responses

• Consolidate responses (subjective judgment)

• Set response category codes

• Assign independent response category and record associated numeric code

Page 13: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

13

Portion of Travel Study Code Book

Page 14: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

14

• Validated, edited, and coded questionnaires given to data entry operator

• More accurate and efficient to go directly from questionnaire to data entry device and storage medium

• Skip coding sheets

Data Entry Process

Page 15: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

15

Data Transcription

Page 16: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

16

• Checking entered data for internal logic by either the data entry device or another connected device

• Excel/Quattro and SPSS rely on dumb data entry• Require data cleaning

Intelligent Data Entry

Page 17: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

17

Machine Cleaning of Data

• Computerized error check– Identifies and suggests fixes for logical

errors• Marginal report

– Computer-generated table of response frequencies for questions

– Monitor entry of valid codes and skip patterns

Page 18: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

18

Machine Cleaning Instructions

Page 19: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

19

Recoding Data

Page 20: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

20

Recoding Data

• Using computers to convert original codes used for raw data into codes that are more suitable for analysis

• Var1 = 8 - Var1

Page 21: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

21

Collapsing a Five-Point Likert Scale

Page 22: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

22

Coping with Missing Data

Page 23: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

23

Page 24: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

24

Item Non-response to Questions of Fact

Page 25: Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.

25

Ways to Handle Missing Responses

• Leave blank

• Case-wise deletion

• Pair-wise deletion

• Mean response

• Imputed response