Deck 11: Data Preparation
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Unlock Deck
Sign up to unlock the cards in this deck!
Unlock Deck
Unlock Deck
1/67
Play
Full screen (f)
Deck 11: Data Preparation
1
When coding multiple-response questions where respondents are asked to tick all answers that apply, each option is assigned a number according to its listed sequence.
False
2
A coding category must be exhaustive; that is, it should be provided for all subjects, objects and responses.
True
3
Data cleaning is a process used to determine completeness, outliers and logical consistency in the data.
True
4
ANOVA test is important for understanding distribution of data.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
5
In order to understand spread of data, the upper and lower quartile ranges are calculated.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
6
Data editing is often done by hand and is time consuming.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
7
It is always important to work on an original data file, never work with a copy.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
8
The best way to deal with data analysis problem is to ensure the reliability of the data after data gathering stage.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
9
The purpose of coding is to ensure that the data are complete, consistent and readable.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
10
Item non-response is term assigned to a respondent refusal to participate in the study.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
11
Data scientists and data analysts report that between 60% and 80% of their time is spent on data analysis.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
12
Coding is an assignment of numerical scores or classifying symbols to data.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
13
Research firms need to carefully maintain data coding system that helps maximise data integrity.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
14
When a researcher asks the respondent a question, only if the answer in a previous question is stated in a particular way; this is called design for logical consistency.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
15
Data analysis is process of cleaning and transforming raw data prior to analysis.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
16
Artificial intelligence tools can detect logs for suspiciously quick responses to detect for fraudulent responses.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
17
Many statistical packages are case-sensitive, that is 'Male' and 'male' are interpreted differently.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
18
Many data sets are deliberately set up such that missing values have a special code, e.g. '99' so that these are easily identifiable by humans.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
19
Raw data requires coding before the analysis.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
20
Multiple-response questions must be treated as a single question, with a field given for each option.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
21
Open-ended questions are usually not suitable for telephonic interviews.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
22
Open-ended questions are useful when researcher wants to understand motivation for a particular response.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
23
One should never substitute missing value to:
A) 9999
B) 99
C) 999
D) 0
A) 9999
B) 99
C) 999
D) 0
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
24
Qualitative data may be subjective in parts, as well as its analysis.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
25
Which of the following involves systematic looking for suspicious features in assessment questionnaires and databases?
A) screening
B) treatment
C) coding
D) diagnosis
A) screening
B) treatment
C) coding
D) diagnosis
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
26
What is always helpful for understanding distribution of data?
A) descriptive analysis
B) correlation analysis
C) ANOVA analysis
D) t-test
A) descriptive analysis
B) correlation analysis
C) ANOVA analysis
D) t-test
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
27
The assignment of numbers to edited data is known as:
A) editing.
B) recording.
C) coding.
D) recoding.
A) editing.
B) recording.
C) coding.
D) recoding.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
28
Coding open-ended questions is a very complex issue.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
29
Indigenous category is a term used by respondent with a particular meaning and significance in their setting.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
30
Which of the following requires an in-depth understanding of all types and sources of possible errors during the data collection and entry processes:
A) screening
B) diagnosis
C) treatment
D) codebook
A) screening
B) diagnosis
C) treatment
D) codebook
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
31
Which of the following is not a possible way for detecting fraudulent responses?
A) checking for location
B) speed of response
C) consistency in response
D) incompleteness of response
A) checking for location
B) speed of response
C) consistency in response
D) incompleteness of response
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
32
Many researchers worry about __________ the most as this is only thing that can be measured with confidence.
A) sample size
B) validity
C) reliability
D) variance
A) sample size
B) validity
C) reliability
D) variance
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
33
Which of the following are two basic rules for code-construction?
A) inclusive; independent
B) inclusive, dependent
C) mutually exclusive; independent
D) mutually exclusive; dependent
A) inclusive; independent
B) inclusive, dependent
C) mutually exclusive; independent
D) mutually exclusive; dependent
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
34
A major objective in the code-building process is to accurately transfer the meanings from written responses to numeric codes.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
35
If qualitative research is done well, then often, for the purpose of the business decision, there is no need to carry out further quantitative research
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
36
A codebook identifies all, except:
A) variable description
B) code name
C) position in the data matrix
D) missing value
A) variable description
B) code name
C) position in the data matrix
D) missing value
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
37
_____________ means identifying the nature of the defective data.
A) screening
B) diagnosis
C) treatment
D) codebook
A) screening
B) diagnosis
C) treatment
D) codebook
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
38
Pair-wise detection is a good way of handling missing data in most applications as it preserves information and amenable to even advanced statistical procedures.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
39
Closed-ended questions should be avoided in survey questionnaire.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
40
Which of the following is not an interrelated rule to make dataset tidy?
A) Each variable must have its own column.
B) Each variable must have its own row.
C) Each variable must have its own cell.
D) Each variable must have a coded-value.
A) Each variable must have its own column.
B) Each variable must have its own row.
C) Each variable must have its own cell.
D) Each variable must have a coded-value.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
41
Which of the following is not true about MAXQDA?
A) MAXQDA is little less complicated to learn than NVivo.
B) MAXQDA interface comprises of six windows.
C) MAXQDA has a coding schema.
D) MAXQDA is flexible and easier to use program.
A) MAXQDA is little less complicated to learn than NVivo.
B) MAXQDA interface comprises of six windows.
C) MAXQDA has a coding schema.
D) MAXQDA is flexible and easier to use program.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
42
The tallying of a small sample of total number of replies to a particular question in order to construct coding categories is known as:
A) code building
B) test tabulation
C) cross tabulation
D) recoding.
A) code building
B) test tabulation
C) cross tabulation
D) recoding.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
43
Which of the following is not a good example of qualitative data source?
A) focus group
B) interviews
C) survey
D) text from local news stories about a particular issue over a period of time
A) focus group
B) interviews
C) survey
D) text from local news stories about a particular issue over a period of time
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
44
Which of the following is not true about NVivo?
A) NVivo is less complicated to learn than MAXQDA.
B) NVivo is more flexible in handling data.
C) NVivo produces more specialised output.
D) NVivo is very easy to use to find similar words.
A) NVivo is less complicated to learn than MAXQDA.
B) NVivo is more flexible in handling data.
C) NVivo produces more specialised output.
D) NVivo is very easy to use to find similar words.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
45
All of the following are reasons we might choose a qualitative research method, except:
A) to develop theory.
B) to uncover possible reasons for a phenomenon.
C) to learn the triggers for an event to occur.
D) to learn the strength of a relationship.
A) to develop theory.
B) to uncover possible reasons for a phenomenon.
C) to learn the triggers for an event to occur.
D) to learn the strength of a relationship.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
46
Which of the following is about physically handling the text such as using eyeballing?
A) unmarked text
B) pawing.
C) metaphors.
D) transitions.
A) unmarked text
B) pawing.
C) metaphors.
D) transitions.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
47
When coding qualitative data, a researcher may come across terms used by the respondent that have a particular meaning and significance to the setting. This is known as a(n):
A) indigenous category.
B) connector.
C) emergent code.
D) a priori code.
A) indigenous category.
B) connector.
C) emergent code.
D) a priori code.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
48
Which of the following is not a classification of coding strategy?
A) Word repetitions
B) Compare and contrast
C) Social science queries
D) NVivo
A) Word repetitions
B) Compare and contrast
C) Social science queries
D) NVivo
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
49
Which of the following codes appear in the process of reading ad examining the data codes created before data have been gathered and examined, usually drawn from existing theory and current knowledge of the research domain?
A) A priori codes
B) Emergent codes
C) Demand codes
D) Indigenous codes
A) A priori codes
B) Emergent codes
C) Demand codes
D) Indigenous codes
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
50
Data cleaning occurs at which stage of the data analysis process?
A) Data editing
B) Data coding
C) Data entry
D) Data analysis
A) Data editing
B) Data coding
C) Data entry
D) Data analysis
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
51
Which of the following statement is not true about qualitative research?
A) Qualitative data may be subjective in part, but analysis of such data should not be.
B) Qualitative method is used to investigate strength of relationship between two or more variables.
C) In a qualitative study, coding has a role of organising and sorting data.
D) Qualitative method is used to investigate reason for something.
A) Qualitative data may be subjective in part, but analysis of such data should not be.
B) Qualitative method is used to investigate strength of relationship between two or more variables.
C) In a qualitative study, coding has a role of organising and sorting data.
D) Qualitative method is used to investigate reason for something.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
52
Which of the following is a cloud based service that permits the user to quickly see graphic representations of raw data and make transformations and recodes?
A) Trifecta wrangler
B) Tableau prep
C) Python
D) R statistic
A) Trifecta wrangler
B) Tableau prep
C) Python
D) R statistic
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
53
Which of the following is a data-visualisation package and dashboard tool that integrates well with many different online services and data types?
A) Microsoft Power
B) Micro Strategy
C) Qlik Sense
D) Python
A) Microsoft Power
B) Micro Strategy
C) Qlik Sense
D) Python
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
54
Which of the following coding strategy alert to the possibility of the new idea contained in the pieces of text that have already been coded?
A) Memos and codes
B) Constant comparison
C) Key-words-in-context
D) Connectors
A) Memos and codes
B) Constant comparison
C) Key-words-in-context
D) Connectors
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
55
In test tabulation, what is the typical percentage of responses from completed questionnaires that a researcher will use to create coding categories?
A) 10 per cent
B) 20 per cent
C) 30 per cent
D) 40 per cent
A) 10 per cent
B) 20 per cent
C) 30 per cent
D) 40 per cent
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
56
The process of using a computer to convert original codes used for raw data into codes that are suitable for analysis is known as:
A) field.
B) recording.
C) recoding.
D) coding.
A) field.
B) recording.
C) recoding.
D) coding.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
57
Codes that are established before examining the data are known as:
A) A priori codes
B) Emergent codes
C) Demand codes
D) Indigenous codes
A) A priori codes
B) Emergent codes
C) Demand codes
D) Indigenous codes
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
58
When the researcher coding qualitative data is looking for causal or logical links between terms, they are using a coding strategy known as:
A) constant comparison.
B) transitions.
C) connectors.
D) pawing.
A) constant comparison.
B) transitions.
C) connectors.
D) pawing.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
59
If a seven-point scale for variable 1 (VAR1) is to be recoded, which of the following programming statement might be used to subtract the original code from score 8?
A) VAR7 = 8 - VAR1
B) NEWVAR1 = 8 - VAR7
C) NEWVAR8 = 8 - VAR7
D) NEWVAR1 = 8 - VAR1
A) VAR7 = 8 - VAR1
B) NEWVAR1 = 8 - VAR7
C) NEWVAR8 = 8 - VAR7
D) NEWVAR1 = 8 - VAR1
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
60
Which of the following is not conducted or developed before the survey?
A) Pre-test
B) Code-book
C) Design for logical consistency
D) Screening
A) Pre-test
B) Code-book
C) Design for logical consistency
D) Screening
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
61
Which of the following is not true about code building process?
A) Code building is used to accurately transfer meaning from numeric codes to written responses.
B) Code building is based on thoughts, not just words.
C) Result of code building should be a list.
D) Code building list should be in orderly form.
A) Code building is used to accurately transfer meaning from numeric codes to written responses.
B) Code building is based on thoughts, not just words.
C) Result of code building should be a list.
D) Code building list should be in orderly form.
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
62
A record that identifies each variable in a study, and gives variable description, code name and position in the data matrix is called:
A) recoding
B) codebook
C) cross-tabulation
D) test-tabulation
A) recoding
B) codebook
C) cross-tabulation
D) test-tabulation
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
63
Which of the following coding strategy refers to notes to researcher and team during coding process?
A) Memos and codes
B) Constant comparison
C) Key-words-in-context
D) Connectors
A) Memos and codes
B) Constant comparison
C) Key-words-in-context
D) Connectors
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
64
Which of the following coding strategy refers to traditional technique of making photocopy of all transcripts and pasting them onto cards.
A) Pawing
B) Key-words-in-context
C) Cutting and sorting
D) Constant comparison
A) Pawing
B) Key-words-in-context
C) Cutting and sorting
D) Constant comparison
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
65
Which of the following approach a researcher can use in Qualtrics so that respondent do not see the question that researcher do not want them to
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
66
Which of the following is performed after the survey?
A) Pre-test
B) Detecting fraudulent responses
C) Develop a code book
D) Design for logical consistency
A) Pre-test
B) Detecting fraudulent responses
C) Develop a code book
D) Design for logical consistency
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck
67
In the process of organising and sorting, a researcher must carefully think about all, except:
A) events
B) motivations
C) contingencies
D) results
A) events
B) motivations
C) contingencies
D) results
Unlock Deck
Unlock for access to all 67 flashcards in this deck.
Unlock Deck
k this deck