Interviews and Survey Open-Ends

Learning Goals

Lesson

Check Understanding

What makes a question "leading"? Rewrite this question to make it non-leading: "Don't you find that test-driven development improves code quality?"

A leading question signals the answer the interviewer expects, making it socially awkward for the participant to disagree. "Don't you find that..." presupposes the answer is yes. A non-leading version removes the presupposition: "How has writing tests before code affected the quality of your work, if at all?" or more simply, "Walk me through your experience with test-driven development."

A researcher interviews developers about their debugging practices; all participants are the researcher's current colleagues. What is wrong with this sampling strategy and what should the researcher do instead?

The researcher's colleagues share a workplace, a codebase, a toolchain, and possibly a team culture. Any finding will reflect that specific environment, not debugging practice in general. This is also a convenience sample rather than a purposive one: the participants were chosen because they were available, not because they represent the range of debugging experience relevant to the research question. The researcher should define the target population more precisely (e.g., developers with at least three years of experience debugging production systems in a compiled language) and recruit participants who fit that definition from outside their immediate workplace, using purposive or snowball sampling.

What is the difference between purposive sampling and snowball sampling? When would you use each?

Purposive sampling means selecting participants because they have specific characteristics relevant to your research question. You define the criteria in advance and recruit accordingly. Snowball sampling means asking participants to refer others who might qualify. Purposive sampling is appropriate when you can identify potential participants directly — for example, by searching LinkedIn for developers who have contributed to a specific type of project. Snowball sampling is appropriate when the target population is hard to identify from the outside — for example, developers who have experienced burnout, who are unlikely to self-identify publicly.
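The referral process behind snowball sampling can be sketched in a few lines of Python. This is only an illustrative model, not a recruitment tool: the function name `snowball_sample`, the participant names, the `referrals` mapping, and the `eligible` set are all hypothetical, and the sketch assumes that only people who pass screening are asked for referrals.

```python
from collections import deque


def snowball_sample(seeds, referrals, meets_criteria, target_size):
    """Recruit in referral order, screening every candidate against the criteria."""
    recruited = []
    seen = set(seeds)      # everyone already contacted, to avoid repeat invitations
    queue = deque(seeds)   # people waiting to be screened
    while queue and len(recruited) < target_size:
        person = queue.popleft()
        if not meets_criteria(person):
            continue       # screened out; assumption: their referrals are not followed
        recruited.append(person)
        for referred in referrals.get(person, []):
            if referred not in seen:
                seen.add(referred)
                queue.append(referred)
    return recruited


# Hypothetical data: who each person would refer, and who fits the study criteria.
referrals = {
    "ana": ["ben", "chen"],
    "ben": ["dee"],
    "chen": ["ana", "eli"],
    "dee": [],
}
eligible = {"ana", "ben", "dee", "eli"}

sample = snowball_sample(["ana"], referrals, lambda p: p in eligible, target_size=4)
print(sample)  # ['ana', 'ben', 'dee']
```

In this toy run "eli" fits the criteria but is never reached, because the only person who would have referred them was screened out. The sample is shaped by who knows whom, which is the main limitation of snowball sampling to report alongside the findings.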

Why do Ameller et al.'s findings about non-functional requirements matter for how SE courses teach requirements engineering?

If courses teach that non-functional requirements should be documented in a structured format and tracked through a requirements management tool, but practitioners systematically handle them differently, students are learning a process that does not match what they will meet in practice. Ameller et al.'s interviews revealed that actual practice diverges from the prescribed process in specific, describable ways. That gap is exactly the kind of finding that should feed back into curricula, and it took interviews to discover: a survey can only ask about practices its designers already anticipate, so it would likely have missed the informal ones.

Exercises

Writing an interview guide

Write five interview questions for a study about how developers choose between writing tests before or after writing code. At least two questions must be grand-tour or experience questions. None may be leading, binary, or double-barreled. For each question, write one sentence explaining what type of information it is designed to surface and why that information is relevant to your research question.

Converting a question to a survey item

Take one of the questions you wrote in the first exercise (Writing an interview guide) and rewrite it as a Likert-scale survey item (1 = strongly disagree, 5 = strongly agree). Write one sentence explaining what information the quantitative version captures that the interview question does not, and one sentence explaining what information the interview question captures that the Likert scale cannot. Then write one sentence about when you would use each format.

Probing non-functional requirements

Ameller et al. found that developers handle non-functional requirements in ways that differ from textbook prescriptions. Pick one non-functional requirement type — performance, security, or usability — and write two interview questions that would reveal how a developer actually handles it day-to-day, without assuming they follow any particular documented process. For each question, write one sentence explaining why it avoids the leading/binary/double-barreled pitfalls.

Evaluating a sampling strategy

A study recruits interview participants by posting on Reddit and Twitter. Identify two specific ways this sampling strategy might bias the findings — think about who uses those platforms, who responds to research recruitment posts, and what that implies about whose experience is captured. For each bias, propose one mitigation and write one sentence explaining what the mitigation costs in time, money, or access.

Explaining saturation to a skeptical supervisor

You conduct 12 interviews and your supervisor asks how you know you have enough. Write a three-sentence response that explains theoretical saturation in plain language and describes one specific piece of evidence from your interviews that would support the claim that saturation has been reached — not just the assertion that you reached it. Then write a prompt you could give an LLM to help you evaluate whether your interview data shows signs of saturation:

[your prompt here]
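For the evidence part of your response, one option is to count how many previously unseen codes each successive interview contributed. The sketch below is a minimal illustration under stated assumptions: the interviews have already been coded, each interview is represented as a set of code labels, and the labels and function names are hypothetical rather than taken from any study.

```python
def new_codes_per_interview(coded_interviews):
    """For each interview, in order, count the codes that appear for the first time."""
    seen = set()
    counts = []
    for codes in coded_interviews:
        fresh = set(codes) - seen
        counts.append(len(fresh))
        seen |= fresh
    return counts


def interviews_since_last_new_code(counts):
    """Count how many of the most recent interviews added no new codes."""
    run = 0
    for n in reversed(counts):
        if n != 0:
            break
        run += 1
    return run


# Hypothetical coded data: one set of code labels per interview, in interview order.
coded_interviews = [
    {"tests first for bug fixes", "deadline pressure"},
    {"tests first for bug fixes", "tests after prototyping", "team convention"},
    {"deadline pressure", "unclear requirements"},
    {"tests after prototyping", "unclear requirements"},
    {"team convention", "tests first for bug fixes"},
    {"unclear requirements", "deadline pressure"},
]

counts = new_codes_per_interview(coded_interviews)
print(counts)                                  # [2, 2, 1, 0, 0, 0]
print(interviews_since_last_new_code(counts))  # 3
```

A run of trailing zeros supports a saturation claim but does not prove it. The counts depend on the order of the interviews and on how fine-grained the codes are, so report both alongside the numbers.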