How to Identify a Research Question

How to Identify a Research Question

Dr. Benjamin Bleiberg and Dr. Daniel Lefler

Foundational Research Curriculum

Introduction

Selecting a clear, clinically meaningful research question sets your future work up for success and amplifies the impact of your findings. Ensuring that you have thought through your central question carefully can save time in study design and analysis and help you get buy-in from mentors and collaborators. In this guide, we break down key steps to identifying and refining an impactful clinical research question.

Note: This guide focuses on clinical research. While there are some shared elements with basic science or translational research, the goals of that work and the process are different.

Why Having a Clear Research Question Matters

Your research question—paired with a testable hypothesis—is the north star of your project. To be useful, it should be discrete, well-defined, and answerable with the data you have. Without that clarity, even interesting findings risk being unfocused, hard to interpret, or difficult to translate into practice. A strong question frames how results are interpreted, guides productive conversations with mentors, and makes downstream steps like abstract writing, manuscript preparation, and grant development far more efficient. In practice, a well-formulated question strengthens your project in several ways:

  • Guides design and analysis: A clear question defines which variables to collect, comparators to use, and outcomes to specify, preventing scope creep and wasted effort.
  • Enables efficient mentorship: A focused question lets mentors quickly assess feasibility, suggest data sources, and connect you with collaborators who can move the project forward.
  • Accelerates writing: Abstracts and manuscripts are easier to draft when the question and outcomes are explicit, since every section flows from that anchor.
  • Improves dissemination: Reviewers and readers more easily grasp the work’s contribution and limitations when the central question is clear, making findings impactful.
  • Translates to grants: For those pursuing funding, this same clarity is critical for crafting effective pilot proposals and, later, well-defined Specific Aims in grant applications.

Steps to Developing a Research Question

Below are five practical steps to help you identify, generate, and refine a research question.

1. Remain curious and attentive

Research ideas often emerge in the flow of daily work—on rounds, in patient care discussions, during multidisciplinary discussions (e.g., transplant selection meetings or tumor boards), and when looking up evidence to guide decisions. Conversations with mentors and staff can highlight questions that still lack clear answers. Pay attention to patterns, practice variation, and outcomes patients care about, as well as inefficiencies in day-to-day care that lack good solutions, areas where new technologies are adopted before strong evidence exists, or differences in care that raise equity concerns.

Tip: Keep a running list (e.g., notes app) of questions from rounds or patient care that have incomplete answers or reflect areas of equipoise. Ask your attending: “Why do we do it this way?” If the answer is uncertainty, lack of evidence, or simply “that’s how it’s done,” you may be close to a promising research question.

Signs you may be at the edge of practice include frequent “it depends,” conflicting guidelines, or missing patient-centered outcomes—places where existing evidence leaves gaps or uncertainty.

2. Generate a hypothesis

Moving from a broad idea to a hypothesis is what makes a project studyable. A hypothesis states the exposure or intervention, the primary outcome, and the anticipated relationship between them. A practical way to shape the question is with PICO (Population, Intervention/Exposure, Comparator, Outcome), which sharpens the question and points to a testable hypothesis. If your mentor has an existing dataset or registry for a different project, there may be adjacent questions that can be reframed into a clear PICO for another study.

Example: From PICO to Hypothesis

PICO: In adults with type 2 diabetes (P), does use of SGLT2 inhibitors (I) compared with sulfonylureas (C) reduce hospitalization for heart failure (O)?

Hypothesis: Among adults with type 2 diabetes, treatment with SGLT2 inhibitors is associated with lower hospitalization rates for heart failure compared with sulfonylureas.

Tip: Choose hypotheses in areas that matter clinically or scientifically, and frame them so the answer is useful whether the association exists or not. For example, finding no difference between two common treatments—or showing a suspected risk factor is not linked to complications—can be just as impactful as positive findings.

3. Review the existing literature

After you have identified a promising idea and shaped it into a hypothesis, the next step is to see what is already known. Reviewing the literature helps ensure your project contributes new knowledge rather than repeating prior work. Ideally, your study addresses uncertainty in clinical practice by asking a new question or by approaching an existing one with novel data—for example, using a larger, more diverse, or more granular dataset. Mapping the published work helps you spot gaps, clarify limitations, and define how your question adds value. Importantly, even when a topic has been studied, there are usually related angles—different subgroups, comparators, outcomes, or real-world contexts—that remain important to explore. As you read, note which populations, comparators, outcomes, and covariates were studied, as well as the analytic approaches applied. Observing these conventions helps you anticipate data needs and align your approach with accepted methods.

Tip: Keep a focused list of the top 5–20 papers most relevant to your question. These will often become key citations in your eventual manuscript. Pay special attention to the “limitations” and “future directions” sections—authors often highlight unanswered questions that can spark your next project.

A structured question using PICO can also guide your search. Each element can be used as a keyword or MeSH term in PubMed, or as search phrases in Google Scholar. The interactive builder below can generate a starter PubMed query.

4. Iterate with mentors and define the approach

After reviewing the literature, the next step is to refine your idea with mentors and collaborators. Share your draft PICO and hypothesis, then iterate together on inclusion criteria, exposures, outcomes, covariates, and the overall study design. Most resident projects are retrospective, so discussions often focus on aligning the question with available data and right-sizing the scope for a residency timeline. Apply the FINER framework as a stress test: is the project Feasible given time, data, and skills; Interesting to you and others; Novel compared to prior work; Ethical with respect to patient data; and Relevant to clinical practice? You will return to feasibility more concretely when you assess data sources in the next step, but at this stage the goal is to ensure the project makes sense in principle and is worth pursuing.

Agenda for an early study design meeting: PICO → candidate datasets → inclusion/exclusion → key exposures → primary/secondary outcomes → key covariates → rough power/precision needs → analysis sketch → roles & authorship expectations.

5. Determine your sample and data source

With your study approach outlined, the next step is to confirm that the data exist to support it. Ask whether the dataset is large enough for your outcomes of interest, whether key variables are reliably captured, and how long access will take. This is the practical side of FINER: feasibility in terms of time, data, and resources. The realities of your sample may reshape the scope, but small studies and rare-disease cohorts can still be valuable for pilot work or hypothesis generation. In some cases, multiple datasets can be combined or used sequentially—for example, one for discovery and another for validation. If no suitable dataset exists, consider building one: manual chart abstraction with a data dictionary and entry into a secure platform (e.g., REDCap), or a prospective registry if timelines allow.

Common data sources for residents

  • EHR: Penn EHR (Epic), VA Clinical Data Warehouse (CDW)
  • Public health: NHANES, NCI SEER, HCUP National Inpatient Sample (NIS)
  • Claims: CMS Medicare, Optum, Merative MarketScan
  • Registries: Flatiron Health (oncology), NCDR, UNOS (transplant)
  • Multi-institution networks: TriNetX
  • Consortia: multi-institutional datasets via subspecialty mentors
  • Bioinformatics: GEO, Caris Life Sciences
Plan ahead: Applications, data-sharing agreements, IRB approvals, and data pulls can take weeks to months. Build this into your study timeline from the start to avoid delays.

Common Pitfalls to Avoid

  • Selecting a topic you are not genuinely interested in. You will be the driver of the project, so pick a question or method that motivates you—not just your mentor.
  • Mismatched timelines. Ensure the scope fits your training horizon; aim to present an abstract early and submit a manuscript within ~12 months. Completed work carries more weight than “in progress” during fellowship or job interviews.
  • Delaying input from mentors and biostatisticians. Early feedback can prevent infeasible designs, duplicated efforts, and rework later.
  • Duplicating prior studies without differentiation. If the space is crowded, add value by targeting a distinct population, leveraging a novel data source, or focusing on different outcomes or implementation contexts.
  • Poor question–data fit. Confirm that key variables are captured with adequate granularity and that your sample size supports the planned analyses; list must-have variables up front and avoid collecting data you will not use.
  • Spreading yourself across too many projects. Committing to several questions at once dilutes your focus; prioritize one strong idea to maximize follow-through and impact.
  • Unclear roles and authorship. Set expectations early—who is doing what, target deadlines, and authorship plan—to prevent friction later.

Conclusion

A clear, well-scoped question is the foundation of impactful research. Pair a focused hypothesis with a concise review of the literature, iterate with mentors, and confirm that the necessary data and timelines are feasible. By approaching projects this way, you position yourself not only to produce a strong abstract or manuscript during residency, but also to build the skills and habits that will support future high-quality scholarly work.

Interactive: Build & Test Your Question

Use this workspace to turn a topic into a concrete question, sanity-check its quality, right-size the scope, and launch a PubMed search.

PICO/PECO Builder

Your one-sentence question

In [Population], does [Intervention/Exposure] compared with [Comparator] affect [Outcome]?

FINER Reflection

Quality meter: 0/5

Feasibility & Scope

Suggested designs at this level: retrospective cohort, survey study, pre–post QI

Generate a PubMed Query

Suggested PubMed search string

(Fill in PICO above to generate a query)

Note: If a phrase is not a MeSH heading, PubMed’s Automatic Term Mapping still expands your text word search. Add synonyms with semicolons (e.g., metformin; biguanides) and refine terms based on what you see in the results.

Continue Learning

Related modules in the Research Curriculum: