PL-300 logo
Focused certification exam prep
Start practice

PL-300 Domain 1: Prepare the data (25-30%) - Complete Study Guide 2026

TL;DR
  • Domain 1 (Prepare the data) is worth 25-30% of the PL-300, tied with two other domains for the largest weighting.
  • Expect scenario-based questions on Power Query transformations, not just multiple-choice definitions.
  • Data profiling, error handling, and query folding are frequently tested but often under-studied.
  • The exam runs 100 minutes with 40-60 items, so Domain 1 questions get roughly 25-30 minutes of your time budget.

What Domain 1 Actually Covers

Domain 1, "Prepare the data," is one of three co-weighted domains on the PL-300 exam, each accounting for 25-30% of your final score. That places it on equal footing with Domain 2 (Model the data) and Domain 3 (Visualize and analyze the data), and noticeably heavier than Domain 4 (Manage and secure Power BI) at 15-20%. If you're mapping out your prep across all four content areas, the PL-300 Exam Domains 2026: Complete Guide to All 4 Content Areas is a useful companion to this deep dive.

In Microsoft's official skills outline (current as of April 20, 2026), Prepare the data spans everything that happens before you build a single visual: connecting to sources, shaping and profiling data, and preparing it for modeling. This is the Power Query layer of Power BI, and it demands real, hands-on fluency rather than surface-level familiarity.

Why This Domain Trips People Up: Candidates often assume Domain 1 is "the easy part" because it feels less abstract than DAX or data modeling. In practice, the sheer number of transformation functions, error-handling scenarios, and data source nuances make it easy to underprepare.

Connect to and Get Data From Multiple Sources

The first cluster of skills in Domain 1 revolves around identifying and connecting to data. You need working knowledge of how Power BI Desktop handles different source types and connectivity modes, because the exam will present you with scenarios asking which connection mode or source configuration is appropriate for a given business requirement.

Data Source Connectivity

Candidates must understand the practical differences between connection modes and when each is the right choice.

  • Import vs. DirectQuery vs. Live Connection, and the tradeoffs of each
  • Connecting to files (Excel, CSV, JSON, PDF), databases (SQL Server, Azure SQL), and online services
  • Working with dataflows and shared/certified semantic models as reusable sources
  • Understanding when to use a composite model that mixes Import and DirectQuery
  • Configuring parameters for source connections to support environment changes (dev/test/prod)

A subtlety many candidates miss: the exam doesn't just ask "what is DirectQuery." It asks you to read a business scenario (real-time reporting need, huge fact table, refresh limitations) and infer the correct connectivity approach. This scenario-based framing is consistent across the whole exam, and it's covered in more depth in the PL-300 Study Guide 2026: How to Pass on Your First Attempt.

Profile, Clean, and Transform Data

Once data is connected, Domain 1 shifts to data profiling and transformation - arguably the meatiest part of this domain. You're expected to evaluate data quality, identify anomalies, and apply the right Power Query transformation to fix them.

  • Using the Data Profiling tools (Column quality, Column distribution, Column profile) to assess completeness and accuracy
  • Identifying and resolving errors, nulls, duplicates, and inconsistent data types
  • Merging and appending queries, and understanding the difference between the two
  • Pivoting, unpivoting, and transposing data to reshape tables for modeling
  • Splitting columns, extracting values, and using conditional columns
  • Creating and applying custom functions in Power Query (M language basics)
  • Managing query dependencies and understanding the applied steps sequence

Key Takeaway

Practice building at least one query from scratch that merges two sources, unpivots a column, and resolves data type errors - this combination mirrors how Domain 1 scenarios are typically constructed on the real exam.

How Domain 1 Questions Are Written on the Exam

PL-300 is delivered through Pearson VUE, either at a test center or online proctored, with a standard fee of USD 165 (regional pricing and student discounts may apply at checkout). The exam runs 100 minutes and typically includes 40-60 items, some grouped into case studies. Microsoft doesn't publish a fixed count of scored versus unscored questions, so you should treat every item as if it counts.

For Domain 1 specifically, expect a mix of:

  • Multiple-choice questions describing a data quality problem and asking which Power Query step resolves it
  • Case study questions that give you a business scenario with multiple data sources and ask you to sequence the correct preparation steps
  • Interactive or drag-and-drop style items testing your ability to match transformation types to outcomes

Because passing requires a scaled score of 700 out of 1000, no single domain can be treated as optional - but Domain 1's weighting means a shaky grasp of Power Query can meaningfully drag your score down. If you're still deciding how much total prep time this exam deserves, How Hard Is the PL-300 Exam? Complete Difficulty Guide 2026 breaks down where most of the difficulty actually lives.

DomainWeightingCore Focus
Domain 1: Prepare the data25-30%Connect, profile, clean, transform
Domain 2: Model the data25-30%Relationships, DAX, star schema
Domain 3: Visualize and analyze25-30%Reports, visuals, analytics
Domain 4: Manage and secure Power BI15-20%Workspaces, security, deployment

Power Query Skills You Must Master

If you take one thing away from this guide, it's that Domain 1 is fundamentally a Power Query exam. Microsoft's own prerequisite guidance states candidates should already be proficient with Power Query and DAX before attempting PL-300 - Domain 1 is where that Power Query proficiency gets tested most directly.

Data Type and Structure Management

You need fluency in recognizing and correcting structural issues in raw data before it reaches the model.

  • Changing and validating column data types (text, decimal, whole number, date/time)
  • Handling locale-specific formatting issues
  • Detecting and resolving referential integrity issues between tables
  • Grouping and aggregating data at the query level

Query Performance and Folding

A frequently underestimated topic: understanding how Power Query optimizes (or fails to optimize) transformations against the source.

  • Recognizing which transformation steps support query folding
  • Understanding how folding affects refresh performance
  • Reordering steps to preserve folding where possible
  • Diagnosing why a query stopped folding partway through

These performance-oriented concepts are easy to skip in a rushed study plan, but they show up in scenario questions where the "correct" answer depends on efficiency, not just functional correctness.

Common Mistakes Candidates Make on Domain 1

  1. Treating Power Query as "just clicking buttons." The exam tests conceptual understanding of what each transformation does under the hood, not just the ability to click through the UI.
  2. Skipping the M language entirely. While you won't be asked to write complex M code from memory, understanding basic M syntax helps you interpret advanced editor scenarios.
  3. Ignoring data profiling tools. Column quality and distribution features are specifically named in the skills outline and appear in scenario questions.
  4. Confusing merge and append. This distinction is tested repeatedly in different scenario wrappers.
  5. Under-practicing with real messy datasets. Clean sample data doesn't prepare you for the kind of "find the anomaly" questions Domain 1 favors.
Practice Tip: Work through Domain 1 questions on a realistic practice platform like our PL-300 practice test engine so you get used to the scenario-based phrasing before exam day, not on exam day.

Where Domain 1 Fits in Your Study Schedule

Because Domain 1, Domain 2, and Domain 3 are each weighted 25-30%, none of them should be an afterthought - but Domain 1 pairs naturally with early study weeks since data preparation is the foundation everything else builds on. You can't model relationships or build meaningful visuals from data you haven't properly shaped.

Week 1-2

Domain 1 Foundations

  • Practice connecting to at least four different source types
  • Work through data profiling tools on a messy sample dataset
  • Build and break queries intentionally to understand error messages
Week 3-4

Domain 2 Transition

  • Move into relationships and star-schema design once your queries are clean
  • Revisit Domain 1 transformations that feed directly into modeling decisions

This sequencing matters because Domain 2 (Model the data) is widely considered the hardest domain due to DAX and star-schema complexity - going in with clean, well-understood Power Query habits from Domain 1 removes one variable from that harder stretch. Full details on Domain 2 are covered in our dedicated PL-300 Domain 2: Model the data (25-30%) - Complete Study Guide 2026.

How Domain 1 Connects to the Other Domains

Data preparation isn't an isolated skill set on this exam - it's the input to everything else. A poorly shaped table in Domain 1 creates downstream problems in Domain 2's relationships and Domain 3's visuals. Understanding this connective thread helps you study more efficiently rather than in four disconnected silos.

If you're earlier in your research and still working out what this credential involves overall, background pieces like What Is PL-300?, PL-300 Meaning, and PL-300 Certification provide useful context before you commit to a study timeline.

Who Actually Uses These Domain 1 Skills on the Job

Data preparation skills tested in Domain 1 map directly to real job responsibilities. Employers hiring for business intelligence analyst, reporting analyst, and data analyst roles routinely list Power Query, data cleaning, and source integration as core requirements - not optional extras. If you're evaluating whether this certification is worth pursuing for your career stage, Is the PL-300 Certification Worth It? Complete ROI Analysis 2026 and PL-300 Jobs both dig into how these skills translate into hiring conversations.

Understanding compensation expectations for roles that require these skills can also help you decide how much time investment makes sense; see PL-300 Salary Guide 2026: Complete Earnings Analysis for a numbers-based view.

Registration and Retake Mechanics Worth Knowing

PL-300 is scheduled through Pearson VUE at a standard US fee of USD 165, with regional pricing and student discounts shown at checkout in many locations - there are no separate member/non-member pricing tiers. There are no formal prerequisites to register, though Microsoft explicitly recommends Power Query and DAX proficiency before attempting the exam, which reinforces just how central Domain 1's subject matter is to overall readiness. Once earned, the certification is valid for 12 months and can be renewed for free through an online assessment on Microsoft Learn during the six-month window before expiry. A full cost breakdown, including what happens if you need to retake, is available in PL-300 Certification Cost 2026: Complete Pricing Breakdown.

Quick Registration Reminder: Because there's no prerequisite exam or course requirement, nothing stops you from registering before you're fully ready. Use a structured practice routine on our PL-300 practice test platform to confirm Domain 1 readiness before you book your date.

FAQ: PL-300 Domain 1

How much of the PL-300 exam is Domain 1: Prepare the data?

Domain 1 accounts for 25-30% of the exam, the same weighting range as Domain 2 (Model the data) and Domain 3 (Visualize and analyze the data). Domain 4 (Manage and secure Power BI) carries less weight at 15-20%.

Is Domain 1 easier than Domain 2 (Model the data)?

Many candidates find Domain 1 more approachable than Domain 2 because it involves less abstract logic than DAX and star-schema design. However, Domain 1 still requires deep, hands-on Power Query fluency and shouldn't be underestimated.

Do I need to know the M language for Domain 1?

You're not required to write complex M code from scratch, but understanding basic M syntax helps you interpret Advanced Editor scenarios and troubleshoot query steps that appear in exam questions.

What tools should I practice with to prepare for Domain 1?

Spend time in Power BI Desktop's Power Query Editor working with the Data Profiling tools, practicing merges, appends, pivots, and error resolution on realistic, messy datasets rather than clean sample files.

How does Domain 1 connect to the rest of the PL-300 exam?

Domain 1 skills feed directly into Domain 2's data modeling and Domain 3's visualization work. Clean, well-structured queries from Domain 1 make relationship-building and report design significantly easier, which is why it's worth mastering early in your study plan.

Ready to pass your PL-300 exam?

Put this into practice with free PL-300 questions across every exam domain.