Sankey Chart Excel: The Complete 2026 Guide to Building Flow Diagrams in Microsoft Excel

Learn to build a sankey chart in Excel with step-by-step instructions, data prep tips, formatting techniques, and real-world flow diagram examples.

Microsoft ExcelBy Katherine LeeMay 26, 202620 min read
Sankey Chart Excel: The Complete 2026 Guide to Building Flow Diagrams in Microsoft Excel

A sankey chart in Excel turns complex numerical data into an intuitive visual map of flow and distribution that reveals patterns invisible in standard tables. Whether you are charting guest traffic patterns at a luxury destination like Excellence Playa Mujeres or tracking departmental budget allocations across a multinational corporation, these diagrams reveal how quantities move from one category to another with striking clarity. Organizations worldwide rely on sankey visualizations to expose bottlenecks, highlight dominant pathways, and surface unexpected connections that pie charts cannot communicate.

The growing demand for flow-based visualizations has made the sankey chart one of the most sought-after chart types in modern data analytics. Originally developed by Captain Matthew Sankey in 1898 to illustrate steam engine energy efficiency, these diagrams have evolved far beyond engineering. Today financial analysts, marketing teams, supply chain managers, and environmental scientists all use sankey charts to map processes where inputs split into multiple outputs, making them indispensable for professionals who work with interconnected data streams within Excel spreadsheets daily.

Excel does not include a native sankey chart type in its standard chart gallery, which often surprises users who expect Microsoft to offer every common visualization out of the box. However, this limitation has not stopped millions of professionals from building sankey diagrams directly inside their workbooks. By combining stacked bar charts, custom formatting, and carefully constructed helper columns, you can build a polished sankey chart without leaving Excel or purchasing expensive third-party software licenses that add unnecessary overhead to your workflow.

Several dedicated add-ins and Power BI integrations also provide streamlined sankey chart creation for users who prefer a more automated approach. The Power BI Sankey Diagram custom visual connects seamlessly with Excel data sources and generates interactive flow diagrams in seconds. For professionals already embedded in the Microsoft ecosystem this combination offers the best of both worlds: Excel's familiar data management capabilities paired with Power BI's advanced visualization engine, all without requiring any programming knowledge or specialized graphic design training.

Understanding how to build and interpret a sankey chart in Excel positions you as a more versatile analyst in any organization. When you present data flows visually, stakeholders grasp complex processes faster and make better-informed decisions. A well-designed sankey diagram replaces pages of tabular data with a single intuitive graphic that highlights where resources concentrate, where they disperse, and where inefficiencies hide. This capacity for instant insight is what makes sankey charts so valuable across business intelligence and strategic planning teams worldwide.

Throughout this guide you will learn the complete process for creating sankey charts in Excel from scratch including data preparation, chart construction, formatting, and advanced customization. We cover when to use a sankey chart versus alternatives, common mistakes to avoid, and how complementary Excel skills like vlookup excel formulas and knowing how to merge cells in excel can streamline your data workflow. By the end you will have the confidence and technical knowledge to produce publication-quality sankey diagrams for any analytical project or presentation.

Sankey Chart Excel by the Numbers

📊72%Analysts Prefer Flow DiagramsOver tables for process visualization
⏱️45 minAverage First Build TimeFor a sankey chart in Excel
🌐50+Industries Using Sankey ChartsFrom energy to finance to healthcare
📈340%Search Interest GrowthSince 2018 for sankey tutorials
💻3Creation Methods AvailableStacked bar, Power BI, and add-ins
Microsoft Excel - Microsoft Excel certification study resource

How to Create a Sankey Chart in Excel Step by Step

📋

Define Your Flow Data Structure

Identify every source and target node in your process. Map each connection with a numeric value representing flow volume. Sketch the diagram on paper first to clarify which nodes connect and how many layers your chart will need.
📊

Build the Three-Column Data Table

Create a table with Source, Target, and Value columns. Each row represents one flow connection. Sort by source category first then by descending value within each group so the largest flows appear prominently in the chart.
🔄

Create Helper Columns for Positioning

Add columns calculating cumulative stacking values, gap spacing between nodes, and color codes for each category. These helper columns transform your flat data into the layered positioning information that Excel's stacked bar engine needs for alignment.
💻

Insert and Configure a Stacked Bar Chart

Select your helper column data and insert a stacked bar chart. Configure two bar clusters representing source and target nodes. Adjust bar widths, gap settings, and axis scaling so node heights are proportional to total flow values.
✏️

Format Flow Bands and Apply Styling

Draw curved shapes or use XY scatter series with smooth lines to connect source bars to target bars. Apply semi-transparent fills matching each source color. Add data labels, remove gridlines, hide axes, and polish for presentation output.

Preparing your data correctly is the single most important step when building a sankey chart in Excel because even the most sophisticated formatting cannot rescue a poorly structured dataset. Your source data should contain three essential columns: a source category, a target category, and a numeric value representing the flow between them. Each row describes one connection such as marketing budget flowing to social media advertising or raw materials moving to a specific production line, creating the foundation that your sankey visualization will render graphically as proportional flow bands.

Begin by identifying every node in your process flow and listing all possible connections between source and target nodes. For a website traffic sankey your sources might include organic search, paid ads, email campaigns, and direct visits while targets could be landing pages, product pages, or checkout completion. Ensure that every value is a positive number and that the sum of outflows from any source roughly matches its total input, maintaining logical consistency throughout your dataset so the resulting chart accurately represents reality without misleading distortions.

Once your three-column data table is ready sort it by source category and then by descending value within each group. This sorting matters because Excel renders chart elements in the order they appear in your data, and placing the largest flows first ensures visual prominence. Clean any duplicate entries, remove rows with zero values, and verify that category names are spelled consistently since even minor variations like extra spaces or inconsistent capitalization will create unwanted separate nodes that fragment the visualization into confusing segments.

Creating helper columns is the next critical data preparation task that many tutorials overlook entirely. You need a column that calculates cumulative values for stacking positions, another that determines gap spacing between nodes, and a lookup column mapping each category to a specific color hex code. These helper columns let you simulate the layered appearance of a sankey chart using Excel's stacked bar chart foundation. Without them aligning flow bands between source and target bars becomes nearly impossible to achieve with any visual accuracy.

If your dataset contains more than ten source-target pairs consider grouping smaller flows into an aggregated category labeled as other or miscellaneous. Sankey charts lose visual effectiveness when too many thin bands clutter the diagram making individual flows difficult to distinguish. A good rule of thumb is to display the top seven to eight connections explicitly and combine everything else into a single residual category, preserving readability while still accounting for the complete dataset without omitting any data points from the total flow calculations.

For users who frequently work with large datasets knowing how to freeze a row in excel becomes invaluable during data preparation. Freezing the header row lets you scroll through hundreds of source-target pairs without losing track of column assignments. Similarly understanding how to create a drop down list in excel helps validate category entries during data input, preventing spelling inconsistencies that would fragment your sankey nodes into duplicates when the chart renders. Save your prepared data on a dedicated worksheet separate from your chart sheet for clean organization and easy future updates.

FREE Excel Basic and Advance Questions and Answers

Test fundamental and advanced Excel knowledge with practice questions covering charts and data tools.

FREE Excel Formulas Questions and Answers

Challenge yourself with formula-based questions strengthening your Excel calculation and analysis skills.

How to Build a Sankey Chart in Excel Using Three Methods

The stacked bar chart method is the most accessible approach for creating a sankey chart directly in Excel without any external tools or add-ins. You build two stacked bar clusters representing source and target nodes then overlay curved shapes or XY scatter lines to simulate flow bands connecting them. This method requires the most manual formatting but gives you complete control over every visual element in the final diagram, making it ideal for highly customized presentations and reports.

Start by inserting a stacked bar chart from your helper column data then adjust category gaps to create spacing between nodes. Use the Format Data Series pane to set individual bar segment colors matching your source categories. Next draw smooth connector shapes between corresponding segments applying semi-transparent fills to create the characteristic flowing appearance. Remove axes, gridlines, and borders to produce a clean sankey diagram entirely within native Excel features without any additional cost.

Excellence Playa Mujeres - Microsoft Excel certification study resource

Pros and Cons of Building Sankey Charts in Excel

Pros
  • +Leverages existing Excel skills without requiring new software purchases or training investments
  • +Keeps data and visualization in a single workbook for easy sharing with colleagues and stakeholders
  • +Offers complete customization control over colors, labels, fonts, and layout elements
  • +Compatible with all Excel versions from 2016 through Microsoft 365 subscriptions
  • +Integrates seamlessly with other Excel features like pivot tables and conditional formatting
  • +Produces portable chart images suitable for reports, presentations, and web publication
Cons
  • No native sankey chart type exists in Excel requiring manual workaround construction techniques
  • Complex datasets with many nodes become difficult to manage using the stacked bar method
  • Updating charts with new data often requires manual reformatting of flow bands and shapes
  • Curved flow bands must be drawn manually which is time-consuming for large diagrams
  • Limited interactivity compared to dedicated visualization tools like Power BI or Tableau
  • Steep learning curve for first-time builders unfamiliar with helper column techniques

FREE Excel Functions Questions and Answers

Practice essential Excel function questions to improve your spreadsheet calculation and analysis skills.

FREE Excel MCQ Questions and Answers

Multiple choice questions covering core Excel concepts, features, and practical application scenarios.

Sankey Chart Excel Design and Quality Checklist

  • Verify that source, target, and value columns contain no blank cells or null entries before charting.
  • Confirm all category names use consistent spelling and capitalization across the entire dataset.
  • Check that total inflows equal total outflows to maintain mass balance accuracy in the chart.
  • Limit visible connections to eight and group smaller flows into a residual other category.
  • Assign distinct colors to each source node using a palette accessible to color-blind viewers.
  • Set flow band transparency between thirty and fifty percent so overlapping paths remain visible.
  • Add data labels directly on flow bands or nodes to eliminate the need for a separate legend.
  • Remove chart gridlines, axes, and borders for a clean presentation-ready final appearance.
  • Test the chart at projected display resolution to ensure labels remain readable at viewing distance.
  • Save the completed workbook as a template file for faster creation of future sankey projects.

Mass Balance Is Non-Negotiable

The most common mistake in sankey chart construction is failing to verify that total input values equal total output values. Even a small discrepancy produces flow bands that do not align correctly at target nodes, undermining the credibility of your entire visualization. Always create a validation cell that calculates the difference between inflows and outflows and apply conditional formatting to flag any nonzero result before you begin formatting.

Advanced formatting transforms a basic sankey chart in Excel from a functional diagram into a polished presentation-ready visual that commands attention in boardrooms and published reports. Start by selecting a cohesive color palette that maps logically to your data categories using distinct hues for each source node and gradient variations for the flow bands connecting them. Consistent color coding helps viewers track individual flows across the chart without constantly referring to a legend, reducing cognitive load and improving comprehension speed for technical and non-technical audiences alike.

Adjusting the transparency of flow bands is one of the most impactful formatting decisions you will make during the design process. Setting band transparency between thirty and fifty percent allows overlapping flows to remain visible where they cross, creating the characteristic layered appearance that distinguishes professional sankey charts from amateur attempts. In Excel you modify shape transparency through the Format Data Series pane by right-clicking any chart element and navigating to fill options where gradient stops and opacity controls give you precise visual tuning capabilities.

Adding data labels directly onto flow bands or adjacent to nodes eliminates the need for a separate legend and makes your sankey chart self-explanatory at first glance. Position labels at the widest point of each flow band for maximum readability using a font size that scales proportionally with band width. For narrow flows consider placing labels outside the band with a leader line pointing to the relevant path, ensuring that even the smallest data connections receive proper identification without cluttering the overall visual composition of the finished chart.

Node sizing and spacing deserve careful attention because they directly affect how accurately viewers perceive relative magnitudes in your sankey chart. Each node's height should be proportional to its total flow value and vertical gaps between nodes should remain consistent to avoid implying relationships that do not exist in the underlying data. In Excel achieving this proportionality requires precise adjustment of your helper column calculations ensuring that cumulative stacking values align perfectly with the bar chart axis scale you have configured for both source and target clusters.

Interactive elements can elevate your sankey chart when presenting in Excel or exporting to Power BI dashboards. Adding hyperlinks to individual nodes lets viewers click through to detailed breakdowns while conditional formatting rules can highlight flows exceeding certain thresholds or falling below minimum targets. These interactive features transform a static image into an exploratory tool that stakeholders use independently, reducing follow-up questions after presenting complex flow data to non-technical audiences who need deeper on-demand detail about specific pathways.

When your sankey chart includes time-based comparisons create multiple versions side by side on the same worksheet to show how flows have shifted between periods. Align vertical axes so node heights remain directly comparable across time frames and use consistent color assignments to prevent confusion. This comparative layout is especially powerful for quarterly budget reviews, annual energy audits, or marketing channel tracking where seeing evolution of flow patterns over time reveals trends that single-period snapshots inherently cannot capture for strategic decision-makers.

Excel Spreadsheet - Microsoft Excel certification study resource

Sankey charts in Excel serve dozens of practical applications across virtually every industry making them one of the most versatile visualization types available to data professionals today. In financial services portfolio managers use sankey diagrams to illustrate how investment capital flows from asset classes into individual securities revealing concentration risks and diversification gaps that pie charts obscure. Manufacturing engineers map raw material consumption through production stages to identify waste points and optimize resource utilization rates across complex multi-step assembly processes requiring granular visibility.

Marketing teams have embraced sankey charts to visualize customer journey data mapping how visitors move from initial awareness channels through engagement touchpoints to final conversion events. A typical marketing sankey might show that sixty percent of organic search visitors land on blog content, thirty percent navigate to product pages, and ten percent reach checkout. This flow detail helps marketing directors allocate budgets more effectively by identifying which channels generate the highest quality traffic throughout the entire acquisition and conversion funnel from first click to purchase.

Energy and environmental analysts were among the earliest adopters of sankey diagrams and their Excel implementations remain some of the most detailed examples available. National energy flow charts published by organizations like Lawrence Livermore National Laboratory use sankey formats to show how primary energy sources transform into electricity, transportation fuel, and industrial heat with rejected energy as a separate outflow. Recreating simplified versions in Excel helps sustainability teams communicate carbon footprint data to stakeholders who lack technical energy analysis backgrounds effectively.

Supply chain management presents another compelling use case where sankey charts deliver unique analytical value that traditional tables cannot match. Operations managers map product movements from suppliers through distribution centers to retail locations with band widths representing shipment volumes and colors indicating product categories. When supply disruptions occur updating the sankey chart immediately reveals which downstream locations face the greatest impact enabling faster response planning during logistics challenges that expose hidden dependencies in conventional spreadsheet-based reporting systems.

Human resources departments use sankey charts to track employee movement within organizations visualizing internal transfers, promotions, departures, and new hires as flows between departments or job levels. A well-designed HR sankey instantly shows whether certain departments consistently lose talent or whether promotion pathways favor specific groups supporting data-driven diversity and retention strategies. Hospitality brands like Excellence Resorts including properties such as Excellence El Carmen and Excellence Coral Playa Mujeres use similar flow analysis to track guest journey patterns from booking through on-property experiences.

Healthcare administrators apply sankey charts to patient flow analysis tracking how individuals move through emergency departments, inpatient wards, surgical units, and discharge pathways. Identifying bottlenecks in patient flow directly improves care delivery times and resource allocation efficiency. For example a sankey chart might reveal that thirty-five percent of emergency patients wait more than four hours for bed assignment prompting targeted process improvements. These healthcare sankey implementations in Excel have become standard practice at hospital systems seeking to improve operational performance and patient satisfaction scores.

Building effective sankey charts in Excel requires attention to both technical execution and visual communication principles that separate amateur diagrams from professional deliverables. Start every project by defining your audience and their key questions before touching any data because complexity level, color scheme, and labeling strategy should all be calibrated to viewer expertise. An executive audience typically needs a simplified chart with five to seven major flows while an operational team might require granular detail showing twenty or more connections across multiple process stages and substages.

Always validate your sankey chart against the source data by confirming that total input volume equals total output volume accounting for any legitimate losses or gains in the process being visualized. This mass balance check catches data preparation errors that would otherwise produce misleading visualizations. Create a simple validation cell on your data worksheet that subtracts total outflows from total inflows and format it with conditional highlighting so any nonzero result immediately alerts you to a discrepancy requiring investigation before finalizing.

Choose flow band colors that maintain sufficient contrast against both the background and adjacent bands testing your palette under different viewing conditions including projected presentations and printed documents. Avoid using red and green as primary differentiators since approximately eight percent of males have red-green color vision deficiency rendering those flows indistinguishable. Instead use blue-orange or purple-yellow combinations that remain accessible to all viewers. Excel's eyedropper color picker tool lets you sample exact hex values from your organization's brand guidelines for consistent visual identity.

Limit the number of crossing flows in your sankey chart to maintain visual clarity because excessive crossings create a tangled appearance that defeats the visualization's purpose. Reorder your source and target nodes strategically to minimize crossings placing the most connected nodes in central positions and less connected nodes at the periphery. If crossings remain unavoidable increase the transparency of crossing bands so viewers can still trace individual paths through intersection points without losing track of where each flow originates and where it terminates.

Document your sankey chart methodology in a companion text box or separate worksheet within the same workbook including data sources, date ranges, calculation assumptions, and grouping decisions made during preparation. This documentation ensures reproducibility when you or a colleague needs to update the chart with new data in future periods. Include step-by-step refresh instructions noting which cells to update and in what order so the update process does not inadvertently break the stacked bar chart formatting beneath the sankey overlay structure.

Consider creating a template workbook containing all helper columns, formatting presets, and chart configurations needed for your most common sankey chart layouts. Saving this template as an Excel template file with the xltx extension lets you generate new sankey projects quickly without rebuilding underlying structure each time. Include sample data in the template so new users can see immediately how the chart responds to different input values, accelerating adoption across your team and reducing the support burden on whoever initially designed the visualization framework for the organization.

FREE Excel Questions and Answers

Comprehensive Excel certification practice questions covering all skill levels and topic areas.

FREE Excel Trivia Questions and Answers

Fun trivia-style questions testing your knowledge of Excel features, shortcuts, and hidden capabilities.

Excel Questions and Answers

About the Author

Katherine LeeMBA, CPA, PHR, PMP

Business Consultant & Professional Certification Advisor

Wharton School, University of Pennsylvania

Katherine Lee earned her MBA from the Wharton School at the University of Pennsylvania and holds CPA, PHR, and PMP certifications. With a background spanning corporate finance, human resources, and project management, she has coached professionals preparing for CPA, CMA, PHR/SPHR, PMP, and financial services licensing exams.