Excel Practice Test

The LINEST function in Excel calculates statistics for a linear regression — slope, intercept, R-squared, standard errors, F-statistic, and more — using the least squares method. It's one of Excel's most powerful statistical functions, but its array-formula syntax and dense output table intimidate many users. Once you understand the structure, LINEST replaces hours of manual regression analysis with a single formula.

What LINEST does. Given a set of dependent values (Y) and one or more independent variables (X), LINEST returns the line of best fit — y = mx + b for simple regression, or y = m₁x₁ + m₂x₂ + ... + mₙxₙ + b for multi-variable regression. It also returns regression statistics that tell you how well the line fits the data.

When to use LINEST. Trend analysis (sales over time, weight loss over weeks). Predictive modeling (forecast future values based on past trends). Multi-variable analysis (how do multiple factors affect outcome). Statistical validation (is the relationship significant). Comparing models (which combination of variables fits best).

Why LINEST over SLOPE/INTERCEPT. SLOPE and INTERCEPT return single values for simple regression. LINEST returns the full statistical output — slope, intercept, errors, R-squared, F-stat, residuals — and supports multi-variable models. For serious analysis, LINEST is essential.

The catch. LINEST is an array formula (CSE in older Excel, dynamic in Excel 365). The output is a 2D table, not a single value. You need to select the output range, enter the formula, and confirm with Ctrl+Shift+Enter (or just Enter in 365). Once you understand this, the rest is straightforward.

This guide covers LINEST syntax, simple regression examples, multi-variable models, the full statistics table, and common pitfalls. It's for analysts, students, and data professionals using Excel for regression.

Function Overview
  • Syntax: =LINEST(known_y's, [known_x's], [const], [stats])
  • Returns: Slope, intercept (and optionally regression statistics)
  • Output: 2D array — 5 rows × (n+1) columns when stats=TRUE
  • Array formula: Excel 365 = dynamic. Older Excel = Ctrl+Shift+Enter
  • known_y's: Dependent variable values (required)
  • known_x's: Independent variable values (optional, defaults to 1,2,3...)
  • const: TRUE = include intercept (default), FALSE = force through origin
  • stats: TRUE = return full statistics, FALSE/omitted = slope/intercept only
  • Multi-variable: Use multiple X columns for multiple regression
  • Related: SLOPE, INTERCEPT, FORECAST.LINEAR, TREND, LOGEST
Try a Free Excel Practice Test

Basic LINEST syntax and simple regression. Start with the easiest case.

Simple linear regression. You have Y values (dependent) and X values (independent). You want the slope (m) and intercept (b) of the best-fit line y = mx + b.

Formula. In Excel 365: =LINEST(B2:B11, A2:A11). The result spills automatically into two cells — slope first, intercept second. Note: the slope is in the LEFT cell, intercept on the RIGHT. This counter-intuitive order trips up many users.

In older Excel. Select two adjacent cells (e.g., D2:E2). Type =LINEST(B2:B11, A2:A11). Press Ctrl+Shift+Enter (CSE) to confirm as an array formula. Excel adds curly braces {=LINEST(...)} indicating array formula.

Reading the output. Cell D2 shows slope (m). Cell E2 shows intercept (b). To predict Y for any X value: Y = D2 * X + E2. Or use the FORECAST.LINEAR function.

Adding statistics. Set the stats argument to TRUE: =LINEST(B2:B11, A2:A11, TRUE, TRUE). Now LINEST returns a 5×2 array of statistics (for simple regression). You need to select 5 rows × 2 cols before entering the formula (or rely on dynamic spill in 365).

The five rows of statistics output (for simple regression). Row 1: slope (m), intercept (b). Row 2: standard error of slope, standard error of intercept. Row 3: R-squared, standard error of Y estimate. Row 4: F-statistic, degrees of freedom. Row 5: regression sum of squares, residual sum of squares. Each tells you something about the regression quality.

For multi-variable regression. The columns expand. If you have 3 X variables, you get a 5×4 output: slopes for X₃, X₂, X₁ (right-to-left in input order!), intercept, then statistics. The 'right-to-left' ordering is another LINEST quirk that catches people.

Understanding the Output

🔴 Slope (m)

How much Y changes per unit change in X. The line's steepness.

🟠 Intercept (b)

Y value when X = 0. Where the line crosses the Y-axis.

🟡 R-squared (R²)

How well the line fits the data. 0-1; closer to 1 means better fit.

🟢 Standard Error

Precision of the estimate. Smaller = more precise.

🔵 F-statistic

Is the regression significant overall? Higher = stronger relationship.

🟣 Degrees of Freedom

Sample size minus parameters estimated. Influences statistical tests.

Practical example. Predicting sales based on advertising spend.

Data setup. Column A: monthly advertising spend ($). Column B: monthly sales ($). 12 months of data in rows 2-13.

Question: How much do sales increase per dollar of advertising? What's the predicted sales if we spend $50,000 on advertising next month?

Simple LINEST. =LINEST(B2:B13, A2:A13) returns slope and intercept. Suppose slope = 5.2 (each $1 advertising = $5.20 sales) and intercept = 25000 (baseline sales of $25,000 without advertising).

Prediction. For $50,000 ad spend: 5.2 × 50000 + 25000 = $285,000 predicted sales.

Adding R-squared. =LINEST(B2:B13, A2:A13, TRUE, TRUE). Select 5×2 cells before pressing Ctrl+Shift+Enter (or use dynamic spill in 365). R² appears in row 3, column 1. If R² = 0.85, advertising explains 85% of sales variance — a strong predictor. If R² = 0.30, the relationship is weak; other factors dominate.

Statistical significance. The F-statistic (row 4, column 1) tells you if the regression is statistically meaningful. To test it: compute the F critical value at your chosen alpha (typically 0.05) using F.INV.RT(0.05, df1, df2). If your F-statistic > F critical, the regression is significant.

Standard error of slope (row 2, column 1). Useful for testing if slope is different from zero. T-statistic = slope ÷ standard error. If |T| > about 2 (for typical sample sizes), the slope is statistically significant.

Predicted Y standard error (row 3, column 2). The typical error of the regression's predictions. If 25,000 with predicted Y of 285,000, you should expect actual sales somewhere in $260,000-310,000 range typically.

LINEST Examples

📋 Sales Trend

X = month number (1-12). Y = monthly sales. =LINEST(B2:B13, A2:A13, TRUE, TRUE). Slope shows monthly sales growth ($). Intercept = baseline. R² = how linear the trend is. Useful for forecasting next quarter.

📋 Multi-Variable

Y = home price. X1 = square feet, X2 = bedrooms, X3 = age. =LINEST(B2:B100, C2:E100, TRUE, TRUE). Returns slopes for each variable + intercept. Output is 5×4 array. Slopes are in RIGHT-to-LEFT order (age, bedrooms, sqft, intercept).

📋 Time Series

Y = daily sales. X = day number (1, 2, 3...). =LINEST(B2:B100, A2:A100). Slope shows daily sales growth. For non-linear trends use LOGEST (exponential) or polynomial regression with multiple X columns.

📋 Without Intercept

=LINEST(B2:B11, A2:A11, FALSE, TRUE). Forces regression through origin (b = 0). Use when theory says Y should equal 0 when X = 0 (e.g., zero ad spend = zero ad-driven sales). Generally rare; usually keep intercept (TRUE).

📋 Polynomial Fit

Y = some non-linear data. Create X columns: A=X, B=X², C=X³. =LINEST(Y_range, A:C, TRUE, TRUE). Returns slopes for each polynomial term. Allows curved fit while still using linear LINEST. Common in physics, engineering.

📋 Comparison

Test two regression models on same data. Compare R² values directly. Higher R² = better fit, but watch for overfitting (too many variables in too little data). Adjusted R² (calculate manually) corrects for variable count.

Practice Excel Skills

Multi-variable regression. The real power of LINEST.

Scenario: predicting home prices. Y (sale price) depends on multiple factors: square footage, number of bedrooms, age of home, lot size. LINEST handles them all in one formula.

Data setup. Column A: sale price. Column B: square feet. Column C: bedrooms. Column D: age. Column E: lot size. 50 home sales in rows 2-51.

Formula. =LINEST(A2:A51, B2:E51, TRUE, TRUE). Output is a 5×5 array (4 X variables + intercept = 5 columns). Select 5 rows × 5 columns before entering (or use dynamic spill in 365).

Interpreting output. Row 1: slopes in REVERSE order (E first: lot size), then D (age), C (bedrooms), B (square feet), then intercept. The reverse order is a LINEST quirk — slopes correspond to X columns right-to-left.

Example output. Slope for lot size: 1.5 ($1.50 per sq ft of lot). Slope for age: -1500 (each year decreases price by $1,500). Slope for bedrooms: 8000 (each bedroom adds $8,000). Slope for square feet: 95 ($95 per square foot of living space). Intercept: 35000 (baseline price).

Predicted value formula. Price = 35000 + 95×sqft + 8000×bedrooms + (-1500)×age + 1.5×lot_size. For a 1500 sqft, 3 bedroom, 10-year-old home on 7500 sq ft lot: 35000 + 95×1500 + 8000×3 + (-1500)×10 + 1.5×7500 = 35000 + 142500 + 24000 - 15000 + 11250 = $197,750.

R-squared interpretation. If R² = 0.78, these four variables explain 78% of price variation. Remaining 22% is location quality, school district, condition, time on market, etc.

Adding variables doesn't always help. If R² jumps from 0.75 to 0.76 when adding a new variable, the new variable adds little. Use adjusted R² (manual calculation) to penalize unnecessary variables. Overfit models predict poorly on new data.

Common multi-variable mistakes. Including variables that are correlated with each other (multicollinearity) — destabilizes slope estimates. Including too many variables relative to sample size — overfitting. Forgetting that slopes are in reverse order. Misinterpreting one slope's meaning by ignoring others (each slope is the effect of one variable holding others constant).

LINEST Output Layout

Row 1
Slopes + intercept
Row 2
Standard errors of slopes/intercept
Row 3
R² and standard error of Y
Row 4
F-statistic and degrees of freedom
Row 5
Regression SS and residual SS
Cols
Number of X vars + 1 (intercept)

Statistical interpretation. What the numbers actually tell you.

R-squared. The proportion of variance in Y explained by the regression. R² = 0.50 means 50% of Y's variation is explained by the X's. Higher is better. But R² alone doesn't tell you if the model is good — a single outlier can inflate or deflate it. Always pair with residual analysis and statistical significance tests.

Standard error of Y estimate. The standard deviation of predicted Y values around the actual Y values. Roughly, your predictions should be accurate within ±2 standard errors. If SE_y = 5,000 and you predict $100,000, the actual is typically $90,000-110,000 (95% confidence).

F-statistic. Tests whether the regression as a whole is significant. F = (regression SS / k) / (residual SS / (n-k-1)) where k = number of X variables. Compare to critical F at your alpha. If F > critical, reject the null (model is significantly better than just predicting the mean).

Standard error of each slope. Tells you how precisely you've estimated that slope. T-statistic for slope = slope ÷ SE_slope. If |T| > about 2 for your sample size, slope is significantly different from zero. T.DIST.2T(ABS(slope/SE_slope), df) gives the exact p-value.

Degrees of freedom. n - k - 1 where n = sample size, k = number of X variables. Affects all significance tests. Larger df = more confidence in estimates.

Regression sum of squares vs residual sum of squares. Regression SS = variation explained by the model. Residual SS = unexplained variation. R² = Regression SS / (Regression SS + Residual SS).

Adjusted R-squared (manual calculation). R²_adj = 1 - (1 - R²) × (n - 1) / (n - k - 1). Adjusts for number of variables. Use when comparing models with different variable counts.

Common Pitfalls

🔴 Slope Order

Slopes appear RIGHT-to-LEFT relative to X column order. Easy to mix up.

🟠 Forgot Array Entry

Older Excel needs Ctrl+Shift+Enter. Just Enter returns only first cell.

🟡 Selection Size

Must select correct cell count before entering. Wrong size = #N/A or missing data.

🟢 Multicollinearity

Correlated X variables destabilize slopes. Check correlation first.

🔵 Outliers

One bad data point can ruin regression. Plot residuals; remove or robustify.

🟣 Extrapolation

Don't predict beyond data range. Linear assumption may break down.

Free Excel Practice Test

Practical tips for using LINEST effectively.

Tip 1: Validate before relying. Plot Y vs each X on a scatter plot. Verify linearity visually before running LINEST. If the data is curved or has obvious structure, linear regression isn't appropriate.

Tip 2: Check residuals. After regression, calculate predicted Y values and residuals (actual - predicted). Plot residuals. They should look like random noise. Patterns (curve, fan shape) indicate the linear model isn't fitting properly.

Tip 3: Watch outliers. Sort data by residuals. Investigate the largest residuals. Sometimes they reveal data errors; sometimes they're legitimate anomalies. Decide whether to remove or keep before reporting results.

Tip 4: Avoid extrapolation. LINEST tells you the linear relationship within your data range. Predicting far outside that range is dangerous — linearity may break down (sales growth doesn't continue forever; biological data has limits).

Tip 5: Use related functions. SLOPE(Y, X) and INTERCEPT(Y, X) for quick simple regression without array formula complexity. FORECAST.LINEAR(new_x, Y, X) to predict from existing data. TREND(Y, X, new_x) for predicting multiple new X values. CORREL(X, Y) for the correlation coefficient (square of R gives R² for simple regression).

Tip 6: Power Query / Power Pivot for big data. LINEST works on Excel sheets. For datasets with millions of rows, use Power Query for data prep and consider Python/R for regression. Excel struggles with very large datasets.

Tip 7: Document your model. Save the formula, note R², F-statistic, significance, and which variables you included/excluded. Three months later, you'll thank yourself.

Tip 8: Use Trendline for visualization. Right-click chart series → Add Trendline → Linear (or Polynomial, Exponential, etc.). Excel shows the equation and R² directly on chart. Easier than LINEST for one-time visualizations.

Tip 9: Power BI / external tools for deeper stats. Excel LINEST gives basic regression. For confidence intervals on slopes, prediction intervals, robust regression, or non-linear models, consider Power BI's R/Python script or dedicated stats software (SPSS, R, JMP, Stata).

Tip 10: Learn one thing at a time. Master simple regression first. Then add R². Then add multi-variable. Then add statistical tests. Trying to learn everything at once leads to confusion.

LINEST vs alternatives — choosing the right approach.

LINEST vs SLOPE/INTERCEPT. For just slope and intercept of a simple regression: SLOPE and INTERCEPT are easier — return single values, no array formula. Use SLOPE/INTERCEPT for quick simple calculations. Use LINEST when you need the full statistics or have multiple X variables.

LINEST vs FORECAST.LINEAR. FORECAST.LINEAR predicts a single Y value for a new X. LINEST gives you the formula coefficients. If you just want to predict a few values, FORECAST.LINEAR is easier. If you want to understand the relationship or predict many values, LINEST.

LINEST vs TREND. TREND returns predicted Y values directly (an array of predicted values matching your inputs). LINEST returns coefficients. TREND is convenient for filling a column of predictions; LINEST is better for understanding the model.

LINEST vs LOGEST. LOGEST is for exponential regression (Y = b × m^X). LINEST is for linear (Y = mX + b). If your data fits an exponential curve better, LOGEST gives proper coefficients. Both have similar syntax and output structure.

LINEST vs Data Analysis Toolpak Regression. Data Analysis Toolpak's Regression tool produces a more readable output (named labels, formatted statistics, ANOVA table) but requires the Toolpak add-in enabled. Best for one-time analysis with formatted reports. LINEST is better for live, formula-based regression that updates with data changes.

LINEST vs Pivot Tables with calculated fields. Pivot Tables don't natively run regression. For aggregated regression analysis (e.g., regression by region), you'd need separate LINEST formulas per group, or pivot to summarize then regress.

LINEST vs Python/R. Excel LINEST handles datasets up to about 100,000 rows comfortably. For millions of rows, multi-variable interactions, or advanced techniques (robust regression, regularization, non-linear), use Python (statsmodels, scikit-learn) or R. Excel is for everyday business analysis; Python/R is for serious statistical work.

Function Comparison

📋 Simple Slope/Intercept

Use SLOPE(Y, X) and INTERCEPT(Y, X). Single values, no array entry needed. Best for: quick analysis, when you just need two numbers.

📋 Full Statistics

Use LINEST(Y, X, TRUE, TRUE). Returns full 5×(n+1) statistics table. Best for: understanding fit quality, multi-variable regression, statistical reporting.

📋 Predict Specific Values

Use FORECAST.LINEAR(new_x, Y, X). Returns predicted Y. Best for: one-off predictions, business forecasting without statistical detail.

📋 Predict Multiple Values

Use TREND(Y, X, new_x). Returns array of predictions. Best for: filling a forecast column for many new X values.

📋 Exponential Trend

Use LOGEST(Y, X, TRUE, TRUE). For exponential growth/decay. Best for: population growth, compound interest, viral spread modeling.

📋 Quick Visual

Right-click chart series → Add Trendline → Display Equation and R-squared. No formula. Best for: presentations, quick exploration, communicating to non-analysts.

Real-world LINEST applications. Where regression actually pays off in business and science.

Marketing attribution. How much do different ad channels contribute to sales? Y = sales. X1 = Google Ads spend. X2 = Facebook Ads spend. X3 = TV spend. LINEST gives you coefficients showing dollar-per-dollar return for each channel. Identify which channels deserve more budget.

Sales forecasting. Quarterly sales depend on multiple seasonal and economic factors. Y = quarterly sales. X1 = quarter (1-4). X2 = GDP growth rate. X3 = unemployment rate. X4 = average price. LINEST coefficients quantify each factor's impact and predict next quarter.

Pricing optimization. How does price affect demand? Y = units sold. X = price. Slope shows price elasticity (typically negative). Test different price points; LINEST helps quantify the relationship and find the optimal price (where revenue = price × units sold is maximized).

Employee performance modeling. Y = annual sales by salesperson. X1 = years experience. X2 = client visits per month. X3 = average deal size. X4 = lead conversion rate. LINEST identifies which factors most predict performance, informing training and hiring.

Real estate appraisal. Y = sale price. X variables = square feet, bedrooms, age, lot size, schools, etc. Multi-variable LINEST gives you a baseline appraisal model. Real estate professionals use this for comparable analysis.

Energy usage prediction. Y = monthly kWh. X1 = average outdoor temperature. X2 = number of occupants. X3 = home square footage. LINEST coefficients show how each factor affects usage, helping homeowners optimize.

Quality control. Y = product defect rate. X1 = manufacturing line speed. X2 = operator experience. X3 = ambient temperature. LINEST identifies factors contributing to defects, guiding process improvements.

Healthcare research. Y = patient outcome score. X1 = age. X2 = treatment dosage. X3 = comorbidities. X4 = adherence percentage. Multi-variable regression identifies which factors most influence outcomes, informing treatment protocols.

Financial modeling. Y = stock return. X1 = market return. X2 = company size. X3 = book-to-market ratio. Fama-French style multi-factor models use LINEST-style regression to decompose returns.

LINEST in Practice

🔴 Forecasting

Predict next month's sales, demand, revenue from historical trends.

🟠 Attribution

Quantify how each channel/factor contributes to outcomes.

🟡 Pricing

Find optimal price point by measuring elasticity.

🟢 Quality Improvement

Identify which process factors most affect defects.

🔵 Research

Test hypotheses about relationships between variables.

🟣 Modeling

Build predictive models for finance, real estate, operations.

Practice — Free Excel Test

Advanced LINEST techniques. Beyond basic regression.

Polynomial regression with LINEST. Want to fit a curve, not a line? Add polynomial X columns. To fit Y = a + bX + cX² + dX³ (cubic), create columns: A=X, B=X², C=X³. =LINEST(Y, A:C, TRUE, TRUE). Slopes correspond to d, c, b respectively (right-to-left), plus intercept a. Useful for curved data like quadratic decline or sigmoid growth (approximate with cubic).

Categorical variables (dummy coding). LINEST handles only numeric variables. For categorical (e.g., region: North/South/East/West), create dummy columns: North = 1 if North else 0; South = 1 if South else 0; East = 1 if East else 0. (Use West as the baseline by excluding it.) Three dummy columns capture four regions.

Interaction terms. Two variables might affect each other. Create X1×X2 as a new column. Include all three: X1, X2, X1×X2. LINEST coefficient for the interaction term shows how their effect changes together.

Robust regression. LINEST uses ordinary least squares — sensitive to outliers. For robust regression, you'd need iterative methods (winsorize data, use weighted regression, etc.) — Excel doesn't have a robust LINEST natively. Consider Python/R for serious robust analysis.

Weighted least squares. Want some data points to count more? Create weights column. Multiply X and Y by sqrt(weight). Run LINEST on weighted variables. The result accounts for unequal observation reliability.

Time series regression. Adding lagged Y values as X variables: Y_t depends on Y_(t-1), Y_(t-2). Create lagged columns; use LINEST. This is AR (autoregressive) modeling. Works for moderately stable time series.

Logistic regression workaround. LINEST is linear only. For binary Y (yes/no), use solver-based logistic regression or manually transform. Not LINEST's strength — Python/R or Excel's Solver add-in are better tools.

LINEST Pros and Cons

Pros

  • LINEST has a publicly available content blueprint — you know exactly what to prepare for
  • Multiple preparation pathways accommodate different schedules and budgets
  • Clear score reporting shows specific strengths and weaknesses
  • Study communities share current insights from recent test-takers
  • Retake policies allow recovery from a difficult first attempt

Cons

  • Tested content scope requires substantial preparation time
  • No single resource covers everything optimally
  • Exam-day performance can differ from practice test performance
  • Registration, prep, and retake costs accumulate significantly
  • Content changes between versions can make older materials less reliable

Excel Questions and Answers

What does LINEST do in Excel?

LINEST calculates linear regression statistics: slope, intercept, R-squared, standard errors, F-statistic, and degrees of freedom. It supports simple regression (one X variable) and multi-variable regression. Returns a 2D array of statistics that tells you both the regression equation and how well the model fits.

What's the syntax for LINEST?

=LINEST(known_y's, [known_x's], [const], [stats]). known_y's = dependent variable (required). known_x's = independent variable(s) (optional). const = TRUE includes intercept, FALSE forces through origin. stats = TRUE returns full statistics, FALSE returns just slope and intercept. Output is a 2D array.

Why are LINEST slopes in reverse order?

LINEST returns slopes RIGHT-to-LEFT relative to your X variable columns. If X is columns B, C, D (in order), the output shows slope for D first, then C, then B, then intercept. This counter-intuitive ordering is a long-standing LINEST quirk you must remember.

How do I enter LINEST as an array formula?

In Excel 365: just type the formula and press Enter — results spill automatically into adjacent cells. In older Excel: select the output range first (5 rows × number-of-X-vars+1 columns), type the formula, then press Ctrl+Shift+Enter (CSE) to enter as an array formula. Excel adds curly braces {} indicating array entry.

How do I read the LINEST output?

5 rows × (n+1) columns where n = number of X variables. Row 1: slopes (right-to-left order) and intercept. Row 2: standard errors of slopes and intercept. Row 3: R² and standard error of Y. Row 4: F-statistic and degrees of freedom. Row 5: regression sum of squares and residual sum of squares.

How accurate are LINEST's predictions?

Depends on R² and standard error of Y. R² shows how much variance is explained (0.7+ is good for most uses). Standard error of Y tells you typical prediction error. Predictions are accurate within ±2 SE of Y for 95% of cases. Always validate with residual analysis and out-of-sample testing.

When should I use LINEST vs SLOPE/INTERCEPT?

Use SLOPE and INTERCEPT for quick simple regression when you just want slope and intercept as single values — no array formula needed. Use LINEST when you need full statistics (R², F-statistic, standard errors), multi-variable regression, or you're doing formal statistical analysis.
Free Excel Practice Test

Final thoughts. The LINEST function turns Excel into a serious regression tool. For basic business analysis, sales forecasting, attribution modeling, and quality improvement — LINEST handles 80% of regression work without ever leaving Excel.

Start simple. Master single-variable LINEST first. Plot your data. Run LINEST. Verify slope and intercept make intuitive sense. Add R² and check fit quality.

Then expand. Add multiple X variables for multi-variable models. Learn to read the 5×N output. Practice interpreting standard errors and F-statistics. Build intuition for what 'good fit' looks like.

Use LINEST alongside visualization. Charts with trendlines show LINEST's work visually. Combined with formal LINEST output, you get both intuition and statistical rigor.

Know when to graduate. For large datasets, advanced techniques, or rigorous research, Excel LINEST has limits. Python (statsmodels), R (lm), or stats software (SPSS, JMP) offer more power. But for everyday analysis — quarterly forecasts, marketing attribution, pricing decisions — Excel LINEST is the right tool, deployed quickly with formulas your colleagues can also read and update. The combination of accessibility and statistical depth makes LINEST one of Excel's most underutilized power features.

▶ Start Quiz