Frequently Asked Questions

General Questions

What does DFI stand for?

Disentangled Feature Importance. It’s a framework for computing feature importance using optimal transport to create counterfactual distributions. DFI includes both standard (Gaussian OT) and flow-based (entropic OT) methods.

How is DFI different from SHAP?

SHAP uses Shapley values from game theory, computing the average marginal contribution of each feature across all possible feature orderings. DFI uses optimal transport to create counterfactual distributions, specifically:

Transform to an uncorrelated representation (Z-space)
Replace each feature with an independent sample
Measure the change in model output

Both measure feature importance but use different mathematical frameworks.

When should I use DFI vs SHAP?

Use DFI when:

You have continuous or mixed-type data
You want built-in confidence intervals and p-values
You prefer an OT-based interpretation

Use SHAP when:

You need exact Shapley values
You’re using tree models (TreeSHAP is very fast)
You want the game-theoretic interpretation

Explainer Questions

Which explainer should I start with?

Start with OTExplainer. It’s fast, stable, and works well for most continuous data problems.

from fdfi.explainers import OTExplainer
explainer = OTExplainer(model.predict, data=X_train, nsamples=50)
results = explainer(X_test)

Why does my explainer raise NotImplementedError?

Some explainers (TreeExplainer, LinearExplainer, KernelExplainer) are placeholder implementations. Use OTExplainer or EOTExplainer for working implementations.

How do I handle categorical features?

Use EOTExplainer with Gower distance:

from fdfi.explainers import EOTExplainer

explainer = EOTExplainer(
    model.predict,
    data=X_train,
    cost_metric="gower",  # or "auto"
    categorical_threshold=10,  # Features with ≤10 unique values = categorical
)

You can also manually specify feature types:

import numpy as np
feature_types = np.array(["continuous", "binary", "categorical", "continuous"])

explainer = EOTExplainer(
    model.predict,
    data=X_train,
    cost_metric="gower",
    feature_types=feature_types,
)

Statistical Inference Questions

How do I get confidence intervals?

Use the conf_int() method after computing importance:

results = explainer(X_test)
ci = explainer.conf_int(alpha=0.05, alternative="two-sided")

print("Estimates:", ci["score"])
print("CI Lower:", ci["ci_lower"])
print("CI Upper:", ci["ci_upper"])
print("P-values:", ci["pvalue"])

What is the variance floor?

The variance floor is a minimum standard error applied to prevent confidence intervals from being too narrow for features with very small variance. This improves coverage and statistical stability. A TwoComponentMixture is fitted to the raw standard errors; the upper quantile of the smaller (noise) component is used as the floor.

ci = explainer.conf_int(
    variance_floor_q=0.90,       # quantile of the noise component (default)
)

What is the practical margin?

The margin is a threshold for practical significance. Instead of testing \(H_0: \phi_j = 0\), you can test \(H_0: \phi_j \leq \delta\) where \(\delta\) is a meaningful effect size.

ci = explainer.conf_int(
    margin=0.01,  # Only significant if importance > 0.01
    alternative="greater",
)

How do I run a one-sided test?

Pass alternative='greater' (or 'less') to conf_int(). The resulting CI has an open bound (+inf or -inf), and confidence_interval_plot() renders it automatically with a caret stub.

ci = explainer.conf_int(alpha=0.05, alternative="greater")
from fdfi.plots import confidence_interval_plot
confidence_interval_plot(ci, feature_names=feature_names)

See Statistical Inference with FDFI for a full discussion of one-sided tests, variance floors, and FDR correction.

How do I correct for multiple testing?

Pass multitest_method to conf_int():

ci = explainer.conf_int(alpha=0.05, multitest_method="fdr_bh")
# ci["pvalue_adj"] contains Benjamini-Hochberg adjusted p-values
# ci["reject_null"] reflects the adjusted decision

Performance Questions

How can I speed up explanation?

Reduce nsamples (but increases variance):

explainer = OTExplainer(model, data=X, nsamples=20)

Use subset of background data:

from fdfi.utils import sample_background
X_bg = sample_background(X_train, n_samples=100)
explainer = OTExplainer(model, data=X_bg)

Disable flow fitting (if not needed):

explainer = Explainer(model, data=X, fit_flow=False)

Why is EOTExplainer slower than OTExplainer?

EOTExplainer uses Sinkhorn iterations to solve the entropic OT problem, which requires O(n²) cost matrix computation and iterative optimization. OTExplainer uses a closed-form Gaussian solution.

Troubleshooting

I get “Flow matching requires torch” error

Install the flow dependencies:

pip install -e ".[flow]"

Or disable flow fitting:

explainer = OTExplainer(model, data=X, fit_flow=False)

My confidence intervals are all negative to positive

This happens when the standard errors are large relative to the estimates. Try:

Increase nsamples for lower variance
Use more test samples
Your features may genuinely have low importance

I get NaN or infinite values

Check for:

NaN values in your data
Model returning NaN for some inputs
Very large or small feature values (consider standardizing)