Research Methodology

How we gathered and verified information about experimentation platforms

Sources and Methodology

Research Approach

Our comparison of experimentation platforms is based on a comprehensive review of publicly available information, including:

  • Official documentation and websites for commercial platforms
  • Published research papers and technical blogs from Microsoft and Netflix
  • Conference presentations and technical talks
  • Books and academic literature on experimentation systems
  • Industry reports and case studies

Key References

  • Microsoft ExP Platform:

    • Kohavi, R., Tang, D., & Xu, Y. (2020). Trustworthy Online Controlled Experiments: A Practical Guide to A/B Testing. Cambridge University Press.
    • Gupta, S., Ulanova, L., Bhardwaj, S., Dmitriev, P., Raff, P., & Fabijan, A. (2018). The Anatomy of a Large-Scale Online Experimentation Platform. IEEE International Conference on Software Architecture (ICSA).
    • Bajpai, A., Gupta, S., Nagpal, S., Bhardwaj, S., Dmitriev, P., & Fabijan, A. (2022). Extensible Experimentation Platform: Effective AB Test Analysis at Scale. IEEE International Conference on Software Architecture (ICSA).
  • Netflix XP Platform:

    • Xu, Y., Chen, N., Fernandez, A., Sinno, O., & Bhasin, A. (2015). From Infrastructure to Culture: A/B Testing Challenges in Large Scale Social Networks. KDD '15. (This paper describes LinkedIn's experimentation platform; it is included as comparable context for large-scale A/B testing.)
    • Netflix Technology Blog. (Various dates). Articles on experimentation and personalization.
    • Conference presentations by Netflix engineers at QCon, Strata, and other technical conferences.
  • Commercial Platforms (Statsig, Eppo):

    • Official documentation and feature descriptions from company websites
    • Technical blogs and case studies published by the companies
    • Product demos and webinars
    • Public pricing information and feature comparisons

Verification Process

To ensure accuracy in our comparison, we followed these verification steps:

  • Cross-referencing information across multiple sources when available
  • Prioritizing recent sources (published within the last two to three years)
  • Distinguishing between confirmed features and inferred capabilities based on published information
  • Providing a mechanism for reporting inaccuracies to continuously improve our comparison

Limitations and Considerations

Important context for interpreting our comparison

Information Availability

There are significant differences in the amount and detail of information available for each platform:

  • Microsoft ExP: Extensively documented in research papers and books, but because it is an internal platform, some published details may be outdated or incomplete
  • Netflix XP: Less extensively documented than Microsoft's platform, with information primarily from conference talks and blog posts
  • Commercial Platforms: Documentation focuses on marketing and user-facing features, with less detail on internal architecture and implementation

Evolving Platforms

All experimentation platforms are continuously evolving, which presents challenges for comparison:

  • Features and capabilities may change over time
  • Pricing models and tiers may be updated
  • Internal platforms like Microsoft ExP and Netflix XP may have significant changes that aren't publicly documented
  • Commercial platforms regularly release new features that may not be reflected in our comparison

Context-Specific Considerations

The suitability of an experimentation platform depends heavily on organizational context:

  • Scale of experimentation (number of users, experiments, metrics)
  • Existing technical infrastructure and integration requirements
  • Team size, expertise, and resources available for implementation and maintenance
  • Industry-specific requirements and constraints
  • Budget considerations and ROI expectations