Design of Experiments (DoE) - Frequently Asked Questions (FAQs)

1. What exactly is Design of Experiments (DoE), and why should I care?

Think of DoE as a more systematic approach to experimentation. The traditional approach, changing a single variable at a time, is slow and can be misleading because it cannot show how factors influence one another. With DoE you vary several factors at once in a planned pattern and see how they play off one another. Whether your background is in research, engineering or marketing, DoE helps you reach reliable answers in less time, with fewer resources spent on guesswork.

2. What's the difference between a factor, a level, and a response in DoE?

Think of a factor as the variable you are altering, be it temperature or dosage. The level is simply the particular value you put that factor at, say 100°C or 150°C. Then there is the response, which is your metric for the result, whether that be a patient’s recovery rate or the yield of a product. They are the three essentials of any DoE study; get a handle on them and the rest will make sense.

3. How do I decide how many samples or test runs I actually need?

At the end of the day it is a matter of statistical power, that is to say how well your experiment can pick up on an effect if there is one to be had. You will have greater confidence with a larger sample size, of course, but then you are putting in more time and money for it. DoE has its ways of dealing with this; a power analysis, for instance, is a good way to zero in on the right balance. And as a general rule, the more variability you see in the data, the more runs you are going to need to make any firm conclusions.
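To make the trade-off concrete, here is a minimal sketch of a power calculation for the simplest case: comparing two groups with a two-sided test. It uses the textbook normal approximation, and the function name, the default alpha of 0.05 and the default power of 0.80 are illustrative assumptions, not a recommendation for any particular study.

```python
from math import ceil
from statistics import NormalDist


def runs_per_group(effect_size, alpha=0.05, power=0.80):
    """Approximate runs needed per group for a two-sided, two-sample comparison.

    effect_size is the difference in means divided by the standard deviation,
    so more variability (a larger standard deviation) means a smaller effect size.
    Normal approximation: slightly optimistic for very small samples.
    """
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)  # critical value for the chosen alpha
    z_power = z.inv_cdf(power)          # quantile for the desired power
    return ceil(2 * (z_alpha + z_power) ** 2 / effect_size ** 2)


for d in (1.0, 0.5, 0.25):
    print(f"effect size {d}: {runs_per_group(d)} runs per group")
```

Halving the effect size, which is what happens when the standard deviation doubles, roughly quadruples the number of runs. Dedicated software will give more exact figures for real designs.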

4. What is randomization, and why do experimenters make such a big deal of it?

When you randomize, you are putting your treatments or conditions in a random sequence as opposed to some set pattern. The point is to shield your results from any hidden variables that you have not put into the equation and which could distort what you find. If you don’t do it, you run the risk of being too sure of yourself and crediting an effect to your own factor when in fact something else was at work, be it a change of shift or a machine that has been warming up.
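In practice, randomizing can be as simple as shuffling the list of planned runs. The sketch below assumes a hypothetical experiment with three temperature settings and four replicates each; the seed is fixed only so the plan can be reproduced later.

```python
import random

# Hypothetical experiment: 3 temperature settings, 4 replicates of each.
treatments = ["100C", "125C", "150C"]
replicates = 4
runs = [t for t in treatments for _ in range(replicates)]

rng = random.Random(42)  # fixed seed so the same run sheet can be regenerated
run_order = runs[:]      # copy, so the original systematic list is kept
rng.shuffle(run_order)

for position, treatment in enumerate(run_order, start=1):
    print(position, treatment)
```

Every treatment still gets the same number of runs; only the sequence changes, so a drifting machine or a shift change no longer lines up with one particular setting.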

5. When should I use a Randomized Complete Block Design (RCBD)?

RCBD is the design to turn to when there is a source of variability you can’t do away with but can control for: the day of the week, which operator is on duty, or the batch of raw material you are working with. You group your runs into blocks on that variable and run every treatment within each block, in random order, so that block-to-block differences are separated out instead of skewing your results. Think of it as ensuring every team in a tournament has to play on the same field; then you know the scores are down to skill and not the venue.
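A run plan for an RCBD can be sketched in a few lines. The treatment labels and the choice of weekdays as blocks are made up for illustration; the point is that every block contains every treatment, shuffled independently.

```python
import random

treatments = ["A", "B", "C"]            # hypothetical treatments
blocks = ["Mon", "Tue", "Wed", "Thu"]   # hypothetical blocking variable: day

rng = random.Random(7)  # fixed seed only for reproducibility

plan = {}
for block in blocks:
    order = treatments[:]   # each block is "complete": all treatments appear
    rng.shuffle(order)      # randomize the order separately within each block
    plan[block] = order

for block, order in plan.items():
    print(block, order)
```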

6. What is ANOVA and how does it fit into DoE?

You will find that ANOVA, or Analysis of Variance, is the statistical workhorse for most DoE analysis. Its job is to test whether the differences you see between groups are larger than you would expect from chance variation alone. Take a one-factor experiment with several levels, for instance; a one-way ANOVA will tell you whether altering that factor has any effect at all. If it does, follow-up comparisons between individual levels show where those differences are coming from.
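To show what a one-way ANOVA actually computes, here is a minimal sketch of the F statistic worked out by hand. The yield figures are invented for illustration, and the function name is an assumption; in practice a statistics package would also convert F into a p-value.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of groups of observations."""
    all_values = [y for g in groups for y in g]
    grand_mean = mean(all_values)
    k = len(groups)        # number of factor levels
    n = len(all_values)    # total number of runs

    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of individual runs around their own group mean.
    ss_within = sum((y - mean(g)) ** 2 for g in groups for y in g)

    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n - k)
    return ms_between / ms_within, k - 1, n - k


# Made-up yields at three levels of a single factor.
yields = [[10, 12, 11], [14, 15, 16], [20, 19, 21]]
f_stat, df_between, df_within = one_way_anova_f(yields)
print(f_stat, df_between, df_within)
```

A large F means the spread between the level averages is big compared with the run-to-run noise inside each level.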

7. What are 'main effects' and 'interactions' and why do interactions matter so much?

A main effect is the average effect of a single factor on the response, taken across the levels of the other factors. An interaction is when one factor’s impact is contingent on the level of another, and that is often where you will find the substance of the matter. Take a drug for instance: it may do its job at high doses in an adult yet be no good for a child. That is an interaction between age and dosage. And if you overlook those interactions, you are prone to drawing some very wrong conclusions.
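The dose and age example can be put into numbers with a two-factor, two-level table. The response values below are invented, and the factors are coded as -1 (low dose, child) and +1 (high dose, adult), which is a common convention and an assumption of this sketch.

```python
# Hypothetical mean responses, keyed by (dose, age) in coded units.
y = {
    (-1, -1): 20.0,  # low dose, child
    (+1, -1): 22.0,  # high dose, child
    (-1, +1): 20.0,  # low dose, adult
    (+1, +1): 40.0,  # high dose, adult
}


def effect(contrast):
    """Average response where the contrast is +1 minus average where it is -1."""
    return sum(contrast(dose, age) * resp for (dose, age), resp in y.items()) / 2


dose_effect = effect(lambda dose, age: dose)        # main effect of dose
age_effect = effect(lambda dose, age: age)          # main effect of age
interaction = effect(lambda dose, age: dose * age)  # dose-by-age interaction

print(dose_effect, age_effect, interaction)
```

Here the main effect of dose is 11, but that average hides the fact that raising the dose gains 20 in adults and only 2 in children. The interaction term of 9 is what flags that difference.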

8. What statistical software is best for analysing DoE data?

You will find JMP, Minitab and R to be the workhorses of the trade. They are all popular for good reason. If you are in industry, you tend to see a lot of JMP and Minitab because their interfaces are so intuitive and easy for a novice to pick up. In an academic setting, on the other hand, R is the tool of choice; it is free and has more than enough power for research purposes. Then there is Python, which is seeing more use of late among data scientists. But in the end, there is no single best option. It comes down to your own background and what is already in use by your team.

9. What is a 2^k factorial design, and when is it the right choice?

With a 2^k factorial design you put k factors to the test, running each at precisely two levels, typically called the low and high settings. A full design covers every combination, which comes to 2^k runs: 8 runs for three factors, 16 for four. The method is well structured and efficient; in a workable number of runs it will show you the main effects and any interactions plainly enough. If you are in the early stages of a new process and need to zero in on the important factors before you get into the finer points of optimization, it makes an excellent place to begin.
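Listing the runs of a full two-level design is straightforward. This sketch assumes three hypothetical factors and uses the usual coded levels, -1 for low and +1 for high.

```python
from itertools import product

factors = ["temperature", "pressure", "catalyst"]  # hypothetical factor names

# Every combination of low (-1) and high (+1) across all factors.
design = list(product([-1, +1], repeat=len(factors)))

print(f"{len(design)} runs for {len(factors)} factors")
for run in design:
    print(dict(zip(factors, run)))
```

The runs come out in a systematic order, so the list would still need to be shuffled before the experiment is carried out.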

10. What's a fractional factorial design, and when do I sacrifice some information for practicality?

With a large number of factors at play, a full factorial design quickly becomes impractical: ten factors at two levels each already means 2^10 = 1,024 runs. The fractional approach is more sensible: you run only a carefully chosen portion of the combinations. You give up some information in the process, typically on higher-order interactions, which are often small enough to ignore, but the savings in time and cost are considerable. When you have to be rigorous yet are short on resources, it is the pragmatic way to go.
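As a small illustration, a half fraction of a four-factor design can be built from a full three-factor design by setting the fourth factor equal to the product of the other three. The generator D = ABC used here is one standard choice, picked for this sketch; other fractions are possible.

```python
from itertools import product

# Full factorial in the three base factors A, B, C (coded -1 / +1).
base = list(product([-1, +1], repeat=3))

# Generator D = A*B*C fixes the level of the fourth factor for each run,
# giving 8 runs instead of the 16 a full four-factor design would need.
half_fraction = [(a, b, c, a * b * c) for a, b, c in base]

for run in half_fraction:
    print(run)
```

The price is aliasing: with this generator each main effect is mixed up with a three-factor interaction, and two-factor interactions are mixed up in pairs (AB with CD, for example), so they cannot be told apart from these runs alone.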

11. What is Response Surface Methodology (RSM), and how is it different from basic factorial designs?

With RSM you can go beyond what a basic factorial design has to offer. It does more than simply tell you which factors are of consequence; it helps you find the best settings for them. By fitting a curved model, usually a quadratic, to the relationship between your response and the factors, you get a kind of surface on which you can zero in on the peaks and valleys. That is where its real worth lies, say in product formulation or chemical engineering. When you are at the stage of wanting to know not if something works but what the very best version of it can be, RSM is an invaluable tool for process optimization.
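The idea of locating a peak on a fitted surface can be shown in its simplest form with one factor at three coded levels (-1, 0, +1). This is a deliberately stripped-down sketch with invented yields; real RSM studies use several factors, designs such as central composite or Box-Behnken, and least-squares fitting in statistical software.

```python
def fit_quadratic_coded(y_low, y_center, y_high):
    """Fit y = b0 + b1*x + b2*x**2 exactly through responses at x = -1, 0, +1."""
    b0 = y_center
    b1 = (y_high - y_low) / 2
    b2 = (y_high + y_low) / 2 - y_center
    return b0, b1, b2


def stationary_point(b1, b2):
    """Coded setting where the fitted curve is flat (a peak if b2 < 0)."""
    return -b1 / (2 * b2)


# Made-up yields at the low, center and high settings of one factor.
b0, b1, b2 = fit_quadratic_coded(70.0, 85.0, 80.0)
x_star = stationary_point(b1, b2)
y_star = b0 + b1 * x_star + b2 * x_star ** 2

print(f"curvature b2 = {b2}, best coded setting = {x_star}, predicted yield = {y_star}")
```

A negative curvature term means the fitted curve has a maximum; here it sits a quarter of the way from the center setting toward the high setting.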

12. Who actually uses DoE in the real world - is it just for scientists?

Not at all. Engineers put it to work on manufacturing lines for optimization, while in pharma it is used to design clinical trials and develop new drug formulations. Marketers rely on it for large-scale A/B testing, and agricultural researchers use it to get the most out of their crop yields. Even software teams are using DoE to run tests on product features. The truth is DoE is domain-agnostic; if you have variables and outcomes and need to make a sound decision, it is applicable.

13. Do I need a statistics background to learn DoE?

You don’t have to be a mathematician to get started, though some statistical know-how is useful. Most DoE courses, even the beginner-level ones, will take you through the material step by step. As long as you have a grasp of the fundamentals – what an average or p-value is, for instance, and the notion of variability – you are well grounded. In the end, you will do your best learning by putting these things to use on problems that matter to you.

Compliance Trainings related to Design of Experiments

Introduction to Design of Experiments: Methods and Analysis
By the end of this training, participants will be equipped with the skills and requisite knowledge to effectively apply Design of Experiments in their work, leading to improved decision-making, efficiency, and innovation.

Statistical Elements of Implementing ICH Quality Guidelines
This 9-hour training course will provide attendees with an understanding of the fourteen ICH Quality guidelines as they relate to statistical guidance and analysis. The course will provide tools, techniques and insight that will allow participants to immediately begin implementing what they have learned within their organization or firm.

Statistical Methods for Quality Improvement
This webinar presents an overview of essential quantitative methods for assessing and ensuring product quality. The methods include: Statistical Process Control, Process Capability Assessment, Regression Modeling, Design of Experiments, Hypothesis Testing, and Measurement Systems Assessment.

Sample Size Determination for Design Validation Activities
Statistical Methods are typically used to ensure that product performance, quality, and reliability requirements are met during the Design Validation phase of product development. This webinar discusses common elements of sample size determination and several specific sample size applications for various design validation activities including Reliability Demonstration/Estimation, Acceptance Sampling, and Hypothesis Testing.

Statistical Elements of Sample Size Calculations for Non-Clinical Verification and Validation Studies
This webinar provides the logic and processes for determining sample sizes for common tests used in verification or validation of processes. The focus of this webinar is on giving attendees the information they need to choose the appropriate measures and formulas for determining sample size and to justify the planned sample sizes.

Acceptance Sampling Plans for Process Validation and Production Lot Monitoring
This webinar provides details regarding the generation of acceptance sampling plans often used in process validation and production control to ensure quality of final products. By attending this webinar, participants will be able to understand the key inputs and issues involved in determining acceptance sampling plans. Sampling plans for attribute data are the primary focus, although variable acceptance sampling plans are presented as well.

Stability Studies and Estimating Shelf Life
The webinar will provide useful methods and techniques for conducting a stability study and analyzing the resulting data for the purpose of estimating shelf life. Participants should be able to immediately apply the methods presented. Also, the interpretation and communication of results will be stressed and illustrated in several examples.

Predicting Product Life Using Reliability Analysis Methods
Participants will gain awareness of the overall methodology for setting reliability targets, estimating product reliability from test data and/or field data, and determining whether or not reliability targets are achieved. Participants will also learn how to calculate sample sizes for reliability testing.

Optimizing Target Weights for Foods and Beverages
This training program will elaborate on the factors affecting the target weight decision and help determine the tolerable risks of under-filling and the costs of over-filling. Attendees will gain an understanding of process stability and process capability concepts and methods for process optimization.

Useful Statistical Methods for Defining Product and Process Specifications - Part I
This webinar covers useful and important statistical methods that assist scientists and engineers in the development of appropriate product and process specifications. Appropriate product specifications are critical to achieving adequate and reliable product performance.

Useful Statistical Methods for Defining Product and Process Specifications - Part II
This webinar covers useful and important statistical methods that assist scientists and engineers in the development of appropriate product and process specifications. Appropriate product specifications are critical to achieving adequate and reliable product performance.

Normality Testing: Applications and Issues
This webinar discusses applications of normality testing and several issues that may arise when testing data for normality. Several methods for testing data for normality are presented. We discuss some of the common types of goodness-of-fit tests that may be used (e.g. Anderson-Darling, Kolmogorov-Smirnov, etc.). We also discuss common reasons that data fail normality tests.
