![]() Rather than remove outliers, an alternative approach is to fit all the data (including any outliers) using a robust method that accommodates outliers so they have minimal impact. The problem with this approach is that the outlier can influence the curve fit so much that it is not much further from the fitted curve than the other points, so its residual will not be flagged as an outlier. One option is to perform an outlier test on the entire set of residuals (distances of each point from the curve) of least-squares regression. Unfortunately, no outlier test based on replicates will be useful in the typical situation where each point is measured only once or several times. If you have plenty of replicate points at each value of X, you could use such a test on each set of replicates to determine whether a value is a significant outlier from the rest. Several formal statistical tests have been devised to determine if a value is an outlier, reviewed in. With such an informal approach, it is impossible to be objective or consistent, or to document the process. Outlier elimination is often done in an ad hoc manner. Removing such outliers will improve the accuracy of the analyses. These points will dominate the calculations, and can lead to inaccurate results. But some outliers are the result of an experimental mistake, and so do not come from the same distribution as the other points. In this case, removing that point will reduce the accuracy of the results. ![]() ![]() Even when all scatter comes from a Gaussian distribution, sometimes a point will be far from the rest. Even a single outlier can dominate the sum-of-the-squares calculation, and lead to misleading results. However, experimental mistakes can lead to erroneous values – outliers. ![]() This assumption leads to the familiar goal of regression: to minimize the sum of the squares of the vertical or Y-value distances between the points and the curve. ![]() Nonlinear regression, like linear regression, assumes that the scatter of data around the ideal curve follows a Gaussian or normal distribution. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |