Statistics.du.se
D-level Essay in Statistics 2009
How to Analyze Change from Baseline:
Absolute or Percentage Change?
Högskolan Dalarna
Lars Rönnegård
781 88 Borlänge
Tel vx 023-778000
Richard Stridbeck
How to Analyze Change from Baseline:
Absolute or Percentage Change?
In medical studies, it is common to have measurements before and after some medical interventions. How to measure
the change from baseline is a common question met by researchers. Two of the methods often used are absolute changeand percentage change. In this essay, from statistical point of view, we will discuss the comparison of the statistical powerbetween absolute change and percentage change. What's more, a rule of thumb for calculation of the standard deviation ofabsolute change is checked in both theoretical and practical way. Simulation is also used to prove both the irrationality of theconclusion that percentage change is statistical ine¢ cient and the nonexistence of the rule of thumb for percentage change.
Some recommendations about how to measure change are put forward associated with the research work we have done.
Key Words: Absolute Change, Percentage Change, Baseline, Follow-up, Statistical Power, Rule of Thumb.
change to evaluate the change of weight. Neovius (2007) alsochose absolute change as the change measurement in their
obesity research, while Kim (2009) chose percentage changeto measure the fat lost in di¤erent part of an obese man'sbody in a weight loss program. In a cystic …brosis clinical
In medical studies, a common way to measure treatment study, Lavange (2007) used percentage change as well. We
e¤ect is to compare the outcome of interest before treat-
see that, both of the two methods have been used in di¤erent
ment with that after treatment. The measurements before
kinds of clinical studies.
and after treatment are known as the baseline (B) and the
The properties of absolute change and percentage change
follow-up (F ), respectively. How to measure the change from
have been discussed by Tönqvist (1985). From his point of
baseline is a common question met by researchers. There
view, one of the advantages of percentage change is that per-
are many methods that can be used as the measure of di¤er-
centage change is independent of the unit of measurement.
ence. Two of them, which are used in a lot of clinical studies,
For instance, a man who weighs 100 Kg lost 10% of weight
are absolute change (C = B
F ) and percentage change
after a treatment, i.e. 10 Kg. Equivalently, he lost 22.05
F ) B). In di¤erent books and articles, absolute
pounds (1Kg = 2:2046 Pounds). 10 Kg and 22.05 pounds are
change may also be called change, while percentage change is
essentially the same weight, but the absolute change scores
also called relative change.
are di¤erent. However, no matter what the unit of measure-
There is a simple example that will show us the di¤erence
ment is, the percentage change is 10% all the time. More
between absolute change and percentage change more clearly:
details about the advantage of absolute change can be found
Two obese men A and B participate in a weight loss program.
in the article of Tönqvist (1985). Although there are many
Their weights at the beginning of the program are 150 Kg and
advantages for the two change measurement methods, Tön-
100 Kg, respectively. When they …nish the program, the man
qvist (1985) did not give any recommendations about how to
A who weighs 150 Kg lost 15 Kg, while another man lost 10
make a choice between absolute change and percentage change
Kg. From the example, we see that, the man A lost 5 Kg more
based on these properties.
than that the man B, but the percent of weight they lost are10% in both cases. We want to know, which change measure-
In other literatures, several suggestions about which
ment is best to show the treatment e¤ect of the weight loss
method to choose are mentioned. Vickers (2001) suggested
avoiding using percentage change. That is because he com-
In di¤erent clinical studies, either absolute change or per-
pared the statistical power of di¤erent methods by doing
centage change may be chosen. In the study of healthy dieting
a simulation and concluded that percentage change from
and weight control, Waleekhachonloet (2007) used absolute
baseline is statistically ine¢ cient. Kaiser (1989) also gave
How to Analyze Change from Baseline: Absolute or Percentage Change?
some recommendations for making a choice between absolute
2. Comparison of the statistical
change and percentage change. He suggested using the change
power of absolute change and
measurement that has less correlation with baseline scores. Atest statistic developed by Kaiser (1989) was also derived, i.e.
percentage change
the ratio of the maximum likelihood of absolute change tothat of percentage change. The absolute change is recom-mended if the value of the test statistic is larger than one,
In clinical research, it is common to test whether there is a
treatment e¤ect after a medical intervention. In order to
while percentage change is preferred when it is less than one.
test the treatment e¤ect, it's necessary to choose a suitable
That is, a simple rule helping researchers to make a choice
measurement of the di¤erence between baseline and follow-up
Actually, the primary consideration for choosing a change
From a statistical point of view, an important criterion for
measurement method is di¤erent from di¤erent points of view.
a good statistical method is high statistical power. Therefore,
From a clinical point of view, we prefer to use a change mea-
from the two common change measurement methods, absolute
surement that may show the health-improvement for the pa-
change and percentage change, the one with a higher statis-
tients in a more observable way. For example, in study of
tical power will be preferred.
asthma, the primary outcome variable is often FEV (ForcedExpiratory Volume L/s).
The e¢ ciency of a treatment is
2.1 Statistical Power
evaluated by calculating the percentage change in FEV frombaseline. In hypertension studies, it is common to use the ab-
According to the hypothesis testing theory, statistical power
solute change in blood pressure instead of percentage change.
is the probability that a test reject the false null hypothesis.
From statistical point of view, we prefer the method which
The de…nition of statistical power can be expressed as
has the highest statistical power as Vickers (2001) did.
equation (1).
Another issue concerned by researchers is the standard
deviation of the treatment e¤ect (change scores). For two
Statistical power = P (reject H0 j H0 is False)
medical interventions that may lead to the same expected
change, the e¤ect of the intervention that has a smaller stan-
0 is the null hypothesis.
In a t-test, equation (1) can be rewritten as equation (2).
dard deviation seems more stable and e¤ective. And clini-cians may always prefer that medical intervention. Since itis not practical for researchers to get all the interested exper-
Statistical power
P (reject H0 j H0 is False)
imental datasets which recorded the details of baseline and
follow-up scores for each patient, there is a rule of thumb
21 . It may help calculating the stan-
dard deviation of change scores from the standard deviation
where t =2 is the t-value under the signi…cant level
of baseline scores.
two-side t-test, and pt is the p-value of the t-test.
The aim of this essay is to show, the statistical e¢ ciency
From expression (2), we may see that the larger the ex-
of percentage change under some conditions in contrast with
pected absolute value of the t-statistic is, the higher the sta-
Vickers' (2001) conclusion, and the rationality of the rule of
tistical power will be.2 Equivalently, the smaller the expected
thumb. The essay is organized as follows. The second section
p-value is, the higher the statistical power will be. Therefore,
is the comparison of the statistical power of absolute change
from di¤erent measurement methods, we will choose the one
and percentage change by constructing a test statistic under
that has a larger expected absolute value of t-statistic or a
certain distribution assumption. In the third section, the ra-
smaller expected p-value.
tionality of rule of thumb is discussed in a theoretical way.
Simulations of some of the issues discussed in section two and
2.2 A Clinical Example of Blood Pressure
three are carried out in the fourth section. The …fth section
is an empirical investigation of the usefulness of the rule ofthumb by using some real datasets. Finally, in the discussion
To easily interpret the di¤erence of two measurement meth-
section, we discuss the results got from the previous sections,
ods, an example from a clinical trial is shown in Table 1. In
and give some suggestions.
the table, there are the records of the supine systolic bloodpressures (in mmHg) for 5 patients before and after takingthe drug captopril.
Let (Bj; Fj) denote a baseline/follow-up pair of scores for
patient j in the treatment group, j = 1; 2;
1 Personal communication Prof. Johan Bring, E-mail: [email protected] However, it must be emphasized that, in this essay, "statistical power" actually means something slightly di¤erent from this.
Zhang, L. and Han, K. (2009)
How to Analyze Change from Baseline: Absolute or Percentage Change?
can get absolute change Cj = Bj
Fj and percentage change absolute change and percentage change are asymptotic nor-
Bj for patient j by calculating from the mally distributed, i.e.
baseline Bj and follow-up Fj scores immediately.
In this example, j is the patients' ID number, and here
n = 5. In columns 2 and 3, there are baseline and follow-
up scores for each patient. Absolute change and percentage
where C and C are the mean and the standard deviation
change that calculated from baseline and follow-up scores are
P are the mean and the standard deviation of
shown in column 4 and 5, respectively.
P , respectively.
In this case, t-test can be used for both absolute change
Table 1. Supine systolic blood pressure (in mmHg) for 5
and percentage change. For absolute change, the null hypoth-
patients with moderate essential hypertension, immediately
esis of t-test is H0 : C = 0. From C
before and after taking the drug captopril3
the t-statistic for an absolute change t-test is
C is an estimate of the standard deviation of ab-
solute change.
Similarly, in the percentage case, the null hypothesis of
t-statistic for a percentage change t-test is
From Table 1, we see that there is a decreasing e¤ect for
the blood pressure of each patient after taking the drug cap-
topril. Absolute change and percentage change show the de-
crease in di¤erent ways. From a statistical point of view, we
where bP is an estimate of the standard deviation of per-
should compare the statistical power for the two methods.
centage change.
We have mentioned the relation between statistical power
and the absolute value of t-statistic. We know that, when
2.3 Comparison of Statistical Power
the signi…cant level
is …xed, if the expected absolute value
We have mentioned that Vickers (2001) compared the statis-
of the t-statistic of absolute change is larger, the statistical
tical power of di¤erent methods by doing a simulation. How-
power of that will be higher. The opposite is also true, i.e.
ever, his conclusion just based on an ideal simulation proce-dure, and he did not compare the statistical power theoreti-cally. Kaiser (1989) developed a test statistic which comparedthe maximum likelihood of the two methods. It has nothing
to do with statistical power. But Kaiser (1989) gave an idea
that it is easier to do comparison by constructing a ratio test
Statistical P ower of Absolute Change
For comparison of the statistical power of the two meth-
Statistical P ower of P ercentage Change
ods, we construct a ratio test statistic by using the test sta-tistic or p-value of the treatment e¤ect test. Before that, we
where E (jtCj) and E (jtP j) are the expected absolute value
need to know the distributions of absolute change and per-
of the t-statistic of absolute change and percentage change,
centage change. That is because, for di¤erent distributions of
absolute change or percentage change, di¤erent test methods
So, when R > 1, absolute change has higher statisti-
will be used. In order to construct a ratio test statistic, we
cal power than percentage change, and we choose absolute
should know the test statistics used in both numerator and
change. If R < 1, the percentage change with the higher
denominator of the ratio test statistic.
statistical power is preferred.
In the case of small sample size, it is common to assume
that one of the distributions of absolute change and percent-
2.3.1 When t-test is Suitable for both Absolute
age change is normal. In some speci…c situation, both ab-
Change and Percentage Change
solute change and percentage change may be normally dis-
When the sample size n of the clinical experiment is large,
tributed. Even though for a dataset that is not normally dis-
according to the Central Limit Theorem, both the mean of
tributed, if the distribution is close to normal distribution or
3 Hand, DJ, Daly, F, Lunn, AD, McConway, KJ and Ostrowski, E (1994): A Handbook of Small Data Sets. London: Chapman and Hall.
Zhang, L. and Han, K. (2009)
How to Analyze Change from Baseline: Absolute or Percentage Change?
the distribution is symmetric without extreme observations,
statistical power than percentage change, and we choose ab-
t-test may also be used. In that situation, the test statistic R
solute change. For R0 > 1, the percentage change is preferred.
is also applicable.
In this section, another ratio test statistic R0 for non-
If we simulate some datasets, by using the ratio test sta-
normal distribution situation is discussed. This is the supple-
tistic R, we can compare the statistical power of the absolute
ment of the normal distribution case. In the following simula-
change and percentage change of the datasets that we simu-
tion part, we will concentrate more on the normal distribution
lated. In contrast with Vickers' (2001) claim, some datasets
case shown in subsection 2.3.1, and the details in subsection
with R < 1 will be shown, which re‡ects that percentage
2.3.2 will not be discussed any more.
change has higher statistical power than absolute change un-der some conditions. In the simulation section, we will talkmore about the comparison of the statistical power of the twomethods.
3. Rule of Thumb for the Standard
2.3.2 When the t-test is not suitable for At Least One
Deviation of Change Scores
For the cases when the assumptions for the t-test are notsatis…ed for at least one of the tests, another test should be
The standard deviation of the treatment e¤ect is an im-
portant parameter that is of interest in the planning of
considered. Wilcoxon rank sum test4 is an alternative method
studies. The standard deviation of the change scores is the
proposed by Wilcoxon (1945). Bonate (2000) mentioned that
focus in the second section of this essay. For the case when
it is the non-parametric counterpart to the paired samples
t-test may be used instead of a non-parametric test, in or-
t-test and should be used when normal assumptions are vio-
der to calculate the ratio test statistic R, we should work out
lated. He suggested that the Wilcoxon rank sum test is always
both the mean and the standard deviation of absolute change
a better choice when the distribution of the data is unknown
and percentage change …rst. It is easy to get these values in
or uncertain. Therefore, when the paired samples t-test does
case the datasets of the experiment which give the baseline
not work, we choose Wilcoxon rank sum test instead.
and follow-up scores for each patient are known. However, in
Since we can not use t-statistic to construct the ratio test
clinical research, it is not always possible and practical to get
statistic any more, we may choose to use the expected p-value
the scores for each patient, especially in the planning phase of
of the treatment e¤ect test.
a study. If we require some datasets to support our research
Similarly to (5), according to what is mentioned in equa-
work, we may …nd some experiment datasets interesting for
tion (2), we may construct another ratio test statistic R0 by
our research from experiments that someone else has done.
taking the ratio of expected p-value, i.e.
One of the good ways to …nd the datasets is searching from
published clinical articles. Most of the time, we may …nd some
examples in these articles which show us the summary of the
baseline scores, the follow-up scores, and their standard devi-
ations. And we may get relevant datasets from these tables.
Statistical P ower of Absolute Change
However, the scores for each patient in these clinical research
Statistical P ower of P ercentage Change
articles are seldom published. In this case, how can we know
the standard deviation of the change scores?
C ) and E (pP ) are the expected p-value of ab-
solute change and percentage change, respectively.
There is a rule of thumb which describes the relationship
For the cases that t-test still works, E (p
between the standard deviation of the change scores and that
of the baseline scores.
pected p-value of the t-test for absolute change and percentage
change, respectively. When Wilcoxon rank sum test is used
instead of t-test, E (p
W ilcoxon) and E (pP ) =
E (pP W ilcoxon). E (pC W ilcoxon) and E (pP W ilcoxon) arethe expected p-value of the Wilcoxon rank sum test for ab-
3.1 Theoretical Derivation of the Rule of
solute change and percentage change, respectively. Therefore,
Thumb for Absolute Change
we may get three di¤erent alternative forms for the ratio test
The general expression for the rule of thumb of absolute
statistic R0.
We have talked about that the smaller the expected p-
value is, the higher the statistical power will be. When the sig-
is …xed, if R0 < 1, absolute change has higher
4 Wilcoxon rank sum test is a non-parametric test for assessing whether two independent samples of observations come from the same distribution.
Zhang, L. and Han, K. (2009)
How to Analyze Change from Baseline: Absolute or Percentage Change?
where k is a constant that should be determined; SD (C)
If we assume that SD (B) = 1, then we get SD (C) =
and SD (B) are the standard deviation of absolute change
2r. The smooth curve in Figure 1 shows the relation-
and baseline scores, respectively.
ship between the standard deviation of the absolute change
The relationship between the standard deviation of ab-
SD (C) and the correlation coe¢ cient r. We see that SD (C)
solute change scores and that of the baseline scores can be
decreases from 1:4 to 0 as the correlation coe¢ cient increases
derived from properties of the variance of C,
from 0 to 1. When r 6 0:8, SD (C) roughly has a linear de-crease. After that, when the correlation coe¢ cient r tends to1, the ratio decreases quickly to 0.
where r = cov (B; F )
V ar (B) V ar (F ) is the correla-
tion coe¢ cient between baseline and follow-up scores.
We assume that V ar (F ) = mV ar (B), where m is the
ratio of the variance of follow-up scores to that of baseline
scores. In a speci…c case, m is a constant which may be cal-culated from the known dataset. Then, equation (9) can be
V ar (B) V ar (F )(10)
V ar (B) mV ar (B)
Correlation Coefficient r
Equivalently, from equation (10), we get the relation equa-
Figure 1. Relation curve between the standard deviation of
absolute change SD (C) and the correlation coe¢ cient r
when SD (B) = 1.
Thus, in equation (8), k = 1
shows that the rule of thumb is determined by the correlation
Therefore, when r
0:75, the empirical form of the rule
coe¢ cient r and the ratio m.
2 holds. If the correlation
When the baseline and follow-up scores have the same
coe¢ cient r changes to another value, the form of the rule
variance, m = 1, then we get k = 1
of thumb will be also changed. When r tends to 1, a little
pression of the rule of thumb becomes
change in r may result in a signi…cant change in the standarddeviation of absolute change.
3.3 Rule of Thumb for Percentage Change
Therefore, the standard deviation of the absolute change
is connected to the standard deviation of the baseline scores
Earlier we stated that P = (B
via the correlation coe¢ cient r.
centage change is the ratio of absolute change to the baselinescore. Then we get
3.2 The Relation between the Standard Devi-ation of Absolute Change and the Correlation
The empirical form of the rule of thumb is shown in expression
Since percentage change is a ratio of two variables, its dis-
tribution is uncertain. It is hard to derive the expression of
(12) shows the relation between the standard deviation of the
the standard deviation of percentage change from the stan-
absolute change and that of the baseline scores. So, how does
dard deviation of baseline scores as we did in equation (10).
the standard deviation of absolute change depend on the cor-
We have discussed the rule of thumb for SD (C), and we
relation coe¢ cient?
see that the standard deviation of percentage change depends
Zhang, L. and Han, K. (2009)
How to Analyze Change from Baseline: Absolute or Percentage Change?
not only on the absolute change C but also on the baseline
the patients in the treatment group, the …nal follow-up scores
score B. If we …x the value of C, B may also keep on changing
F all have an absolute decrease of 5 units from F 0 after the
from one sample to another. As a result, it seems there is no
medical intervention, while there is no change of the follow-up
stable relationship between the standard deviation of percent-
scores for patients in the control group, i.e.
age change SD (P ) and the baseline score B. A rule of thumbfor percentage change may, therefore, not be stated. This con-
clusion will be proved in the following simulation part.
He changed the correlation coe¢ cient r, and got di¤erent
simulation results under di¤erent correlation coe¢ cient. Us-ing these simulation results, he calculated statistical power for
each method and made the statistical ine¢ ciency conclusion.
In the following subsections, we will do simulations based
on Vickers' (2001) method. But some change and improve-
In section 2, we discussed the comparison of the statistical ment will be done to his code.
power of absolute change and percentage change, by con-
structing a ratio test statistic based on normal distribution.
In the third section, we discussed the rule of thumb for ab-
4.1.1 A Case that Percentage Change Has Higher Sta-
solute change and percentage change theoretically.
This section will do some simulations to show the prob-
In Vickers'(2001) simulation method, a …xed absolute change
lems that we have discussed in a practical way. The …rst
from the simulated follow-up scores F 0 to the …nal follow-up
thing we want to prove is, in contrast with Vickers' (2001)
scores F was set to each patient in the treatment group. If we
conclusion, that percentage change can be statistically e¢ -
change the …xed absolute change to a …xed percentage change,
cient under some conditions. The second thing that will be
maybe we will get something di¤erent. To be more random-
proved is the di¢ culty of de…ning a rule of thumb for percent-
ized, just like what may happen in practice, we use a random
percentage change instead of …xed percentage change. Thepercentage changes P are simulated from a normal distribu-
Percentage tion. We should notice that the changes we did to Vickers'
Change under Some Conditions
(2001) simulation will result in the change of the correlationcoe¢ cient between the baseline and follow-up scores. The cor-
Vickers (2001) suggested avoiding using percentage change,
relation coe¢ cient of the baseline and follow-up scores in the
because of his conclusion that percentage change from base-
simulation result is not the r that we used in equation (13) any
line is statistically ine¢ cient. He made that conclusion based
more, even though the real value of the correlation coe¢ cient
on the comparison of statistical power calculated from his
may be very close to the value we used in the simulation.
simulation results.
However, since the correlation coe¢ cient will not a¤ect the
Vickers (2001) did the simulation in the following way.
comparison of the statistical power for the two methods, we
First, he simulated 100 pairs of baseline and follow up scores
will give the value of used in each simulation procedure, but
for 100 patients. The baseline scores B are simulated from
do not talk more about it. What's more, in the following sim-
a normal distribution, i.e. B
N (50; 10). In order to get
ulation, we just concentrate on the patients in the treatment
100 scores B, he simulated 100 B0 …rst, B0
he got B from the equation B = B0 + 50. He also simu-
We have developed a ratio test statistic R in section 2, and
lated another 100 scores Y , Y
N (0; 10), which are de…ned
we will use it to do the comparison between absolute change
as the post-treatment scores of the control group. Then the
and percentage change. The simulation can be divided into
follow-up scores F 0 are simulated from B0 and Y by using
two steps. In step 1, we simulate 100 pairs of baseline/follow-
the equation (13). We should note that F 0 is not the …nal
up scores. In the second step, the test statistic R is calculated
follow-up scores.
based on the scores we simulated in step 1.
From equation (5), we know that, in order to simulate
a dataset such that R < 1, we should let the percentage
N (0; 10), we obtain that
change have a large mean and small standard deviation. So,
N (50; 10). Finally, Vickers (2001) simulated 100 g from
in this case, we simulate P from the normal distribution
Binomial (1; 0:5) for each patient. These patients who got
N (0:5; 0:01). We set r = 0:75 and simulate B from the dis-
g = 1 were put into the treatment group, and the other pa-
tribution N (200; 20). According to Vickers (2001) simulation
tients were put in the control group. So, there are nearly
method, we obtain a dataset of scores. Figure 2 shows a part
50 patients in both treatment group and control group. For
of the simulation results of baseline and follow-up scores in
Zhang, L. and Han, K. (2009)
How to Analyze Change from Baseline: Absolute or Percentage Change?
step 1. From Figure 2, we see that there is a nearly 50%
In order to show a more general result, we repeat the pro-
decrease from the baseline score for each patient.
cedure in both step 1 and step 2 100 times, and check thedistribution of R. As shown in Figure 3, the solid line on theleft is the distribution of R based on the datasets we simu-lated. We see that, when P
N (0:5; 0:01), the value of R is
much less than 1. As a result, in this case, percentage changehas a higher statistical power.
In this simulation, we set the percentage change P nor-
mally distributed with a large mean and small standard de-viation. However, it is unreasonable to have such a small
standard deviation in practice. If we increase the standarddeviation of P , what kind of result will come to us?
Figure 3 shows the distribution of R under di¤erent stan-
dard deviation of P . We see that, the value of the test sta-
tistic R increases as the standard deviation of P increases.
Although R increases, it is still less than 1. In this case, weprefer percentage change to absolute change.
4.1.2 A Case that Absolute Change Has A LittleHigher Statistical Power
B N(200,20), P N(0.5,0.01)
In the last section, we simulated a case where percentagechange had a higher statistical power, which is in contrast
Figure 2. Change from Baseline Scores to Follow-up Scores
with Vickers' (2001) conclusion. If we consider more about
the simulation method, we should notice that we used per-centage change to do simulation in that case. It may be afactor which a¤ects the simulation results such that percent-age change has a higher statistical power.
B N(200,20), P N(0.5,SD(P))
Figure 3. Distribution of R (r = 0:75).
B N(200,20), C N(100,5)
Since the test statistic R is the ratio of two expected val-
Figure 4. Change from Baseline Scores to Follow-up Scores
ues, in order to calculate the expected value, we repeat the
score simulation procedure in step 1 100 times, and then weget 100 datasets of scores. In the second step, using the 100datasets, we work out the value of R, and check if it is less
Now, we just change P
N (0:5; 0:01) to C
and keep other conditions the same. Part of the baseline and
Zhang, L. and Han, K. (2009)
How to Analyze Change from Baseline: Absolute or Percentage Change?
follow-up scores are shown in Figure 4. It seems similar withthe scores in Figure 2. This is because we set the expectedabsolute change to 100, which is 50% of baseline scores. So,in a similar absolute change case, what kind of result we willget?
B N(200,SD(B)), C N(Mean(C),10)
Figure 6. Distribution of R (r = 0:75).
We have done 3 simulations based on a modi…cation of
Vickers'(2001) method so far. The …rst one shows that per-centage change has higher statistical power. The other two
B N(200,20), P N(100,SD(C))
show that percentage has nearly the same statistical power
Figure 5. Distribution of R (r = 0:75).
with absolute change.
All of them proved that percent-
age change can be statistical e¢ cient under some conditions.
Therefore, Vickers (2001) conclusion is not correct.
Figure 5 shows the distribution of R under di¤erent stan-
4.2 Nonexistence of Rule of Thumb for Per-
dard deviation of C. Comparing with the distributions in
Figure 3, we …nd it has a di¤erent kind of change when thestandard deviation of C changes. The distributions in Figure
We have discussed the rule of thumb for percentage change
3 mainly perform a location di¤erence, while the distributions
theoretically in section 3. In this section we will simulate
in Figure 5 have di¤erent kurtosis and spread.
another dataset to check if the rule of thumb for percentage
Even though the expected value of R in Figure 5 is larger
change exists. The simulation will show how the standard de-
than 1, it is really close to 1. In this case, it seems both
viation of percentage change SD (P ) depends on the baseline
absolute change and percentage change can be used. The dif-
ference between the statistical powers of the two methods is
This simulation is also based on Vickers' (2001) sim-
ulation method.
In this case, the baseline scores follows
N (50; 10), and the percentage change has a distribution
N (0:1; 0:02). Following the simulation steps, we will
4.1.3 Another Case that Percentage Change Has Lit-
get a score dataset, and the standard deviations of absolute
tle Di¤erence with Absolute Change
change and percentage change can be calculated.
After we get the baseline and follow-up scores, we make a
In this case, we reduce the standard deviation of the baseline
simple transformation that both baseline and follow-up scores
scores to 10 and compare the results with that of the previous
decrease 5 units, i.e.
From Figure 6, we …nd that the expected value of the ratio
test statistic is much more close to 1. If we also reduce themean of C, then R is completely less than 1. This is another
After transformation, we get a new dataset of baseline and
case that shows, under some conditions, the statistical powers
follow-up scores, and calculate the standard deviations of ab-
of the two methods are nearly the same.
solute change and percentage change of new scores. Repeat
Zhang, L. and Han, K. (2009)
How to Analyze Change from Baseline: Absolute or Percentage Change?
the simulation procedure 100 times, each time we may get
standard deviation of absolute change does not change. Ac-
4 standard deviations, SD (C), SD
tually, we can prove that in a theoretical way.
Then, we calculate the mean of 100 simulation results for the
4 standard deviations, respectively. When the correlation co-e¢ cient changes, we get the relation curve between the stan-
After the transformation, C does not change. Therefore,
dard deviation of change scores and the correlation coe¢ cient
the standard deviation of absolute change will not change,
both before and after transformation.
From …gure 7, we also see that, the standard deviations of
percentage change under di¤erent correlation coe¢ cients be-
come larger after transformation. The smaller the correlationcoe¢ cient is, the larger the change of the standard deviation
of percentage change will be.
We have mentioned that SD (P ) = SD (C
case, C does not change, but B becomes smaller. As a result,the standard deviation of percentage change becomes larger.
This re‡ects that the standard deviation of percentage change
Before T ransformation
depends on the baseline scores. Therefore, it's di¢ cult to have
After T ransformation
a rule of thumb for the standard deviation of the percentagechange based on only the standard deviation of the baseline
Correlation Coefficient r
B N(50,10), P N(0.1,0.02)
5. Demonstration of the Rule of
Thumb for Absolute Change
We have discussed the rule of thumb for absolute change
theoretically in the third section of this essay. A gen-
eral expression of the rule of thumb is given in equation (12).
We know that, when r
0:75, the empirical form of the rule of
thumb in expression (7) holds. In this section, we will concen-
trate on the demonstration of the rule of thumb for absolute
change. We collect some real datasets to check whether the
Before T ransformationAfter T ransformation
rule of thumb works well.
The real datasets are searched from examples of clinical
research articles and medical literatures. As shown in Table
2, we have collected two kinds of datasets. The datasets of
the …rst 10 cases contain the scores for each patient, while thedata sets of the last 5 cases just contain some data summary,
e.g. mean, standard deviation, etc., of the baseline scores and
Correlation Coefficient r
absolute change scores.
B N(50,10), P N(0.1,0.02)
If we know the scores for each patient, we can calculate
Figure 7. Relation curve between the standard deviation of
not only the standard deviations but also the correlation co-
absolute change (above) or percentage change (below) and
e¢ cient between the baseline scores and the follow-up scores.
the correlation coe¢ cient r before and after transformation.
So, for the …rst 10 cases, we also get the value for the general
From …gure 7, we observe that, after transformation, the
form of the rule of thumb.
Zhang, L. and Han, K. (2009)
How to Analyze Change from Baseline: Absolute or Percentage Change?
Table 2. Comparison of the real standard deviation and the value got from the rule of thumb for absolute change
Comparing the values of SD (C) and SD (B)
In this case, SD (C) = 2:68 and
are obvious di¤erences between the two values for these real
2rSD (B) = 2:66, the two values are nearly the same,
datasets. In some cases, the di¤erence between SD (C) and
which re‡ect that the rule of thumb are also a¤ected by the
2 is very large.
For the cases that has a correlation coe¢ cient between 0.6
If we also take m into account, we will get the real value of
and 0.9, which is close to 0.75, the di¤erence between SD (C)
the standard deviation of absolute change. Actually, we have
2 may be acceptable. For example, in case 1,
proved that in equation (11) of section 3.
r = 0:63, it is close to 0.75. In this case, SD (C) = 8:99 and
When we know nothing about the correlation between the
2 = 8:09 , it seems that the two values are close
baseline and follow-up scores, just like the last …ve cases in
to each other. However, in case 5, when r = 0:17, the value
Table 2, the rule of thumb may not be suitable. In this case,
of SD (C) is nearly three times of the value of SD (B)
we should be more careful.
This is not acceptable. These facts show that the rule of
From the analysis based on real datasets in this section,
thumb in expression (7) is valid when r
0:75 or when r is
we learned that when the ratio m = V ar (F )
close to that value.
to 1 and the correlation coe¢ cient is nearly 0.75, the rule of
If we take the correlation coe¢ cient r into account, by
2 will be practical. If these con-
comparing the values of SD (C) and
ditions are not satis…ed, it is not a good rule to follow. If we
…nd that the values of
2rSD (B) are closer to SD (C)
ignore the two conditions and insist on using the rule, as we
2, especially when m
know from Table 2, it may result in a big mistake.
5 Douglas G.Alman (1991): Practical Statistics for Medical Research. London: Chapman and Hall. Page 4756 Douglas G.Alman (1991): Practical Statistics for Medical Research. London: Chapman and Hall. Page 4757 Pagano M, Gauvreau K (2000): Principles of Biostatistics, Second Edition, Duxbury. Table B.158 Pagano M, Gauvreau K (2000): Principles of Biostatistics, Second Edition, Duxbury. Table B.159 Bradstreet, T.E. (1994) "Favorite Data Sets from Early Phases of Drug Research - Part 3." Proceedings of the Section on Statistical Education
of the American Statistical Association. <http://www.math.iup.edu/ tshort/Bradstreet/part3/part3-table3.html> 2009-06-08
1 0 Hand, DJ, Daly, F, Lunn, AD, McConway, KJ and Ostrowski, E (1994): A Handbook of Small Data Sets. London: Chapman and Hall.
1 1 Ryan, Joiner, Cryer (1985): Minitab Handbook, Second Edition. PWS-KENT Publishing Company. Page 318, Pulse Data1 2 Bonate P (2000): Analysis of Pretest-Posttest Design. Boca Raton: Chapman and Hall/CRC. Table 3.11 3 Bonate P (2000): Analysis of Pretest-Posttest Design. Boca Raton: Chapman and Hall/CRC. Table 3.41 4 Bonate P (2000): Analysis of Pretest-Posttest Design. Boca Raton: Chapman and Hall/CRC. Table 9.11 5 Waleekhachonloet O, Limwattananon C, Limwattananon S, Gross C (2007): Group behavior therapy versus individual behavior therapy for
healthy dieting and weight control management in overweight and obese women living in rural community. Obesity Research & Clinical Practice,1: 223-232. Table 3
1 6 Neovius M, Rössner S (2007): Results from a randomized controlled trial comparing two low-calorie diet formulae. Obesity Research & Clinical
Practice, 1: 165-171. Table 1 & Table 2
1 7 Neovius M, Rössner S (2007): Results from a randomized controlled trial comparing two low-calorie diet formulae. Obesity Research & Clinical
Practice, 1: 165-171. Table 1 & Table 2
1 8 Neovius M, Rössner S (2007): Results from a randomized controlled trial comparing two low-calorie diet formulae. Obesity Research & Clinical
Practice, 1: 165-171. Table 1 & Table 2
1 9 Neovius M, Rössner S (2007): Results from a randomized controlled trial comparing two low-calorie diet formulae. Obesity Research & Clinical
Practice, 1: 165-171. Table 1 & Table 2
Zhang, L. and Han, K. (2009)
How to Analyze Change from Baseline: Absolute or Percentage Change?
in more detail. It has higher statistical power than the twomethods we talked about. But for the people who are not
6. Discussion and Conclusions
statisticians, this method can not be understood as easily asabsolute change or percentage change.
In this essay we compared the use of absolute change and From a clinical point of view, clinicians may prefer to
percentage change. According to the de…nition of statis-
choose the method that will show the health-improvement
tical power, we developed a ratio test statistic R under cer-
more obviously. Some researchers may choose the method
tain distribution assumption, which can help us decide which
that may be understood by most people that are interested
method will be used, absolute change or percentage change.
in his research. Sometimes, we may make a choice just based
When R > 1, absolute change has a higher statistical power.
on some empirical information. Most of the time, the choice
In that case, we prefer absolute change to percentage change.
depends more on the research work and the researcher's own
If R < 1, we choose percentage change.
Based on Vickers'(2001) simulation method, with the help
of the ratio test statistic, we did some simulations to com-pare the statistical power of the two methods. In contrastwith Vickers' (2001) conclusion that the percentage change
is statistical ine¢ cient, we simulated some datasets in whichpercentage change has higher statistical power, or has nearlythe same statistical power with absolute change. In this way,
Bonate P. (2000): Analysis of Pretest-Posttest Design.
we showed that percentage can be statistical e¢ cient under
Boca Raton: Chapman and Hall/CRC.
some conditions.
Kaiser L. (1989): Adjusting for baseline: change or
Another issue often concerned by researchers is the stan-
percentage change? Statistics in Medicine, 10: 1183-
dard deviation of change scores. There is a rule of thumb
that may help us get the standard deviation of change scoresquickly from the standard deviation of the baseline scores.
Kim M. et al. (2009): Comparison of epicardial, ab-
The general form of the rule of thumb for absolute change can
dominal and regional fat compartments in response to
be derived in a theoretical way. From the derivation in section
weight loss.
Nutr Metab Cardiovasc Dis, 1:7.
3, we know that, when the ratio m = V ar (F )
0:75, the empirical form of the rule of thumb
2 holds. We also checked these con-
avange L., Engels J. and Accurso F. (2007): An-
alyzing percent change in cystic …brosis clinical trials.
ditions in a practical way by collecting some real data to
The 21st Annual North American Cystic Fibrosis Con-
compare the real value and the value got from the rule of
ference, Anaheim, California.
thumb. And we got the same conclusion. So, if we knownothing about the correlation between baseline and follow-up
Neovius M. and Rössner S. (2007): Results from a
scores, we should be more careful to use the rule.
randomized controlled trial comparing two low-calorie
For percentage change, a rule of thumb does not exist.
diet formulae. Obesity Research & Clinical Practice, 1:
That is because the standard deviation of percentage change
depends on the baseline scores, and it is very hazardous tostate a rule. We also proved this by doing a simulation with
Törnqvist L., Vartia P. and Vartia Y. (1985): How
a simple transformation. The simulation result showed how
should relative changes be measured? American Statis-
the standard deviation of percentage change depends on the
tician, 39: 43-46.
baseline scores.
Vickers A. (2001): The use of percentage change from
In this essay, we didn't give any rules to make a choice
baseline as an outcome in a controlled trial is statis-
between absolute change and percentage change. We devel-
tically ine¢ cient: a simulation study.
oped a ratio test statistic which may be helpful, but it is not a
Research Methodology, 1:6.
good rule to tell us how to make a choice. That is because, inthe ratio test statistic, expected value should be used. For a
speci…c dataset in practice, we can not calculate the expected
Limwattananon S. and Gross C. (2007):
value. That is the limitation of the test statistic.
behavior therapy versus individual behavior therapy
Actually, there is not a most optimal method to tell us
for healthy dieting and weight control management in
which method to choose. From a statistical point of view,
overweight and obese women living in rural community.
we would like to choose the method with higher statistical
Obesity Research & Clinical Practice, 1: 223-232.
power. Beside the two change measurement methods from
Wilcoxon F. (1945): Individual comparisons by rank-
baseline, there are also some other methods. One of them
ing methods, Biometrics Bull., 1, 80.
is analysis of covariance, which is mentioned by both Vick-ers (2001) and Kaiser (1989). Bonate (2000) discussed thisZhang, L. and Han, K. (2009)
How to Analyze Change from Baseline: Absolute or Percentage Change?
#correlation coefficient
#percentage change
method="stack",main="Change from Baseline to Follow-up")for(i in (1:length(b))){
}mtext(side=1,line=3, "B N(200,20), P N(0.5,0.01)")
2 0 Responsible Programmer: Ling Zhang.
Zhang, L. and Han, K. (2009)
How to Analyze Change from Baseline: Absolute or Percentage Change?
#correlation coefficient
f<-b*r+y*(1-r 2) 0.5+200h<-rnorm(n,0.5,s)
#percentage change
#absolute value of tc, i.e.
#absolute value of tc, i.e.
#expected value of jtcj
#expected value of jtpj
#value of the ratio test statistic R
Zhang, L. and Han, K. (2009)
How to Analyze Change from Baseline: Absolute or Percentage Change?
#correlation coefficient
f<-b*r+y*(1-r 2) 0.5+200h<-rnorm(n,100,5)
method="stack",main="Change from Baseline to Follow-up")
for(i in (1:length(b))){
}mtext(side=1,line=3, "B N(200,20), C N(100,5)")
#correlation coefficient
f<-b*r+y*(1-r 2) 0.5+200h<-rnorm(n,100,s)
#absolute value of tc, i.e.
#absolute value of tp, i.e.
#expected value of jtcj
#expected value of jtpj
#value of the ratio test statistic R
Zhang, L. and Han, K. (2009)
How to Analyze Change from Baseline: Absolute or Percentage Change?
for(muc in c(100,50)){
#correlation coefficient
f<-b*r+y*(1-r 2) 0.5+200h<-rnorm(n,muc,10)
#absolute value of tc, i.e.
#absolute value of tp, i.e.
#expected value of jtcj
#expected value of jtpj
#value of the ratio test statistic R
Zhang, L. and Han, K. (2009)
How to Analyze Change from Baseline: Absolute or Percentage Change?
xlab="B N(200,SD(B)), C N(Mean(C),10)",ylab="Density",main="Distribution of R")
#correlation coefficient
#percentage change
#follow-up score before transformation
#follow-up score after transformation
#baseline score before transformation
#baseline score after transformation
Zhang, L. and Han, K. (2009)
How to Analyze Change from Baseline: Absolute or Percentage Change?
mtext(side=1,line=4, "B N(50,10), P N(0.1,0.02)")for(i in 1:21){
mtext(side=1,line=4, "B N(50,10), P N(0.1,0.02)")
Zhang, L. and Han, K. (2009)
Source: http://www.statistics.du.se/essays/D09_Zhang%20Ling%20&%20Han%20Kun.pdf
The Egyptian Cabinet Information and Decision Support Center Center for Future Studies Sustainable Cities in Egypt Learning from Experience: Potentials and Preconditions for New Cities in Desert Areas Dr. Nisreen El-Lahham Dr. Waleed Hussen September 2009 Sustainable Cities in Egypt
Review Ocular side-effects of urological Nikolaos A. Kostakopoulos, Vasileios G. Argyropoulos Department of Urology, IASO General Hospital, Athens, Greece Corresponding author: Nikolaos A. Kostakopoulos Email: '[email protected]' Η αυξανόμενη γήρανση του πληθυσμού έχει σαν αποτέλεσμα συχνότερη εμφάνιση συμπτωμάτων καλοήθους υπερπλασίας του προστάτη, ασταθούς κύστεως και στυτικής δυσλειτουργίας. Αυτό έχει σαν επακόλουθο ευρεία συνταγογράφηση. φαρμάκων, όπως αναστολέων της φωσφοδιεστεράσης, αντιμουσκαρινικών παραγόντων και α-αναστολέων που προκαλούν μερικές φορές σοβαρές παρενέργειες από τους οφθαλμούς. Σ΄αυτή την ανασκόπηση θα αναφερθούν περιληπτικά οι διάφορες δυνητικές οφθαλμικές παρενέργειες, η συχνότητά τους, η φυσική τους ιστορία και η σημασία τους για τον κλινικό γιατρό.