This file presents a discussion of the semi-log regression model. For an empirical example and additional discussion, please look at An empirical semi-log model of wages
The Semi-Logarithmic Function
The semi-logarithmic functional form is commonly used in econometrics because its
coefficients represent useful concepts that are easily interpreted. To illustrate some basic
properties of this specification consider the following model:
where P is the market price of a house, S is the size of the house in square feet, D is a dummy variable that takes the value of 1 if the house has a garage (zero otherwise) and T is a time trend variable that has values 1, 2 , 3,...., 36. T indicates in which of 36 consecutive months the house sold. The dependent variable in equation [1] is the natural logarithm of price. To examine the how the right hand side variables affect the Price of a house we need to take the exponential of both sides of [1] since the exponential function gives the anti-log of the natural logarithm:
Equation [3] is derived from [2] by using the fact that exp(a + b) = exp(a)exp(b).
Interpreting Coefficients: Continuous Variables
In this section it will be shown that the coefficient on the continuous
variable S, (size) represents the effect of a change in S on price. More
specifically, b1 is the percentage change in P (price) that
follows from a unit change in size. For example if b1 =
0.02, a
one square foot increase in size will raise price by 2%. This price
response is identical for houses of all sizes, identical for all dates T
and is the same for houses that have a garage (D = 1) and for houses that
have no garage (D = 0).
One way to show this result is to differentiate P with respect to size, S. Recall that if
The last line shows that b is the percentage change in y per unit change in x. When this idea is applied to equation [2] we get
An alternative way to derive this results is to consider the effect of a small change in size,
S on price. Label the resulting change in price as P. From equation [3]
we have:
The left hand side of [5] represents one plus the percentage change in
P since the denominator is the initial value of P and the numerator is the
initial value plus the change in P. When S is set to 1, equation [5]
implies that the percentage change in P is exp(b1)
- 1. A scientific calculator confirms that for small values of x,
exp(x) - 1 @= x (Here, @= means "approximately = to"). For
example, if x = 0.01,
exp(x) @= 1.01005
and if x = .03, exp(x) @= 1.0305. These examples confirm that
if
the coefficient b1 is small, it represents the percentage
change in P that results from a unit change in S.
Interpreting Coefficients: Dummy Variables
In this section we show that b2, the value of a garage, is the coefficient on D in the sense that the percentage difference in the prices of houses that have garages compared to those that do not is b2. This percentage difference is the same for houses of all sizes and it is
independent of time. For houses that have a garage, D = 1 and for those that do not, D =
0. We can use equation [3] to calculate the ratio of the prices of these two types of
houses.
Since the ratio of prices is approximately (1 + b2) the
percentage increase in price due to
the presence of a garage is approximately b2. For example
if b2 = 0.06 then a garage
raises the price of a house by approximately 6%.
Interpreting Coefficients: The Time Trend
The theory that has been presented above applies directly to the time trend coefficient which represents the percentage increase in price that follows from a unit increase in T. When the frequency of time is measured in months, as in the current example, a unit increase in T means that one month has passed so the coefficient on T represents the monthly percentage increase in prices. This percentage increase is the same for every month within the sample and is the same for houses of all sizes with or without garages. This conclusion follows from the fact that the derivative
is simply equal to b3 and is therefore independent of T, S and D.