Prices Analytical Series
A new approach for estimating the Computer Equipment, Software and Supplies Index in the Consumer Price Index
Release date: February 17, 2021
The Consumer Price Index (CPI) measures the change in prices
of consumer goods and services over time. To accurately reflect trends in the
market and consumer behavior, Statistics Canada periodically updates the
methods applied to various components of the CPI.
The release of the January 2021 CPI (published on February
17th, 2021) marks the implementation of enhancements to the
calculation of the computer equipment, software, and supplies index.
The computer equipment, software, and supplies index represents
0.42% of the 2017 basket, and 4.1% of the recreation, education, and reading index,
a major component of the CPI.
Enhancements to the Index
The computer equipment, software, and supplies index in the
CPI measures the monthly changes over time in the price of laptops, desktops,
monitors, and printers. Previously, Statistics Canada used the household
component of the computer and peripherals price indexNote Note as a proxy
for their price movements. In order to better measure price change for these
items, enhancements have been made to the index including:
- A
new, mainly web-scraped, data source covering more product brands;
- Automation
of data cleaning to ensure an accurate and more timely index;
- A
new statistical model which is updated monthly with new input data and
parameters for the measurement of laptop and desktop price changes.
These enhancements will allow for improved coverage of
computer equipment, software, and supplies products in the CPI through an
increase in the number of observations used to calculate the index, as well as
a timelier receipt of data to feed the index. In addition, prices will be
collected weekly from retailer websites, supplemented with other surveys and/or
sources such as Statistics Canada’s retail sales dataNote , market data on
brand and product expenditures, and supplementary data collected online.
The New Methodology
The nature of the consumer electronics industry and its rate
of technological advancement cause the frequent replacement of outgoing items
with new and improved products. Therefore, in order to estimate pure price
movement and to control for quality changesNote ,
an updated and improved statistical methodNote is used for desktops and laptops to impute monthly prices of incoming and
outgoing itemsNote .
For monitors and printers, which tend to change less frequently, a pure matched
model price index will remain in use.
The log of monthly prices for laptops and desktops are
modelled as a function of a set of explanatory variables using a random forestNote algorithm. While
each product has a separate model, the explanatory variables used are mostly
the same. The variation in log price is thus explained by characteristics such
as storage space, storage type, total RAM, type of RAM, display size (for
desktops this variable is set to zero if the desktop in question is not an
all-in-one desktop), number of CPU cores, CPU speed, CPU brand, GPU brand, product
weight, the presence of a touch screen (laptops only), item manufacturer, and
item retailer. For categorical variables, categories with low counts or
observations with unknown values are grouped together as ‘other’.
In the next steps, price relativesNote for each brand
within each product are calculated (for example, laptops from brand X). These
are then aggregated to product relatives, which are then aggregated to the final
relative. The brand relatives for each of laptops and desktops are calculated
as the exponent of the arithmetic mean of differences in log price between
periods (using an imputed log price where necessary)Note :
where,
where,
is the log observed
price of the th observation in period ,
is the log price
imputed using the model estimated in period ,
is the number of
observations in period of a given
brand within the product,
and is the observation’s
sales, given by its retailer’s previous year sales split evenly amongst all
observations of the product type within the retailer, divided by previous
year’s total sales across all retailers.
This can be expressed as:
where is the previous
year’s computer and computer peripherals sales by a given retailer.
This is done so that the total weight of a retailer is held
constant for the year, preventing unwanted index movements caused by composition
effectsNote .
Accurate and reliable price index estimates for monitors and
printers are possible without statistical modelling due to lower product churn
and a slower pace of technological change for these products. Thus, the brand
price relatives for these goods are calculated simply as the exponent of the
arithmetic mean of differences in log observed prices between periods:
where,
For the estimation of each product level index, the
geometric mean of each product’s brand price relatives is then taken, with
their corresponding expenditure weights, to obtain a product level index , i.e.:
where the weight of the brand of a product in is the previous
period share of price updated expendituresNote , i.e.,
and,
Up to this point, geometric means are used at the product
and product-brand level aggregations in order to account for potential
substitution effectsNote between individual product models and between different brands of the same
product. However, the final index aggregation is calculated as the sum of
current period constant quality expenditures over the sum of previous period
constant quality expenditures. The current period constant quality expenditures
are calculated as the sum of previous period component expenditures multiplied
by the corresponding price relatives. This can be done since it
is unlikely that there is substitution between the different product groups (for
example, a printer cannot be substituted for a laptop). The ratio of constant
quality expenditures defined above can be simplified as the weighted arithmetic
mean of product indices, so the final computer equipment, software, and
supplies index movement at time , can be expressed as:
where
is the product’s
share of computer equipment, software and supplies expenditures during time so that:
and is given by the sum of price
updated product-brand expenditures;
for all .
Example
In order to illustrate the index calculation process described
above, we present an example using a set of fictitious data.
Say at time t for brand m of product p there are 3 observations a, b, and c. Observations b and c are observed in both the current and previous period, whereas a enters the CPI sample this period. Observations a and b are sold
by retailer r1, and c is sold by retailer r2. The sales of
computer products by r1 is equal to the sales of computer products by r2.
The current and previous period prices of the observations are given by table 1.
Table 1
Fictitious price observations for brand m of product p.
Table summary
This table displays the results of Fictitious price observations for brand m of product p.. The information is grouped by Observation (appearing as row headers), Previous period log price, Current period log price and (appearing as column headers).
| Observation |
Previous period log price |
Current period log price |
|
| a |
7.1 (imputed via model) |
7.0 |
-0.1 |
| b |
6.0 |
6.0 |
0.0 |
| c |
6.9 |
7.0 |
0.1 |
If we use a model to impute the prices of non-continuities
for product p, and the weight of a, b, and c are 0.25,
0.25, and 0.50, then the product-brand index movement
is calculated as . If we rely on a pure matched-model approach for product p, then a has no price
relative and is thus excluded, and the weight of b and c are 0.50,
and 0.50. The product-brand index movement
would then be
calculated as .
Now, say at time t the set of product-brand indices are
calculated for product p, which include movements for brands m1, m2,
and m3. The current and previous period price movements, price updated
expendituresNote ,
and weights are given by table 2.
Table 2
Set of fictitious expenditures, weights, and price movements for some brands
Table summary
This table displays the results of Set of fictitious expenditures. The information is grouped by Brand (appearing as row headers), , , , and
(appearing as column headers).
| Brand |
|
|
|
|
|
| m1 |
0.900 |
1.000 |
50 |
|
45/365 |
| m2 |
1.000 |
1.025 |
100 |
|
100/365 |
| m3 |
1.100 |
0.950 |
200 |
|
220/365 |
The product index movement
is then calculated
as 1.00045/365
1.025100/365
0.950220/365 = 0.976.
Lastly, for final index movement aggregation, say at time t there is a movement for each of laptop, desktops, monitors, and printers. The
current period price movement, previous period expenditures, and weights used
are given by table 3.
Table 3
Fictitious weights and price movements for computer equipment, software, and supplies products
Table summary
This table displays the results of Fictitious weights and price movements for computer equipment. The information is grouped by Product (appearing as row headers), , and
(appearing as column headers).
| Product |
|
|
|
| Laptops |
0.976 |
365 |
365/600 |
| Desktops |
1.010 |
120 |
120/600 |
| Monitors |
0.980 |
55 |
55/600 |
| Printers |
1.050 |
60 |
60/600 |
The final index movement
is calculated as
In summary
With the incorporation of web-scraped data and
methodological updates, the new computer equipment, software, and supplies
index constitutes an important enhancement towards the measurement of price
change for goods which are critical to the digital economy. As of the release
of the January 2021 CPI this new index replaces the use of the computers and
peripherals price index as a proxy. Due to the changes in methodology and data,
users are advised not to make year over year index comparisons until a full
year has passed since implementation. If users need more information they may contact the authors: Lance Taylor (lance.taylor2@canada.ca) or Roobina Keshishbanoosy (roobina.keshishbanoosy@canada.ca) through the info unit (statcan.cpddisseminationunit-dpcunitedediffusion.statcan@canada.ca).