Probabilistic or statistical topic models (TM), pioneered by Latent Dirichlet Allocation (LDA; Blei et al., 2003), are tools for analysing and understanding large text corpora based on word co-occurrence. TM are known as unsupervised techniques because they infer the content of topics from a collection of texts, or corpus, rather than requiring the ex-ante definitions of topics that supervised techniques demand (Roberts et al., 2014). Since we only observe the documents, TM aim to infer the latent or hidden topics by estimating how words are distributed over topics and topics over documents. Conceptually, topics are distributions or mixtures of words, where each word belongs to a topic with a certain probability or weight; these weights indicate how important a word is in a given topic. Documents, in turn, are distributed over topics: a single document can be composed of multiple topics, and words can be shared across topics. Thus, we can represent a document as a vector of proportions showing the share of words belonging to each topic (Roberts et al., 2014).
TM allow us to evaluate the importance of topics in documents. The shares of topics within a document, the so-called document-topic proportions, sum to one; equally, the word probabilities or topic-word distributions for a given topic also sum to one (Roberts et al., 2019). The input for TM is the collection of raw job postings transformed into a document-term matrix (DTM) representation, which represents the corpus as a bag of words or terms. The DTM is usually sparse and allows us to analyse the data using vector and matrix algebra to filter and weigh the essential features of our document collection. Another critical input is the number of topics to be considered in the model. The researcher must either choose this number based on some criterion (e.g., the held-out log-likelihood proposed by Wallach et al. (2009)) or estimate it following strategies developed for this purpose (e.g., the Anchor Words algorithm developed by Lee & Mimno, 2014).
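As a toy illustration of the DTM input, the following Python sketch turns three hypothetical postings into a matrix of term counts (the study itself builds the DTM in R with quanteda; documents and terms here are made up):

```python
from collections import Counter

# Hypothetical, already-cleaned postings.
docs = [
    "software engineer java",
    "construction engineer site",
    "java developer software",
]
tokenised = [d.split() for d in docs]
vocab = sorted({t for doc in tokenised for t in doc})

# Document-term matrix: one row per document, one column per term.
dtm = [[Counter(doc)[term] for term in vocab] for doc in tokenised]

# Each row sums to the document's length N_d.
assert all(sum(row) == len(doc) for row, doc in zip(dtm, tokenised))
```

Each row of `dtm` is the bag-of-words representation of one posting; word order within a document is discarded, which is exactly the exchangeability the models below rely on.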
Most TM assume that document collections are unstructured, in the sense that all documents arise from the same generative model without considering additional information (Roberts et al., 2014). Instead, in this study we implement the STM developed by Roberts et al. (2013, 2016). STM incorporates document metadata into the standard TM approach to structure the document collection, i.e., STM accommodates corpus structure through document-level covariates affecting topical prevalence. This feature contrasts with other TM such as LDA. Thus, the critical contribution of STM is to include the covariates in the prior distributions for document-topic proportions and topic-word distributions. These document-level covariates can affect topical prevalence, i.e., the proportion of each document devoted to a given topic, and we can measure these changes (Roberts et al., 2013). STM also allows the evaluation of topical content, which refers to the rate of word use within a given topic, but we do not implement that evaluation here.
In this study, we apply the STM topical prevalence model, which examines how much each topic contributes to a document as a function of explanatory variables or topical prevalence covariates. In our case, the covariate corresponds to the dummy \(27F\) defined by Eq. (3.1), indicating whether each job posting comes from the pre- or post-disaster period. Next, we examine the topical prevalence variation between these two periods by applying a treatment effect regression.
In the next sections, we describe the specification and estimation of the STM topical prevalence model.
4.1. STM Topic-prevalence model specification
This section and the subsequent 4.2 follow the descriptions and technical guidelines detailed in Roberts et al. (2013, 2014, 2016, 2019) and Grajzl & Murrell (2019). As a model based on word counts, STM defines a data-generating process for each document, and the observed data are used to find the most likely values for the parameters specified by the model.
The specification starts by indexing the documents by \(d\in \left\{1\dots D\right\}\) and each word in a document by \(n\in \left\{1\dots {N}_{d}\right\}\) in our DTM representation. The observed words, \({w}_{d,n}\), are instances of terms from a vocabulary of size \(V\) (our corpus of interest) indexed by \(v\in \left\{1\dots V\right\}\). Regarding the addition of covariates for examining topical prevalence, a design matrix denoted by \(X\) holds this information. Each row of \(X\), denoted \({x}_{d}\), is the vector of document covariates for a given document, so \(X\) has dimension \(D\times P\), where \(p\in \left\{1\dots P\right\}\) indexes the covariates. Finally, the \(K\) topics are indexed by \(k\in \left\{1\dots K\right\}\).
Overall, the generative process treats each document, \(d\), as beginning with a collection of \({N}_{d}\) empty positions, which are filled with terms. Since our data is represented as a DTM or bag-of-words representation, we can assume that, for a given document, all positions are interchangeable, i.e., the choice of topic for any empty position follows the same distribution for all positions in that document (Grajzl & Murrell, 2019). The filling process starts from the number of topics chosen by the researcher (details below in section 4.2.2) to build a parameter vector of dimension \(K\) for a distribution that produces one of the topics \(k\in \left\{1\dots K\right\}\) for each position in \(d\). This vector is the so-called topic-prevalence vector, since it contains the probabilities that each of the \(K\) topics is assigned to a given empty position. STM models the topic-prevalence vector as a function of the covariates to estimate the influence of document properties on topic prevalence. The process continues by selecting terms from the vocabulary \(V\) to generate a \(k\)-specific vector of dimension \(V\), which contains the probabilities of each term being chosen to fill an empty position.
Formally, the generative process for each \(d\), given the vocabulary of size \(V\) and observed words \(\left\{{w}_{d,n}\right\}\), the number of topics \(K\), and the design matrix \(X\), for our STM Topic-prevalence model specification can be represented as a four-step method. First, we draw the topic-prevalence vector from a logistic-normal generalised linear distribution (Roberts et al., 2019), with a mean vector parameterised as a function of the vector of covariates. This specification allows the expected topic proportions to vary as a function of the document-level covariates, as follows:
\({\overrightarrow{\theta }}_{d}|{X}_{d}\gamma , {\Sigma }\sim \mathrm{LogisticNormal}\left({X}_{d}\gamma , {\Sigma }\right)\), | (4.1) |
where \({\overrightarrow{\theta }}_{d}\) is the topic-prevalence vector for document \(d\), \({X}_{d}\) is the 1-by-\(P\) vector of covariates, and \(\gamma\) is the \(P\)-by-\((K-1)\) matrix of coefficients. \({\Sigma }\) is a \(\left(K-1\right)\)-by-\((K-1)\) covariance matrix that allows for correlations in the topic proportions across documents. The addition of covariates to the model allows the observed metadata to influence the frequency with which a given topic is discussed in the corpus. In our specification, the covariate corresponds to the \(27F\) dummy stated by Eq. (3.1).
Secondly, given the topic-prevalence vector \({\overrightarrow{\theta }}_{d}\) from Eq. (4.1), for each word \(n\) within document \(d\) (the process of filling the empty positions \(n\in \left\{1\dots {N}_{d}\right\}\)), a topic is sampled and assigned to that position from a multinomial distribution as follows:
\({z}_{d,n}\sim \mathrm{Multinomial}\left({\overrightarrow{\theta }}_{d}\right)\), | (4.2) |
where \({z}_{d,n}\) is the topic assignment of the word based on the document-specific distribution over topics: the \({k}^{th}\) element of \({z}_{d,n}\) is one for the selected topic \(k\) and the rest are zero.
Thirdly, we form the document-specific distribution over terms representing each topic \(k\), choosing specific vocabulary words \(v\), as follows:
\({\beta }_{d,k,v}|{z}_{d,n}\propto \exp\left({m}_{v}+{k}_{k,v}\right),\) | (4.3) |
where \({\beta }_{d,k,v}\) is the probability of drawing the \(v\)-th word in the vocabulary to fill a position in document \(d\) for topic \(k\). \({m}_{v}\) is the marginal log frequency estimated from the total word counts of term \(v\) in the vocabulary \(V\), representing the baseline word distribution across all documents. \({k}_{k,v}\) is the topic-specific deviation of topic \(k\) for term \(v\) over the baseline log-transformed rate of term \(v\); it represents the importance of the term given the topic. Exponentiating and normalising the sum of \({m}_{v}\) and \({k}_{k,v}\) converts it into probabilities for use in the subsequent and final step, which draws an observed word conditional on the chosen topic.
Fourthly, the observed word \({w}_{d,n}\) is drawn from its distribution over the vocabulary \(V\) to fill a position \(n\) in document \(d\) as follows:
\({w}_{d,n}\sim \mathrm{Multinomial}\left({\beta }_{d,k,1},\dots ,{\beta }_{d,k,V}\right)\) | (4.4) |
Also, default regularising prior distributions are used for \(\gamma\) in Eq. (4.1) and \(k\) in Eq. (4.3). These priors are zero-mean Gaussian distributions with a shared variance parameter, i.e., \({\gamma }_{p,k}\sim \mathrm{Normal}\left(0,{\sigma }_{k}^{2}\right)\) and \({\sigma }_{k}^{2}\sim \mathrm{Inverse\text{-}Gamma}\left(1,1\right)\) (Roberts et al., 2016), where \(p\) and \(k\) index the covariates and topics, respectively, as shown above.
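The four-step generative process above can be sketched numerically. The following Python/NumPy sketch draws one position of one document under made-up parameter values (toy sizes for \(K\), \(V\) and \(P\); random \(\gamma\), \(m\) and \(\kappa\) values are fabricated for illustration, not estimated):

```python
import numpy as np

rng = np.random.default_rng(0)
K, V, P = 3, 6, 2            # topics, vocabulary size, covariates (toy sizes)
x_d = np.array([1.0, 1.0])   # covariate row: intercept and the 27F dummy
gamma = rng.normal(0, 1, (P, K - 1))   # fabricated coefficient values
Sigma = np.eye(K - 1)

# Step 1: logistic-normal draw of the topic-prevalence vector theta_d,
# with mean X_d @ gamma, mapping a (K-1)-dim Gaussian to the K-simplex.
eta = rng.multivariate_normal(x_d @ gamma, Sigma)
eta = np.append(eta, 0.0)               # K-th topic as the reference
theta_d = np.exp(eta) / np.exp(eta).sum()

# Step 2: sample the topic assignment z_{d,n} for one empty position.
z = rng.choice(K, p=theta_d)

# Step 3: form the topic's word distribution from the baseline log
# frequencies m_v plus the topic-specific deviations (Eq. 4.3).
m = rng.normal(0, 1, V)
dev = rng.normal(0, 0.5, (K, V))
beta_z = np.exp(m + dev[z])
beta_z /= beta_z.sum()

# Step 4: draw the observed word w_{d,n} from that distribution.
w = rng.choice(V, p=beta_z)
```

Both `theta_d` and `beta_z` are proper probability vectors (they sum to one), mirroring the constraint that document-topic proportions and topic-word distributions each sum to one.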
4.2. STM Topic-prevalence Model and effect estimation
This section outlines the techniques used to process our text data, estimate the number of topics, infer the parameters of our STM Topic-prevalence model and, based on these parameters, estimate the effect of our natural experiment on topic prevalence. We use R packages such as quanteda (Benoit et al., 2018) to manage and analyse text data. The STM specification, estimation, and treatment effect analysis are performed using the stm R package (Roberts et al., 2016, 2019, 2020).
4.2.1. Pre-processing and DTM representation
We perform standard pre-processing procedures on our collection of 4,136 job postings (see section 3 for details). As pointed out above, since our analysis does not deal directly with text data but is performed on specific text features such as word frequencies, we construct a DTM representation (Welbers et al., 2017). We apply cleaning, tokenisation, and stemming, among other standard pre-processing procedures, to construct our DTM. We use unigrams (single words) and bigrams (two consecutive words) as tokens or features. Using bigrams allows us to capture text structure or context that we cannot see using single words. For example, for job titles with generic words like "Engineer", including bigrams makes the tokens more comprehensible, since we then observe terms like "Software Engineer" or "Construction Engineer". We also remove infrequent terms by dropping features that do not appear in at least ten documents.
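The token-level steps can be sketched as follows. This is a simplified Python sketch with hypothetical postings; the actual pipeline uses quanteda in R, stemming is omitted, and the document-frequency threshold is lowered to 2 so the toy corpus of three postings has something to trim (the study uses a 10-document threshold):

```python
import re

postings = [
    "Software Engineer wanted",      # hypothetical postings
    "Construction Engineer role",
    "Software Engineer position",
]

def tokenise(text):
    """Lowercase and keep alphabetic unigrams (cleaning + tokenisation)."""
    return re.findall(r"[a-z]+", text.lower())

def with_bigrams(tokens):
    """Unigram features plus two-consecutive-word (bigram) features."""
    return tokens + [f"{a}_{b}" for a, b in zip(tokens, tokens[1:])]

features = [with_bigrams(tokenise(p)) for p in postings]

# Trim infrequent features: keep only those appearing in >= 2 documents.
doc_freq = {}
for feats in features:
    for f in set(feats):
        doc_freq[f] = doc_freq.get(f, 0) + 1
kept = {f for f, n in doc_freq.items() if n >= 2}
trimmed = [[f for f in feats if f in kept] for feats in features]
```

Note how the bigram `software_engineer` survives trimming while generic one-off tokens such as `wanted` are dropped, which is the context-capturing behaviour described above.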
4.2.2. Estimating the number of topics, \(K\), and the STM topic-prevalence model parameters
We estimate \(K\) by applying the Anchor Words algorithm (Lee & Mimno, 2014). This technique infers \(K\) by finding an approximate convex hull, or smallest convex polygon, in the multi-dimensional word co-occurrence space given by our DTM representation. The central assumption of the Anchor Words algorithm is separability, i.e., each topic has a specific term that appears only in the context of that topic. This separability assumption implies that the terms corresponding to the vertices are anchor words for topics; the non-anchor words correspond to points within the convex hull. We expect a \(K\) between 5 and 50, which is the range suggested for a small collection of documents, i.e., a few hundred to a few thousand (Roberts et al., 2020), like our sample.
Also, since there is no true \(K\) parameter (Lee & Mimno, 2014; Roberts et al., 2016, 2019), we apply a data-driven search for \(K\) as a confirmatory analysis. Therefore, we conduct an examination across different numbers of topics to select the proper specification from the computation of diagnostics such as the held-out log-likelihood (Wallach et al., 2009) and residual analysis (Taddy, 2012). The held-out log-likelihood test evaluates the prediction of words within a document when those words have been removed from the document, estimating the probability of unseen held-out documents given some training data. For the best specification, we will observe, on average, a higher probability of held-out documents, indicating a better predictive model. In practical terms, we plot the number of topics against the held-out likelihood and look for breaks in this relationship, as a diagnostic showing that additional topics do not improve this likelihood much. The residual analysis, in turn, evaluates the variance overdispersion of the multinomial described by Eq. (4.2) within the data-generating process. An appropriate number of topics will restrict this dispersion, so we are interested in the numbers of topics with lower values in a plot showing \(K\) against the estimated dispersion or residual level.
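The idea behind the held-out comparison can be shown in miniature: score held-out word counts under each candidate model's word distribution and prefer the specification with the higher log-likelihood. This Python sketch uses two fabricated candidate distributions over a four-term vocabulary, standing in for fitted models with different \(K\):

```python
import numpy as np

# Fabricated held-out term counts for one document.
held_out_counts = np.array([5, 3, 2, 0])

# Two fabricated candidate word distributions (stand-ins for fitted models).
model_a = np.array([0.50, 0.30, 0.15, 0.05])  # close to the held-out counts
model_b = np.array([0.10, 0.10, 0.10, 0.70])  # far from them

def heldout_loglik(counts, probs):
    """Multinomial log-likelihood (up to a constant) of held-out counts."""
    return float((counts * np.log(probs)).sum())

# The better-matching model scores a higher held-out log-likelihood.
assert heldout_loglik(held_out_counts, model_a) > heldout_loglik(held_out_counts, model_b)
```

In the real diagnostic, this score is averaged over many held-out documents and plotted against \(K\); the sketch only conveys why a higher held-out log-likelihood indicates a better predictive model.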
Regarding the STM Topic-prevalence model estimation, the strategy takes the DTM, \(K\) and the covariate and returns fitted model parameters. To put it differently, given the observed data, \(K\) and our \(27F\) dummy, we estimate the most likely values for the model parameters by maximizing the posterior likelihood (see section 4.1). As a result, we can examine the proportion of job postings devoted to a given topic, or topical prevalence, over the \(27F\) dummy. However, as occurs with this kind of probabilistic model, the STM posterior distribution is intractable. Therefore, we apply the approximate inference method implemented by Roberts et al. (2019): the so-called partially collapsed variational expectation-maximization (variational EM) algorithm, which gives us, upon convergence, the estimates of our STM Topic-prevalence model. We discuss our convergence evaluation below.
Another complexity that follows from the intractable nature of the posterior is the starting values of the parameters: in our case, the initial mixture of words for a given topic. This is known as initialization, and our estimation depends on how we approach it. We specified the initialization method using the default choice, named "Spectral". The spectral algorithm is recommended for a large number of documents like ours (Roberts et al., 2020). The described estimation is executed with a maximum of 200 variational EM iterations, subject to meeting convergence. Convergence is examined by observing the change in the approximate variational lower bound; the model is considered converged when this change between iterations becomes very small (the default tolerance is 1e-5). We use functionalities included in the R package stm (Roberts et al., 2020) to estimate \(K\) and the STM topic-prevalence model parameters.
In practical terms, the STM Topic-prevalence estimation described above allows us to measure how much a given topic contributes to each of our online job postings. We interpret our results by inspecting the estimated mixture of terms associated with each topic. We include the most important terms for each topic using metrics like the highest probability and FREX terms (Roberts et al., 2019), where FREX weighs a term's frequency against its exclusivity to a given topic. This association between terms, documents and topics results from the estimated model. However, for the sake of clarity, we name each topic according to our interpretation of the set of terms that motivates it. Thus, we can find topics associated with ICT labour. Since we specified the topical prevalence as a function of the \(27F\) dummy (see Eq. (4.1) and related statements), we can measure the variation in ICT labour topic prevalence between the pre- and post-disaster periods.
4.2.3. Treatment effect estimation and evaluation
Once we have estimated our STM Topic-prevalence model, the fitted parameters allow us to estimate a regression using the online job postings as units or documents, \(d\), to evaluate the influence of our dummy \(27F\), defined by Eq. (3.1), on the topic prevalence of a topic \(j\) (Roberts et al., 2019). Since \(27F\) indicates whether the job posting was published in the period before the earthquake or after it, i.e., in the post-disaster or "treated" period (see section 3), we can study how the prevalence of topics changes in the aftermath of the disaster. In other words, we evaluate the "treatment effect" of the disaster on topical prevalence by examining changes in topic proportions over our sample of job postings published after the earthquake. The effect estimates are analogous to Generalized Linear Model (GLM) coefficients (Roberts et al., 2013).
We compute the topic proportions from the \(\theta\) matrix, where each row is the topic-prevalence vector \({\overrightarrow{\theta }}_{d}\) for document \(d\) (see Eq. (4.1)) and the columns correspond to topics. Thus, each element \({\theta }_{d,j}\) is the proportion of job posting \(d\) assigned to topic \(j\). As an illustration, in a model with only two topics, we consider the probability of each job posting for each of these two topics: for job posting \(d\), we can denote its proportions over the two topics as \({\theta }_{d,1}\) and \({\theta }_{d,2}\), where \({\theta }_{d,1}+{\theta }_{d,2}=1\). Thus, the regression to evaluate the treatment effect, where the topic proportions for a given topic are the outcome variable, can be represented as
\({\theta }_{d,j}=\alpha +\beta \, {27F}_{d}\) | (4.5) |
where \(\alpha\) is the intercept and \(\beta\) is the coefficient to be estimated. A significant \(\beta\) indicates a change (positive or negative) in topical prevalence associated with our dummy standing for the post-disaster period.
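The regression in Eq. (4.5) amounts to an OLS fit of one topic's proportions on the \(27F\) dummy, in which \(\beta\) equals the post-minus-pre difference in mean prevalence. The following is a minimal NumPy sketch with fabricated proportions; it ignores the measurement uncertainty in \(\theta\) that the stm estimation propagates:

```python
import numpy as np

# Fabricated topic proportions theta_{d,j} for one topic j across 8 postings.
theta_j = np.array([0.10, 0.12, 0.08, 0.10, 0.20, 0.22, 0.18, 0.20])
d27f    = np.array([0,    0,    0,    0,    1,    1,    1,    1   ])  # 27F dummy

# OLS for Eq. (4.5): theta_{d,j} = alpha + beta * 27F_d
X = np.column_stack([np.ones(len(d27f)), d27f])
alpha, beta = np.linalg.lstsq(X, theta_j, rcond=None)[0]

# With a single binary covariate, beta equals the post-disaster mean
# minus the pre-disaster mean of the topic's proportions.
```

With these fabricated numbers the pre-period mean is 0.10 and the post-period mean is 0.20, so the fitted \(\beta\) is the 0.10 difference in means.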
The effect estimation procedure in the stm R package relies on simulated draws of topic proportions from the variational EM posterior (see section 4.2.2) to compute the coefficients. We use the default value of 25 simulated draws and average over the results. This procedure repeatedly samples topic proportions at random from the estimated topic proportion distributions for each job posting to estimate a given effect. Also, as suggested by the software's authors, we include the estimation uncertainty of the topic proportions in the uncertainty estimates, or "Global" uncertainty, using the method of composition (Roberts et al., 2019). The regression tables display the various quantities of interest (e.g., coefficients, standard errors, a t-distribution approximation). The procedure uses 500 simulations (the default value) to obtain the required confidence intervals in the standard error computation (drawn from the covariance matrix of each simulation) and the t-distribution approximation (Roberts et al., 2020). We also show our results visually by displaying the contrast produced by the change in topical prevalence when shifting from the pre-disaster to the post-disaster period, using the mean difference estimates in topic proportions.
Regarding the evaluation of our estimation, although the robustness of the treatment effect estimation implemented here against spurious effects has been validated using several tests (e.g., the Monte Carlo experiments detailed by Roberts et al., 2014), we still apply a permutation test to evaluate the robustness of our findings. The procedure estimates our model 100 times, where each run applies a random permutation of our \(27F\) dummy to the job postings or documents. Then, the largest effect on our topics of interest is calculated. If the results connecting treatment to topics were an artefact of the model, we would find a substantial effect regardless of how the treatment was assigned to documents (Roberts et al., 2014). Alternatively, we would find a treatment effect only when the assignment of our \(27F\) dummy aligns with the true data. We present the results of our permutation tests by plotting the contrast between the permuted models and the true model for our topics of interest.
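The logic of the permutation check can be sketched as follows. This is an illustrative Python sketch with fabricated topic proportions: the real test refits the full STM on each permuted dummy, whereas here a simple difference in means stands in for the effect estimate:

```python
import numpy as np

rng = np.random.default_rng(27)

# Fabricated topic proportions with a genuine pre/post shift of about 0.1.
pre  = rng.normal(0.10, 0.02, 50)
post = rng.normal(0.20, 0.02, 50)
theta_j = np.concatenate([pre, post])
d27f = np.concatenate([np.zeros(50), np.ones(50)])

def effect(y, treat):
    """Difference in mean topic prevalence, post minus pre."""
    return y[treat == 1].mean() - y[treat == 0].mean()

true_effect = effect(theta_j, d27f)

# Re-estimate the effect under 100 random permutations of the 27F dummy.
perm_effects = [effect(theta_j, rng.permutation(d27f)) for _ in range(100)]

# A genuine effect should dwarf anything random assignment produces;
# an artefact of the model would show large effects under permutation too.
assert true_effect > max(abs(e) for e in perm_effects)
```

Plotting `true_effect` against the spread of `perm_effects` gives the contrast between the permuted models and the true model described above.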