Skip to main content

Herding and investor sentiment after the cryptocurrency crash: evidence from Twitter and natural language processing

Abstract

Although the 2022 cryptocurrency market crash prompted despair among investors, the rallying cry, “wagmi” (We’re all gonna make it.) emerged among cryptocurrency enthusiasts in the aftermath. Did cryptocurrency enthusiasts respond to this crash differently compared to traditional investors? Using natural language processing techniques applied to Twitter data, this study employed a difference-in-differences method to determine whether the cryptocurrency market crash had a differential effect on investor sentiment toward cryptocurrency enthusiasts relative to more traditional investors. The results indicate that the crash affected investor sentiment among cryptocurrency enthusiastic investors differently from traditional investors. In particular, cryptocurrency enthusiasts’ tweets became more neutral and, surprisingly, less negative. This result appears to be primarily driven by a deliberate, collectivist effort to promote positivity within the cryptocurrency community (“wagmi”). Considering the more nuanced emotional content of tweets, it appears that cryptocurrency enthusiasts expressed less joy and surprise in the aftermath of the cryptocurrency crash than traditional investors. Moreover, cryptocurrency enthusiasts tweeted more frequently after the cryptocurrency crash, with a relative increase in tweet frequency of approximately one tweet per day. An analysis of the specific textual content of tweets provides evidence of herding behavior among cryptocurrency enthusiasts.

Introduction

Cryptocurrencies have grown rapidly in popularity, especially among non-traditional investors (Mattke et al. 2021). Consequently, the motivations underlying the decisions of many cryptocurrency investors are not always purely financial, with investors exhibiting substantial levels of herding behavior with respect to cryptocurrencies (Ooi et al. 2021). In fact, the culture developing around cryptocurrency enthusiasts engaging in herding behavior is rich and complex (Dodd 2018). The volatility of cryptocurrencies can vary substantially, and smaller cryptocurrencies (e.g., Dogecoin) are especially influenced by the decisions of herding-type investors (Cary 2021).

Because cryptocurrency investors are known to hold on to cryptocurrencies for ideological and cultural reasons, even when the return on cryptocurrency investments is negative (Mattke et al. 2021), the cryptocurrency crash of 2022 has the potential to drastically affect investor sentiment, especially among those who exhibit loyalty to cryptocurrencies despite negative wealth consequences. Magnifying this concern, Vidal-Tomás et al. (2019) showed that herding behavior among cryptocurrency investors is particularly strong in down markets.

The May 2022 cryptocurrency crash was one of the largest crashes in the history of cryptocurrency. Sparked by the collapse of the stablecoin Terra, the entire cryptocurrency market crashed (De Blasis et al. 2023). Before the crash, Terra was the third-largest cryptocurrency ecosystem after Bitcoin and Ethereum (Liu et al. 2023). Terra and its tethered floating-rate cryptocurrency (i.e., Luna) became valueless in only three days, representing the first major run on a cryptocurrency (Liu et al. 2023). The spillover effects on other cryptocurrencies have been widespread, with the Terra crash affecting the connectedness of the entire cryptocurrency market (Lee et al. 2023). Although an attempt to stabilize the stablecoin was made, the creator was ultimately charged and arrested for securities fraud (Judge 2023). While this was the first dissolution of a stablecoin cryptocurrency, it is worth mentioning that scholars have shown that pre-crypto stablecoins have existed, and failed (e.g., the Bank of Amsterdam from the 17th through the early 19th centuries, which tied its currency to silver and gold coins (Frost et al. 2020)). The cryptocurrency community has much to learn from the history of currency; in many cases, its ideas and attitudes are far from novel.

The consequences of an unregulated cryptocurrency market were not constrained by the cryptocurrency crashes examined in this study. Only months after the cryptocurrency crash of May 2022, the FTX collapsed (i.e., the Futures Exchange, formerly the world’s third largest cryptocurrency exchange and hedge fund). This led to a bank run and, ultimately, multiple internationally renowned figures in the cryptocurrency community being imprisoned for fraudFootnote 1 These failures cost tax payers billions of dollars in the form of bailouts for cryptocurrency investors. These are not the only costs society faces due to cryptocurrency; cryptocurrency is also the medium of exchange for $76 billion of illegal activity, with approximately 46% of Bitcoin transactions representing illegal transactions (Foley et al. 2019).

To date, research on this crash has primarily focused on spillovers among different cryptocurrencies or certain commodities. While some research on investor-level sentiment has been published, studies have not explicitly tested the differences in responses between hardcore cryptocurrency enthusiasts and traditional investors who may have held some cryptocurrencies in their portfolios. Thus, given the severity, significance, and recency of this crash in the cryptocurrency market, it is critical to ascertain whether cryptocurrency enthusiasts behave in a fundamentally differently manner than traditional investors, especially in the context of a negative shock. If so, this could potentially lead to greater volatility and is a further reason for regulating the cryptocurrency market. Accordingly, this study seeks to fill the gap in the literature by providing evidence that the May 2022 cryptocurrency crash disproportionately affected herding-type cryptocurrency enthusiasts (relative to traditional investors) as measured by its impact on the sentiment of tweets. This is accomplished by applying natural language processing techniques to harvested Twitter data to quantify the text of tweets, classifying Twitter users into cryptocurrency enthusiasts and a control group, and then feeding this data into a difference-in-differences (DID) model to estimate the potential differential effect of cryptocurrency crashes on herding-type cryptocurrency investors relative to traditional investors. Additionally, this paper analyzes the specific textual content of the tweets in each group to further assess the presence of herding behavior. Such an analysis is important because the presence of herding generates further cause for regulating cryptocurrency markets as herding is known to lead to bubbles (Haykir and Yagli 2022).

The results of the DID regressions confirm that the sentiment of tweets from these herding-type cryptocurrency investors became relatively more negative after the cryptocurrency crash, providing evidence of a decline in investor sentiment among herding-type cryptocurrency investors relative to that of traditional investors. The DID estimators estimated in this study are best interpreted as the magnitude of the differential response to the cryptocurrency crash between cryptocurrency enthusiasts and traditional investors. Critically, the significant effect estimated here indicates that these two groups behaved in fundamentally different ways, confirming that they are indeed distinct.

Additionally, the results show that cryptocurrency enthusiasts began to tweet relatively more often after the cryptocurrency crash, suggesting that multiple behavioral changes occurred as a consequence of the crash. This provides further evidence that cryptocurrency enthusiasts and traditional investors are fundamentally different groups, with distinct responses to similar stimuli. Evidentiary, a classification of the specific textual content of tweets in each group, reveals evidence of herding behavior among cryptocurrency enthusiasts but not among traditional investors. Furthermore, a large portion of this herding behavior exhibited by cryptocurrency enthusiasts is centered on related cultural artifacts such as non-fungible tokens (NFTs).

Literature review

Despite the fact that many cryptocurrencies (e.g., Bitcoin) have a history of bubbles (Chaim and Laurini 2019), many cryptocurrency enthusiasts routinely invest excessively in them. This seemingly irrational behavior can lead to people tying a large proportion of their financial well-being to cryptocurrency. Because financial distress can lead to deteriorated mental health (Starkey et al. 2013) and, by extension, more negative sentiment among investors, the cryptocurrency crash of May 2022 presents a major concern regarding the well-being of herding-type cryptocurrency investors.

The community of investors in cryptocurrencies is diverse, especially among more established cryptocurrencies such as Bitcoin (Dodd 2018). However, cryptocurrencies in general, and many smaller, less-established cryptocurrencies in particular, have a core group of ideologues that form the basis of the community (Ooi et al. 2021). These ideologically motivated communities are typically very libertarian (Obreja 2022), with many members more concerned with belonging to the community and holding cryptocurrency than maximizing the return on their investment (Mattke et al. 2021). Understanding the nature of the communities around cryptocurrencies is important because these communities are critical predictors of the growth and popularity of cryptocurrency in terms of both investing and mining (Al Shehhi et al. 2014).

The libertarian nature of the cryptocurrency community is particularly relevant given the prevalence of confirmation bias, political and information silos, and the growing number of calls to regulate cryptocurrencies. The strong role of confirmation bias among cryptocurrency investors has been documented (Zhang et al. 2019). While regulations may help increase public trust in cryptocurrencies and protect investors and members of this community are among the most likely to benefit from them, (Giudici et al. 2020) the cryptocurrency community is largely ideologically opposed to any regulation.

Collectivist behavior exhibits itself in the cryptocurrency community in other ways. Although perhaps unprincipled, herding behavior among cryptocurrency investors is a well-documented phenomenon (Kallinterakis and Wang 2019). According to Haykir and Yagli (2022), herding behavior in cryptocurrency was prominent during the global COVID-19 pandemic. A study of 50 cryptocurrencies also revealed evidence of herding behavior among investors (da Gama Silva et al. 2019). Specific events have been found to increase herding behavior among cryptocurrency investors, including the expiration date of Bitcoin futures on the Chicago Mercantile Exchange (Blasco et al. 2022). Generally, herding behavior tends to be at its highest when uncertainty is high (Bouri et al. 2019). Combining this with the result from Vidal-Tomás et al. (2019) that herding is strongest when markets are down, we can see that the cryptocurrency crash of 2022 is an important event that can be used to study the behavior of cryptocurrency investors.

Social media is one of the richest sources of data for studying investor behavior. Researchers can study investors’ behavior and motivations by collecting social media data and using natural language processing (NLP) techniques (Zhou 2018). The most commonly used NLP technique is sentiment analysis (Liu 2010). A prominent example of the use of sentiment analysis in the finance literature comes from Gao et al. (2022), who used the announcement of the Chinese “dual carbon” target to study the impacts of investor sentiment on the volatility of (green) stock returns.

Several studies generally consider the role of investor sentiment in stocks (Baker and Wurgler 2006, 2007; Baker et al. 2012; Da et al. 2015). In addition, Seok et al. (2019) and Xu and Zhou (2018) examined the role of investor sentiment in Korean and Chinese stocks, respectively. However, the application of sentiment analysis to financing does not end with the stock market. Using data on bettor sentiment, Avery and Chevalier (1999) showed that bettor sentiment affects the point spread in football games.

Other important papers related to this one include Kou et al. (2023) which demonstrates how fuzzy methods of studying facial expressions are relevant to making sustainable financial decisions. Kou et al. (2024) used fuzzy methods to impute expert financial decisions that are “based on the golden ratio”. Another paper by Kou et al. (2021) used fuzzy methods to study fintech investments in the context of European banks. Leaving the realm of fuzzy methodologies, Kou et al. (2021) developed a model to predict bankruptcies among small and medium sized businesses.Footnote 2

Turning to the effects of investor sentiment on cryptocurrencies, the literature remains plentiful. Cryptocurrencies do not always respond to new information in the same manner as traditional investments Rognone et al. (2020). This is particularly important because the sentiment analysis of both news (Lamon et al. 2017) and social media (Philippas et al. 2019) has been linked to changes in cryptocurrency prices. Mai et al. (2018) built on these results by showing that not only did social media sentiment affect cryptocurrency markets but also that such effects were driven by the sentiment of low-frequency posters, not high-frequency posters. Furthermore, relevant sentiment data from social media have been shown to affect the volatility of cryptocurrency markets (Ahn and Kim 2021) and liquidity (Yue et al. 2021) and can predict bubbles in cryptocurrency markets (Phillips and Gorse 2017). Several studies have considered the effects of the sentiment of (or pertaining to) influential figures on cryptocurrency prices, most notably Ante (2023) and Cary (2021).

In the case of Cary (2021), there was a severe negative impact on the price of Dogecoin attributable to the action of the crypto-tastemaker, who affixed their celebrity to the cryptocurrency. This raises an important, more general concern: given that Anastasiou et al. (2021) showed that sentiment correlates with the risk of a crash in the cryptocurrency market, what happens when there is a major drop in the market?

The May 2022 crash was not the first to occur in cryptocurrency markets. For instance, crashes occurred during 2017–2018 (Cross et al. 2021) and 2013–2014 (Bouri et al. 2017).

Herding behavior among investors is common in cryptocurrency crashes (Li et al. 2023). Examples of observed herding in cryptocurrency markets include a study by Vidal-Tomás et al. (2019), who presented evidence of herding in the lead up to the 2017–2018 cryptocurrency crash. Similarly, Shu et al. (2021) found proof that herding caused a bubble in Bitcoin in 2021. Bouri et al. (2019) studied herding over a longer period of time, finding it to be a persistent feature of cryptocurrency markets that ebbed and flowed over time. Raimundo et al. (2022) found that herding behavior was particularly prominent in cryptocurrency markets during periods of market stress. A typical approach to measuring sentiment throughout the literature is to find a source of relevant text, typically from social media, perform sentiment analysis on the text, and relate the results from the sentiment analysis to the price of a cryptocurrency (Abraham et al. 2018).

Although there are abundant studies on herding, not all such papers can be cited reasonably in this literature review. For example, although they did not study cryptocurrency specifically, Yousaf et al. (2018) found that Ramadan does not lead to herding behavior in the Pakistani stock market. Similarly, Yousaf and Ali (2020b) studied spillovers between Bitcoin, Ethereum, and Litecoin before and during the COVID-19 pandemic by comparing spillovers during the periods October 3, 2018 to December 31, 2019, and January 1, 2020 to April 1, 2020 using a vector autoregressive asymmetric generalized autoregressive conditional heteroskedasticity (VAR-AGARCH) framework. Building on this, Yousaf and Ali (2020a) also studied spillovers between Bitcoin, Ethereum, and Litecoin before and during the COVID-19 pandemic by comparing spillovers during the periods January 1, 2019 to December 31, 2019 and January 1, 2020 to April 22, 2020, using a VAR-DCC-GARCH framework. Building on this line of research, Yousaf and Ali (2021) examined spillovers between Bitcoin, Ethereum, and Litecoin, as well as the S &P 500, before and during the COVID-19 pandemic by comparing spillovers from May 21, 2019 to December 31, 2019 and from January 1, 2020 to May 21, 2020 using a BEKK-AGARCH framework. All three studies on cryptocurrency spillovers and COVID-19 have consistent results and, as of April 1, 2020, report a combined market cap of 76% for these three cryptocurrencies. The author of this paper highly recommends that the reader look at these three papers.Footnote 3

While much literature exists on how herding and sentiment affect prices, the literature on the opposite direction is sparse and considerable progress remains to be made regarding the effects of returns on sentiment. This study builds on the existing literature by providing empirical evidence that returns on financial investments affect investor sentiment, but, in the case of cryptocurrencies, in a non-homogeneous manner across different types of investors. Furthermore, this difference in behavior is tantamount to herding.

Theoretical motivation

Before presenting the data and methodology, it is important to explain why we should expect to see behavioral differences between cryptocurrency enthusiasts and traditional investors. While other assumptions are possible, the model presented here specifically assumes expected utility theory (Mas-Colell et al. 1995). This framework is ideal because it allows for a straightforward economic interpretation of behavioral expectations derived from the model, which helps build a simple intuition for why we should expect to see the empirical results that we do indeed observe, considering the following setup with two types of agents: type \(\theta _{E}\) represents cryptocurrency enthusiasts and type \(\theta _{I}\) represents traditional investors. All agents are utility maximizers that maximize the following intertemporal utility function in Eq. 1, where agents derive utility from their wealth \((W_{t})\) and consumption \((C_{t})\), which is proportional to wealth (i.e., \(C_{t} = \rho W_{t}\), where \(0<\rho <1\) to ensure that some consumption and savings occur), and from investments in Bitcoin \((B_{t})\). The parameters \(\alpha _{B}>0\), \(\alpha _{W}>0\), and \(\alpha _{C}>0\) describe the relative contributions of held Bitcoin, wealth, and consumption, respectively, to an individual’s total utility. The key assumption of the model is that the function \(f(\theta )\) is equal to \(\alpha _{B}\) for the cryptocurrency enthusiast type \(\theta = \theta _{E}\); however, for the traditional investor type \(\theta = \theta _{I}\), the value of \(f(\theta )\) is zero. This finding implies that traditional investors gain utility from cryptocurrencies as their wealth increases. Thus, the parameter \(\alpha _{B}\) describes the additional value (cultural or otherwise) ascribed to Bitcoin held by cryptocurrency enthusiast type agents. Importantly, this study assumes that \(\alpha _{B}\le \alpha _{W}\), such that \(\theta _{E}\)-type agents still aspire to grow their wealth.

$$\begin{aligned} U_{t}(\theta ) = f(\theta )\ln {(B_{t})} + \alpha _{w}\ln {(W_{t})} + \alpha _{c}\ln {(C_{t})} \end{aligned}$$
(1)

Next, we describe the evolution of wealth by using Eq. 2. Here, \(I_{t} = W_{t} - B_{t}\) represents traditional (non-cryptocurrency) investments. The parameters \(r_{I}\) and \(r_{B}\) denote the rates of return on traditional investments and Bitcoin, respectively. Importantly, this study assumes that the rate of return \(r_{I}\) is equal to the highest rate of return available to investors; if Bitcoin has the highest rate of return available to a traditional investor, \(r_{I} = r_{B}\). However, because this study focuses on investor behavior during the aftermath of the cryptocurrency crash of 2022, we assume that \(r_{i} > r_{B}\) for the remainder of this analysis.

$$\begin{aligned} \dot{W} = W_{t} - W_{t-1} = (1+r_{I})I_{t} + (1+r_{B})B_{t} - C_{t} \end{aligned}$$
(2)

By simply rearranging the terms and using the definitions of \(I_{t}\) and \(C_{t}\), we obtain the following description of the change in investor wealth over time (Eq. 3):

$$\begin{aligned} \dot{W} = r_{I}W_{t-1} + (r_{B}-r_{I})B_{t} - \rho W_{t} \end{aligned}$$
(3)

Given this simple model, the natural questions are as follows: (1) How much wealth will be invested in Bitcoin by \(\theta _{E}\)-type investors? (2) How does the evolution of wealth differ between the two investor types?

To answer the first question, we set equal partial derivatives of utility with respect to wealth and Bitcoin (Eq. 4).

$$\begin{aligned} \frac{\alpha _{W}}{W} = \frac{f(\theta )}{B} \end{aligned}$$
(4)

Rearranging the terms and using the definition of \(f(\theta )\) gives us the ratio of wealth invested in Bitcoin over time for each investor type:

$$\begin{aligned} \frac{B}{W} = {\left\{ \begin{array}{ll} \frac{\alpha _{B}}{\alpha _{W}} & \textrm{if}\ \theta = \theta _{E}\\ 0 & \textrm{if}\ \theta = \theta _{I} \end{array}\right. } \end{aligned}$$
(5)

As can be seen from Eq. 5, the optimal proportion of Bitcoin holdings is equal to the relative utility a \(\theta _{E}\)-type investor obtains from holding Bitcoin compared to their utility from wealth in general.

Turning our attention to the second question, the goal is to describe the difference in \(\frac{\dot{W}}{W}\) between the two investor types. Equation 6 states the evolution of wealth for \(\theta _{I}\)-type investors, and 7 states the evolution of wealth for \(\theta _{E}\)-type investors. The "Appendix" provides the derivation of these results.

$$\begin{aligned} & \frac{\dot{W}}{W} = \left( 1+\rho \right) \left( \frac{r_{I}}{1+r_{I}}\right) - \rho \end{aligned}$$
(6)
$$\begin{aligned} & \begin{aligned} \frac{\dot{W}}{W} = \left( 1+\rho \right) \left( \frac{r_{I}W + (r_{B}-r_{I})B}{(1+r_{I})W + (r_{B}-r_{I})B}\right) - \rho \end{aligned} \end{aligned}$$
(7)

Taking the difference between these two equations, we find that traditional investors’ wealth grows at a faster rate than in (Eq. 8; derivation in the "Appendix").

$$\begin{aligned} \frac{\dot{W}_{I}}{W_{I}} - \frac{\dot{W}_{E}}{W_{E}} = \frac{(1+\rho )\left( \frac{\alpha _{B}}{\alpha _{W}}\right) (r_{I}-r_{B})}{(1+r_{I}) \left( 1+r_{I}+\left( \frac{\alpha _{B}}{\alpha _{W}}\right) (r_{B}-r_{I})\right) } > 0 \end{aligned}$$
(8)

Thus, using a simple model, we show that cryptocurrency enthusiasts will experience a lower growth rate for wealth as a consequence of the utility they gain from holding Bitcoin. Given that changes in wealth can be reasonably expected to affect the sentiment embedded in relevant tweets, this derivation provides a formal justification for why we should expect to see changes in the sentiment of tweets among cryptocurrency enthusiasts in the aftermath of the cryptocurrency crash of 2022.

It is important to acknowledge that an expected utility framework is not the only way to motivate the empirical analysis in this study. The prospect theory is another means of framing this study. However, there is extensive value in establishing and deriving this expected utility model. Specifically, this study shows how non-financial factors, such as belonging to a community, can affect the utility-maximizing behavior of cryptocurrency enthusiasts. Essentially, while the cryptocurrency enthusiast’s position of holding crypto assets during a crash is not what a traditional investor would consider rational, it is rational from the perspective of a cryptocurrency enthusiast. This is important for policymakers when designing regulations for cryptocurrency markets.

Data and identification strategy

Data

The data used in this study were obtained from Twitter. To identify a differential effect linking the cryptocurrency crash to changes in the sentiment of cryptocurrency enthusiasts relative to traditional investors, we ultimately need to quantify the relevant aspects of tweets using sentiment analysis. These relevant aspects of tweets are referred to as affective states in the sentiment analysis literature (Xie et al. 2021) as a “positive,” “negative,” “neutral,” and an aggregate or “compound” score. Once all tweets are assigned scores for each affective state, all tweets by a given user during each of the two time periods (before and after the cryptocurrency market crash) the scores are combined and averaged to give a mean score for each user and affective state pair. This dataset also contains the frequency of tweets made by each user before and after the cryptocurrency crash. Because the state of the cryptocurrency market itself is likely to affect investor sentiment, the price of Bitcoin is also included. Table 1 presents the summary statistics, and the process for generating these data is described below.

Table 1 Summary statistics for the data

The data were generated using a four-step process. The first step was to curate a list of Twitter users for the potential treatment and control groups. This was done by searching for and collecting tweets containing at least one of a series of finance related terms including cryptocurrency specific terms such as “wagmi.” (An acronym for “We are all gonna make it,” “wagmi” is a rallying cry among herding-type cryptocurrency enthusiasts). This approach was chosen over other sample selection methods (e.g., the seed-based method proposed by Yang et al. (2015)) because it allows for a straightforward classification of users. While many different clustering methods could have been used to (bi)partition the users, such methods rely on the full sample of users having already been determined and, therefore, are not appropriate for use here (Li et al. 2021).Footnote 4 Twitter was selected as a data source for several reasons. First, when the data for the study were collected, the Twitter API was freely accessible to researchers. Second, Twitter users tend to post frequently, with short yet expressive posts, which is an ideal combination for this study. Third, a body of literature exists on extracting a representative sample of users from Twitter for a given research purpose (Vicente 2023; Mislove et al. 2011).

Once the tweets were collected, the second step was to partition the users into the treated and control groups for the DID regression. The treated group; that is, herding-type cryptocurrency enthusiasts, was defined via the existence of herding-type cryptocurrency enthusiast-specific keywords in tweets. If a herding-type cryptocurrency enthusiast-specific keyword was present in any of a user’s tweets, the user was classified as “treated.” The remaining users were classified as “controls” and may be thought of as more traditional investors (i.e., investors who did not exhibit outward evidence of herding-type behavior in the cryptocurrency market prior to the cryptocurrency crash of May 2022). It is important to note that these users may still invest in cryptocurrencies; however, such investment decisions are no different from any other investment decision.

With a large number of tweets (614,116) collected and users classified into treated and control groups, the third step was to define the time periods for “pre” and “post” event (pre = January 1–April 30; post = June 1–August 31), representing before and after the initial cryptocurrency crash in May 2022. All of the tweets were classified accordingly, and tweets between these two periods were omitted from the study. This choice of period allows us to look at persistent changes in investor sentiment, as opposed to the more transient changes that may have been observed during the omitted intermediate period.

Finally, the various affective states considered in the study were quantified as “positive,” “neutral,” “negative,” and an aggregate measure. This was achieved by performing sentiment analysis of the tweets using the Sentiment Intensity Analyzer tool from the Natural Language Toolkit (NLTK) module in Python. Individual tweets were tokenized, lemmatized, and stemmed, and all usernames were removed from the tweets. The output of the Sentiment Intensity Analyzer tool consists of four scores that quantify the presence of specific affective states in tweets: “positive,” “neutral,” “negative,” and “compound” (an aggregate measure). The scores assigned to each affective state come from the unit interval [0, 1], with the exception of neutral sentiment, which is mapped to the interval \([-1,1]\). For more details on performing sentiment analysis, please see Feldman (2013). For replicability, all code pertaining to this study is available from https://github.com/cat-astrophic/cryptocrash.

Relating these affective states to investor sentiment, if the cryptocurrency crash had negatively impacted investor sentiment, one consequence we might expect to observe would be relatively less positive sentiment expressed in tweets, meaning a greater prevalence of negative and neutral tweets in the post-crash period. This is because typically positive herding-type cryptocurrency enthusiasts may have either a higher sentiment baseline or employ a more positively measured diction relative to other users, including other herding-type cryptocurrency enthusiasts. Tweets by these users may become more “neutral,” meaning that although they no longer express explicitly positive sentiment on Twitter, they do not necessarily express explicitly negative sentiment. A practical example of this would be unimpassioned appeals within the herding-type investor community to hold a course that does not explicitly express dismay at the current state of the cryptocurrency market.

In addition to these four broad affective states, more nuanced emotion-specific variables were created in the same manner as the data for the four broad affective states, namely “anger,” “disgust,” “fear,” “sadness,” “negativity,” “surprise,” ‘trust,” “joy,” and “positivity.” As was the case for the broader affective states discussed above, these emotions were calculated using the NLTK module in Python and followed the data preprocessing methods outlined above.

Finally, we acquired data on the number of tweets that each user tweeted during each period. These data are included because significant results indicate that cryptocurrency enthusiasts changed not only their sentiment but also their behavior regarding Twitter usage.

Identification strategy

Given the nature of the research question and the data, two sets of ID models were used to determine whether cryptocurrency enthusiasts behaved fundamentally differently from traditional investors. The standard interpretation of the DID estimator is the average treatment effect of the treated units (ATT). However, in the context of this study, where the treated units are cryptocurrency enthusiasts and the control units are traditional investors, this tells us whether there is a differential response to the cryptocurrency crash between the two groups. If so, these two groups behave fundamentally differently from one another and thus represent two distinct types of investors.

In the first model in each set, we estimated the effect of cryptocurrency crashes on the mean tweet sentiment across all Twitter users in the sample. Here, the model is specified as shown in Eq. 9, where \(Y^{s}_{x,i,t}\) is the mean score for affective state s for tweet x by user i in period t, \(Treated_{x}\) is a dichotomous variable indicating whether tweet x was written by a herding-type cryptocurrency enthusiast, \(Post_{t}\) is a dichotomous variable indicating whether the day t of tweet x was before or after the cryptocurrency crash, \(\delta _{i}\) is a user-specific fixed effect, \(\epsilon _{i,t}\) is the error term, and \(\beta\) is the parameter of interest. Separate regressions were run for each affective state, and this approach was used for both the sentiment and emotion analyses.

$$\begin{aligned} Y^{s}_{x,i,t} = \beta \cdot (Treated_{x}\times Post_{t}) + \rho \cdot Treated_{x} + \tau \cdot Post_{t} + \gamma \cdot \Delta BTC_{t} + \delta _{i} + \epsilon _{x,i,t} \end{aligned}$$
(9)

To provide additional support for these regressions, we estimate the regression shown in Eq. 10, where we examine the user-level average values for each affective state in each of the two time periods. \(\beta\) remains the parameter of interest.

$$\begin{aligned} Y^{s}_{i,t} = \beta \cdot (Treated_{i}\times Post_{t}) + \rho \cdot Treated_{i} + \tau \cdot Post_{t} + \gamma \cdot \Delta BTC_{t} + \delta _{i} + \epsilon _{i,t} \end{aligned}$$
(10)

To test for another potential change in behavior among herding-type cryptocurrency enthusiasts, another DID model was specified, as shown in Eq. 11. Here, \(Tweets_{i,t}\) becomes the dependent variable; otherwise, we follow the specification from Eq. 10 and are again interested in estimating the parameter \(\beta\). Changes in the frequency at which a tweet can be an additional indicator of sentiment have not been previously considered in the literature.

$$\begin{aligned} Tweets_{i,t} = \beta \cdot (Treated_{i}\times Post_{t}) + \rho \cdot Treated_{i} + \tau \cdot Post_{t} + \gamma \cdot \Delta BTC_{t} + \delta _{i} + \epsilon _{i,t} \end{aligned}$$
(11)

Finally, these models cannot be presented without a discussion of endogeneity. When running a DID model, a key assumption for causality is that there can be no self-selection in the treatment. In this study, individual investors can choose whether they are willing to participate in a broader cryptocurrency culture (type \(\theta _{E}\) investors in the theoretical model). However, we do not estimate the causal effect of a policy but rather exploit an exogenous market shock to directly observe the differential responses of two distinct groups of investors to this shock. Therefore, the only potential endogeneity concern in this paper lies in potential correlations between the use of the term “wagmi” in tweets and the sentiment of the words surrounding “wagmi” in the tweet. However, as will be shown in the following section, if anything, this study underestimates the magnitude of the differential response to the May 2022 cryptocurrency crash between cryptocurrency traders and traditional investors.

Results

This section presents and discusses the regression results and textual evidence suggestive of herding behavior. First, we focus on the results of the tweet- and user-level regressions for broad affective states (i.e., compound, positive, negative, and neutral). The results are presented in Tables 2 and 3, respectively. Next, we take a more nuanced look at these affective states using the results from the tweet- and user-level regressions for the presence of specific emotions in the tweets. The results are presented in Tables 4 and 5, respectively. Third, we address the results of the regressions on the frequency at which users tweet (see Table 6). Finally, we analyze the specific textual content of the tweets and provide evidence of herding among herding-type investors but not among traditional investors.

Broad affective states

Beginning with the regressions for the four broad affective states (Tables 2 and 3), cryptocurrency enthusiasts saw a decrease and increase in negative sentiments and neutral sentiments in their tweets, respectively. The increase in neutral sentiment should not be surprising as it can be explained by a more subdued discourse among cryptocurrency enthusiasts that contains both increases in negative sentiment attributable to the cryptocurrency crash and the deliberately positive, collectivist, and perhaps even dogmatic, “wagmi” mantra. Conversely, the decrease in negative sentiment might be surprising given the negative nature of the cryptocurrency crash and its impact on cryptocurrency enthusiasts. Given that the cryptocurrency enthusiast community made a deliberate, collective effort to stay positive (“wagmi”), a decrease in negative sentiment makes sense. Since “wagmi” is a deliberate positive rallying cry, its use appears to have offset a decline in positive sentiment, leading to statistically insignificant results for both positive sentiment and the compound score. This is particularly important because the decrease in the price of Bitcoin (assuming it is correlated with investor sentiment), may have been partially offset by the collective effort to hold Bitcoin, despite the financial implications of the herding-type cryptocurrency enthusiasts, thus validating the model presented in Sect. "Theoretical Motivation".

These results suggest that cryptocurrency enthusiasts behave in a fundamentally different manner compared to traditional investors. From an economic policy standpoint, this is particularly important because it suggests the potential for herding. (The next section of this paper explores herding further.) A market flooded with participants who engage in herding behavior is more likely to have bubbles and eventually runs. As the cryptocurrency market is still largely unregulated, there is a need for regulations to prevent runs. While this would seem a dramatic claim in other contexts, a run in the cryptocurrency market occurred between the beginning of this study and its publication.Footnote 5

In the user-level regressions (Table 3), we can see that cryptocurrency enthusiasts are overall more positive, less negative, and less neutral and have higher compound scores than traditional investors. The statistical insignificance of the treated indicator in the tweet-level regressions suggests that user-level fixed effects account for the differences between the two user types. We also find that the change in the price of the Bitcoin variable was statistically significant and negative for neutral sentiment. This suggests that increased emotionality was present among finance-oriented Twitter users when Bitcoin prices went up.

These results confirm and build on the findings of previous studies (e.g., (Baker and Wurgler 2007)) that link investor sentiment to market conditions. In particular, these results expand the literature by showing that investor sentiment responds to market conditions and that there is profound heterogeneity in responses to changes in market conditions across different types of investors.

Table 2 Results for the tweet-level regressions for change in tweet sentiment
Table 3 Results for the user-level regressions for change in tweet sentiment

Emotions

Next, we focus on the emotion-specific regression results (Tables 4 and 5), which indicate that cryptocurrency enthusiasts experienced significant changes in their expression of two specific emotions: surprise and joy. Specifically, cryptocurrency enthusiasts expressed less surprise and joy in their tweets than more traditional investors. These results, especially the decrease in the expression of joy, suggest that there is indeed some fundamental difference between how cryptocurrency entrants and traditional investors invest in, utilize, and experience cryptocurrencies such as Bitcoin. The decrease in the expression of surprise is interesting because it suggests that cryptocurrency enthusiasts may have begun to anticipate the persistence of cryptocurrency crashes. This implies that cryptocurrency enthusiasts possibly held Bitcoin despite poor expectations of its future performance. These results strongly support the model provided in Sect. "Theoretical Motivation" in which cryptocurrency enthusiasts derive extra utility simply from holding cryptocurrency assets.

Similar to the regressions for the four broad affective states, the user-level regressions suggest stark differences in how the two groups communicate. Cryptocurrency opportunists appear to express less anger, disgust, fear, surprise, trust, joy, and positivity and tend to express more sadness and negativity. Finally, changes in the price of Bitcoin lead to a decrease in disgust and fear, which, in turn, results in an increase in trust. These results confirm the existing literature on the psychology of cryptocurrency enthusiasts.

As in the previous subsection, these results confirm and build on the literature that links investor sentiment and market conditions. In particular, we find that (1) the cryptocurrency crash caused a decrease in the expression of surprise and joy among herding-type cryptocurrency enthusiasts and (2) herding-type cryptocurrency enthusiasts have a distinct emotional profile compared to traditional investors. Cryptocurrency enthusiasts are prone to express themselves in sadder and more negative ways, with less trust, joy, anger, disgust, fear, and surprise than traditional investors. This suggests that a certain type of person (i.e., a certain set of personality traits) self-selects into a herding-type cryptocurrency group.

Table 4 Results for the tweet-level regressions for change in tweet emotion content
Table 5 Results for the user-level regressions for change in tweet emotion content

Tweet frequency

The final set of regressions examines the actual tweet behavior of users by studying the frequency of their tweets. Here, we see that cryptocurrency enthusiasts increased the frequency with which they tweet by over 100 tweets during the post-cryptocurrency crash period compared to the control group, which translates to an increase of more than one tweet per day relative to that of traditional investors. As shown in Table 6, these results are highly consistent across the specifications, demonstrating their robustness to the sentiments contained in the tweets. Moreover, they suggest that behavioral changes in cryptocurrency enthusiasts may be numerous and correlated as we found changes in both sentiment/emotionality and tweet frequency attributed to the same event. This builds on the existing literature by providing the first evidence that market conditions differentially affect investors’ use of social media when discussing investment-related topics.

Table 6 Results for the regressions for the volume of tweets by users

Textual evidence of herding

In this section, we present evidence suggesting the presence of herding among cryptocurrency enthusiasts by analyzing the specific textual content of tweets. To this end, we apply a manually augmented hierarchical clustering method to the most frequent terms found in the tweets using the following process.

First, after following the same data preprocessing steps outlined in the methodology for performing sentiment analysis; that is, tokenization, lemmatization, and stemming, a bag of (unique) words was created for each group. Once the unique words were identified, the frequency with which they appeared in the tweets was computed, and words appearing in at least 1/1000 tweets were identified. In total, 19 words are associated with traditional investors, whereas 57 are associated with cryptocurrency enthusiasts.

To provide evidence of herding, these frequent terms were classified using a hierarchical clustering method from SciPy in Python (scipy.cluster.hierarchy). This algorithm clusters terms based on their co-occurrence in tweets. The results (classes) of this algorithm were then manually updated to the final classes listed in Table 7.

Table 7 Final term classes from the herding analysis

These classifications support the notion of herding for two primary reasons. First, the disjoint nature of terms between the two groups of investors suggests that cryptocurrency enthusiasts represent their own “clique” within the online investing community. Second, across the classes for the terms commonly used by cryptocurrency enthusiasts, clear themes emerge as the dominating discourse. Class 1, a class of terms related to cryptocurrencies, is not surprising and does not necessarily imply the existence of herding behavior. However, Classes 2 and 3 suggest otherwise. Regarding Class 2, the fact that the NFT bubble was not observed as a common topic of discourse among traditional investors but was important enough to constitute its own class among cryptocurrency enthusiasts is qualitative evidence suggesting that cryptocurrency enthusiasts engage in herding behavior, at least regarding NFTs. Class 3 (i.e., the (“wagmi” class) suggests that this behavior extends to cryptocurrencies as well since it is, by definition, representative of the discourse related to holding cryptocurrency despite the nature of the market at that time. This is direct evidence of herding behavior among cryptocurrency enthusiasts but not traditional investors in the cryptocurrency market in the aftermath of the cryptocurrency crash in May 2022.

Finally, other important trends became apparent during the analysis. First, cryptocurrency enthusiasts use more current Internet vocabulary than traditional investors do. Examples include the use of emojis; no emojis were among the most frequent terms used by traditional investors, while five emojis appeared among the most common terms used by cryptocurrency enthusiasts. While this certainly reflects a significant cultural difference between the two groups, it could also reflect meaningful demographic differences. These differences and the elevated risk-seeking behavior observed among cryptocurrency enthusiasts fits the social identity model of risk-taking (Cruwys et al. 2021).

The second theme that emerged is the gendered nature of online investment communities. “He,” “bro,” “guy,” “ser,” “fam,” and “they,” were all among the most commonly used words used by the two groups in this study, yet no female-gendered words (e.g., “she”) appeared among the most common words. This suggests that online investment communities are largely male-dominated.

The last trend that emerged was within the community of cryptocurrency enthusiasts, concerns the use of the term “ser.” This term is used as a synonym for “sir,” but it also has a racist second meaning; it is used to mock Indian and East Asian cryptocurrency enthusiasts for their relatively more frequent use of “sir” in the online discourse (Limbu 2022). To emphasize the toxic nature of the online cryptocurrency enthusiast community, the source cited on the previous line, an article published on the popular cryptocurrency news site Blockchain.News, classified this behavior as “trolling,” minimizing the iniquity.

Conclusion

In the aftermath of the cryptocurrency crash of 2022, investor sentiment among cryptocurrency generators has changed relative to that of traditional investors, specifically, an increase in neutral sentiment and, surprisingly, a decrease in negative sentiment. This is particularly significant as the deliberate, collectivist approach to publicly displaying positivity and holding Bitcoin (“wagmi”) could have mitigated the magnitude of the crash to a small extent. These findings are also important as they provide further support that cryptocurrency enthusiasts will hold on to a cryptocurrency even when they could earn better returns by investing elsewhere. These results validate the model described in Sect. "Theoretical Motivation" of this paper. In summary, cryptocurrency enthusiasts and traditional investors exhibit visibly distinct behavioral patterns.

In addition to changes in investor sentiment, two other changes were observed in the behavior of cryptocurrency enthusiasts. First, there were changes in the specific emotional content of their tweets, specifically a decrease in surprise and joy. Second, the frequency at which these cryptocurrency enthusiasts tweet increased in the aftermath of the cryptocurrency crash of 2022, suggesting that public displays of loyalty to Bitcoin and/or the Bitcoin community are an important cultural practice that manifests itself in herding behavior. This reinforces the notion that herding and other collectivist behaviors are central to cryptocurrency community membership.

From a policy perspective, cryptocurrency markets must be regulated. The prevalence of herding behavior among cryptocurrency enthusiasts is not only present but also a core cultural component in this community. This could lead to the formation of bubbles and subsequent runs. As stated in the body of this paper, runs are not an abstract and unlikely concern but an observed consequence of this behavior. Given the gradually increasing role of cryptocurrencies in traditional portfolios, a failure to regulate the cryptocurrency market could lead to spillovers to other markets and negatively impact all investors. Only months after the cryptocurrency crash studied in this paper, the FTX Exchange collapsed, a bank run occurred, multiple leading global figures in the cryptocurrency market (as of May 2022) are now in prison, illicit activities have been financed, and taxpayers have lost billions of dollars bailing out cryptocurrency investors.

Another implication of this study is that we can identify potential herding-type cryptocurrency investors via social media. As researchers continue to study herding and other disconcerting phenomena in markets, this can be useful for various reasons, including targeting individuals for surveys or online experiments on social media. Additionally, the ability to identify herding investors on social media could allow targeted nudges designed to prevent herding in markets and increase market efficiency.

This study has two limitations. First, the herding results are largely, although not exclusively, qualitative. Causal analysis of herding behavior would be an excellent extension of this study. Second, the main results from the DID models may actually understate the true effects of the cryptocurrency crash on cryptocurrency enthusiasts as they deliberately emphasized positivity (“wagmi”), which could have impacted the sentiment scores assigned to tweets and users in this data set. An econometric consequence is a potential downward bias in the point estimates for negativity and a potential upward bias in the point estimates for positivity. If these biases are present, this further confirms the conclusions drawn in this study, and further analyses of this (and other related) phenomenon would be valuable extensions of this research. One possible way to expand the scope of this analysis is to collect data from a broader set of source materials.

So far, the importance of the results has been framed in the context of fundamental differences between traditional investors and cryptocurrency enthusiasts regarding their perceptions of, experiences with, and motivations for investing in cryptocurrencies and the consequential regulatory implications. While this consequence is incredibly important, there is another potential consequence of these results. While the literature is not conclusive regarding the sentiment of social media posts and mental health, if such a relationship exists, these results suggest that the mental health of cryptocurrency enthusiasts can be linked to the state of cryptocurrency markets. This is concerning for both financial and humanitarian reasons.

Availability of data and materials

All data and code is available at https://github.com/cat-astrophic/cryptocrash.

Notes

  1. Sam Bankman-Fried and Do Kwon are the two most notable, but many other fraud schemes have been caught across the world.

  2. I’d like to thank a particularly dedicated reviewer for suggesting these four papers.

  3. I’d like to thank another particularly dedicated reviewer for suggesting these four papers and bringing my attention to all that they imply.

  4. This paper was also suggested by one of the afore-thanked reviewers.

  5. This is a reference to the FTX collapse and ensuing run.

References

  • Abraham J, Higdon D, Nelson J, Ibarra J (2018) Cryptocurrency price prediction using tweet volumes and sentiment analysis. SMU Data Sci Rev 1(3):1

    Google Scholar 

  • Ahn Y, Kim D (2021) Emotional trading in the cryptocurrency market. Financ Res Lett 42:101912

    Article  Google Scholar 

  • Al Shehhi A, Oudah M, Aung Z (2014) Investigating factors behind choosing a cryptocurrency. In 2014 IEEE international conference on industrial engineering and engineering management, pp 1443–1447. IEEE

  • Anastasiou D, Ballis A, Drakos K (2021) Cryptocurrencies’ price crash risk and crisis sentiment. Financ Res Lett 42:101928

    Article  Google Scholar 

  • Ante L (2023) How elon musk’s twitter activity moves cryptocurrency markets. Technol Forecast Soc Chang 186:122112

    Article  Google Scholar 

  • Avery C, Chevalier J (1999) Identifying investor sentiment from price paths: the case of football betting. J Bus 72(4):493–521

    Article  Google Scholar 

  • Baker M, Wurgler J (2006) Investor sentiment and the cross-section of stock returns. J Financ 61(4):1645–1680

    Article  Google Scholar 

  • Baker M, Wurgler J (2007) Investor sentiment in the stock market. J Econ Perspect 21(2):129–152

    Article  Google Scholar 

  • Baker M, Wurgler J, Yuan Y (2012) Global, local, and contagious investor sentiment. J Financ Econ 104(2):272–287

    Article  Google Scholar 

  • Blasco N, Corredor P, Satrústegui N (2022) The witching week of herding on bitcoin exchanges. Financ Innov 8(1):1–18

    Article  Google Scholar 

  • Bouri E, Gupta R, Roubaud D (2019) Herding behaviour in cryptocurrencies. Financ Res Lett 29:216–221

    Article  Google Scholar 

  • Bouri E, Jalkh N, Molnár P, Roubaud D (2017) Bitcoin for energy commodities before and after the december 2013 crash: diversifier, hedge or safe haven? Appl Econ 49(50):5063–5073

    Google Scholar 

  • Cary M (2021) Down with the #dogefather: Evidence of a cryptocurrency responding in real time to a crypto-tastemaker. J Theor Appl Electron Commer Res 16(6):2230–2240

    Article  Google Scholar 

  • Chaim P, Laurini MP (2019) Is bitcoin a bubble? Phys A 517:222–232

    Article  Google Scholar 

  • Cross JL, Hou C, Trinh K (2021) Returns, volatility and the cryptocurrency bubble of 2017–18. Econ Model 104:105643

    Article  Google Scholar 

  • Cruwys T, Greenaway KH, Ferris LJ, Rathbone JA, Saeri AK, Williams E, Parker SL, Chang MX, Croft N, Bingley W et al (2021) When trust goes wrong: a social identity model of risk taking. J Pers Soc Psychol 120(1):57

    Article  Google Scholar 

  • Da Z, Engelberg J, Gao P (2015) The sum of all fears investor sentiment and asset prices. Rev Financ Stud 28(1):1–32

    Article  Google Scholar 

  • da Gama Silva PVJ, Klotzle MC, Pinto ACF, Gomes LL (2019) Herding behavior and contagion in the cryptocurrency market. J Behav Exp Financ 22:41–50

    Article  Google Scholar 

  • De Blasis R, Galati L, Webb A, Webb RI (2023) Intelligent design: stablecoins (in) stability and collateral during market turbulence. Financ Innov 9(1):85

    Article  Google Scholar 

  • Dodd N (2018) The social life of bitcoin. Theory, Culture Soc 35(3):35–56

    Article  Google Scholar 

  • Feldman R (2013) Techniques and applications for sentiment analysis. Commun ACM 56(4):82–89

    Article  Google Scholar 

  • Foley S, Karlsen JR, Putniņš TJ (2019) Sex, drugs, and bitcoin: how much illegal activity is financed through cryptocurrencies? Rev Financ Stud 32(5):1798–1853

    Article  Google Scholar 

  • Frost J, Shin HS, Wierts P (2020) An early stablecoin? the bank of amsterdam and the governance of money

  • Gao Y, Zhao C, Sun B, Zhao W (2022) 0effects of investor sentiment on stock volatility: new evidences from multi-source data in china’s green stock markets. Financial Innovation 8(1):1–30

    Article  Google Scholar 

  • Giudici G, Milne A, Vinogradov D (2020) Cryptocurrencies: market analysis and perspectives. J Ind Bus Econ 47(1):1–18

    Article  Google Scholar 

  • Haykir O, Yagli I (2022) Speculative bubbles and herding in cryptocurrencies. Financ Innov 8(1):1–33

    Article  Google Scholar 

  • Judge B, Eichengreen B, Zysman J (2023) The mirage of decentralized finance. Available at SSRN 4459315

  • Kallinterakis V, Wang Y (2019) Do investors herd in cryptocurrencies-and why? Res Int Bus Financ 50:240–245

    Article  Google Scholar 

  • Kou G, Dinçer H, Yüksel S, Alotaibi FS (2024) Imputed expert decision recommendation system for qfd-based omnichannel strategy selection for financial services. Int J Inf Technol Decis Mak 23(01):141–170

    Article  Google Scholar 

  • Kou G, Olgu Akdeniz Ö, Dinçer H, Yüksel S (2021) Fintech investments in european banks: a hybrid it2 fuzzy multidimensional decision-making approach. Financ Innov 7(1):39

    Article  Google Scholar 

  • Kou G, Pamucar D, Dinçer H, Yüksel S (2023) From risks to rewards: a comprehensive guide to sustainable investment decisions in renewable energy using a hybrid facial expression-based fuzzy decision-making approach. Appl Soft Comput 142:110365

    Article  Google Scholar 

  • Kou G, Xu Y, Peng Y, Shen F, Chen Y, Chang K, Kou S (2021) Bankruptcy prediction for smes using transactional data and two-stage multiobjective feature selection. Decis Support Syst 140:113429

    Article  Google Scholar 

  • Lamon C, Nielsen E, Redondo E (2017) Cryptocurrency price prediction using news and social media sentiment. SMU Data Sci Rev 1(3):1–22

    Google Scholar 

  • Lee S, Lee J, Lee Y (2023) Dissecting the terra-luna crash: evidence from the spillover effect and information flow. Financ Res Lett 53:103590

    Article  Google Scholar 

  • Li T, Chen H, Liu W, Yu G, Yu Y (2023) Understanding the role of social media sentiment in identifying irrational herding behavior in the stock market. Int Rev Econ Financ 87:163–179

    Article  Google Scholar 

  • Li T, Kou G, Peng Y, Philip SY (2021) An integrated cluster detection, optimization, and interpretation approach for financial data. IEEE Trans Cybern 52(12):13848–13861

    Article  Google Scholar 

  • Limbu A (2022) Slangs used in crypto space

  • Liu B et al (2010) Sentiment analysis and subjectivity. Handbook of natural language processing. Chapman and Hall/CRC, Boca Raton, pp 627–666

    Google Scholar 

  • Liu J, Makarov I, Schoar A (2023) Anatomy of a run: The terra luna crash. Technical report, National Bureau of Economic Research

  • Mai F, Shan Z, Bai Q, Wang X, Chiang RH (2018) How does social media impact bitcoin value? A test of the silent majority hypothesis. J Manag Inf Syst 35(1):19–52

    Article  Google Scholar 

  • Mas-Colell A, Whinston MD, Green JR et al (1995) Microeconomic theory, vol 1. Oxford University Press, New York

    Google Scholar 

  • Mattke J, Maier C, Reis L, Weitzel T (2021) Bitcoin investment: a mixed methods study of investment motivations. Eur J Inf Syst 30(3):261–285

    Article  Google Scholar 

  • Mislove A, Lehmann S, Ahn Y-Y, Onnela J-P, Rosenquist J (2011) Understanding the demographics of twitter users. In: Proceedings of the international AAAI conference on web and social media. vol 5, pp 554–557

  • Obreja DM (2022) The social side of cryptocurrency: exploring the investors’ ideological realities from romanian facebook groups. New Med Soc 26(5):14614448221092028

    Google Scholar 

  • Ooi SK, Ooi CA, Yeap JA, Goh TH (2021) ‘Embracing bitcoin: users’ perceived security and trust. Qual Quant 55(4):1219–1237

    Article  Google Scholar 

  • Philippas D, Rjiba H, Guesmi K, Goutte S (2019) Media attention and bitcoin prices. Financ Res Lett 30:37–43

    Article  Google Scholar 

  • Phillips RC, Gorse D (2017) Predicting cryptocurrency price bubbles using social media data and epidemic modelling. In: 2017 IEEE symposium series on computational intelligence (SSCI), pp 1–7. IEEE

  • Raimundo G Jr, Palazzi RB, Tavares RdS, Klotzle MC (2022) Market stress and herding: a new approach to the cryptocurrency market. J Behav Financ 23(1):43–57

    Article  Google Scholar 

  • Rognone L, Hyde S, Zhang SS (2020) News sentiment in the cryptocurrency market: an empirical comparison with forex. Int Rev Financ Anal 69:101462

    Article  Google Scholar 

  • Seok SI, Cho H, Ryu D (2019) Firm-specific investor sentiment and daily stock returns. N Am J Econ Financ 50:100857

    Article  Google Scholar 

  • Shu M, Song R, Zhu W (2021) The 2021 bitcoin bubbles and crashes-detection and classification. Stats 4(4):950–970

    Article  Google Scholar 

  • Starkey AJ, Keane CR, Terry MA, Marx JH, Ricci EM (2013) Financial distress and depressive symptoms among African American women: identifying financial priorities and needs and why it matters for mental health. J Urban Health 90(1):83–100

    Article  Google Scholar 

  • Vicente P (2023) Sampling twitter users for social science research: evidence from a systematic review of the literature. Qual Quant 57:1–41

    Article  Google Scholar 

  • Vidal-Tomás D, Ibáñez AM, Farinós JE (2019) Herding in the cryptocurrency market: CSSD and CSAD approaches. Financ Res Lett 30:181–186

    Article  Google Scholar 

  • Xie H, Lin W, Lin S, Wang J, Yu L-C (2021) A multi-dimensional relation model for dimensional sentiment analysis. Inf Sci 579:832–844

    Article  Google Scholar 

  • Xu H-C, Zhou W-X (2018) A weekly sentiment index and the cross-section of stock returns. Financ Res Lett 27:135–139

    Article  Google Scholar 

  • Yang SY, Mo SYK, Liu A (2015) Twitter financial community sentiment and its predictive relationship to stock market movement. Quant Financ 15(10):1637–1656

    Article  Google Scholar 

  • Yousaf I, Ali S (2020) The covid-19 outbreak and high frequency information transmission between major cryptocurrencies: evidence from the var-dcc-garch approach. Borsa Istanbul Rev 20:S1–S10

    Article  Google Scholar 

  • Yousaf I, Ali S (2020) Discovering interlinkages between major cryptocurrencies using high-frequency data: new evidence from covid-19 pandemic. Financ Innov 6(1):1–18

    Article  Google Scholar 

  • Yousaf I, Ali S (2021) Linkages between stock and cryptocurrency markets during the covid-19 outbreak: an intraday analysis. Singap Econ Rev. https://doi.org/10.1142/S0217590821470019

    Article  Google Scholar 

  • Yousaf I, Ali S, Shah SZA (2018) Herding behavior in Ramadan and financial crises: the case of the Pakistani stock market. Financ Innov 4(1):1–14

    Article  Google Scholar 

  • Yue W, Zhang S, Zhang Q (2021) Asymmetric news effects on cryptocurrency liquidity: an event study perspective. Financ Res Lett 41:101799

    Article  Google Scholar 

  • Zhang S, Zhou X, Pan H, Jia J (2019) Cryptocurrency, confirmatory bias and news readability-evidence from the largest Chinese cryptocurrency exchange. Account Financ 58(5):1445–1468

    Article  Google Scholar 

  • Zhou G (2018) Measuring investor sentiment. Annu Rev Financ Econ 10:239–259

    Article  Google Scholar 

Download references

Acknowledgements

I would like to thank the reviewers for the information they shared throughout the review process.

Funding

No external funding was used in this research.

Author information

Authors and Affiliations

Authors

Contributions

All work is the responsibility of Michael Cary.

Corresponding author

Correspondence to Michael Cary.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

I consent to publication in Financial Innovation.

Competing interests

No competing interests are declared

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Appendix

Derivation of Eq. 6

$$\begin{aligned} \begin{aligned} \frac{\dot{W}}{W}&= \frac{W_{t} - W_{t-1}}{W_{t}}\\&= \frac{(1+r_{I})W_{t-1} - C_{t} - W_{t-1}}{W_{t}}\\&= \frac{r_{I}W_{t-1}}{W_{t}} - \frac{C_{t}}{W_{t}}\\&= \frac{r_{I}W_{t-1}}{(1+r_{I})W_{t-1} - \rho [(1+r_{I})W_{t-1} - \rho W_{t}]} - \rho \\&= \frac{r_{I}W_{t-1}}{(1-\rho +\rho ^{2}-\dots )[(1+r_{I})W_{t-1} - (1+r_{I})W_{t-1}]} - \rho \\&= \left( 1+\rho \right) \left( \frac{r_{I}}{1+r_{I}}\right) - \rho \end{aligned} \end{aligned}$$

Derivation of Eq. 7

$$\begin{aligned} \begin{aligned} \frac{\dot{W}}{W}&= \frac{W_{t} - W_{t-1}}{W_{t}}\\&= \frac{(1+r_{I})I_{t-1} + (1+r_{B})B_{t-1} - C_{t} - W_{t-1}}{W_{t}}\\&= \frac{(1+r_{I})(W_{t-1} - B_{t-1}) + (1+r_{B})B_{t-1} - W_{t-1}}{W_{t}} - \frac{C_{t}}{W_{t}}\\&= \frac{r_{I}W_{t-1} + (r_{B}-r_{I})B_{t-1}}{W_{t}} - \rho \\&= \frac{r_{I}W_{t-1} + (r_{B}-r_{I})B_{t-1}}{(1+r_{I})(W_{t-1} - B_{t-1}) + (1+r_{B})B_{t-1} - \rho W_{t}} - \rho \\&= \frac{r_{I}W_{t-1} + (r_{B}-r_{I})B_{t-1}}{(1+r_{I})W_{t-1} + (r_{B}-r_{I})B_{t-1} - \rho W_{t}} - \rho \\&= \frac{r_{I}W_{t-1} + (r_{B}-r_{I})B_{t-1}}{(1+r_{I})W_{t-1} + (r_{B}-r_{I})B_{t-1} - \rho W_{t}} - \rho \\&= \frac{r_{I}W_{t-1} + (r_{B}-r_{I})B_{t-1}}{(1-\rho +\rho ^{2}-\dots )[(1+r_{I})W_{t-1} + (r_{B}-r_{I})B_{t-1}]} - \rho \\&= \left( 1+\rho \right) \left( \frac{r_{I}W_{t-1} + (r_{B}-r_{I})B_{t-1}}{(1+r_{I})W_{t-1} + (r_{B}-r_{I})B_{t-1}}\right) - \rho \\&= \left( 1+\rho \right) \left( \frac{r_{I}W + (r_{B}-r_{I})B}{(1+r_{I})W + (r_{B}-r_{I})B}\right) - \rho \\ \end{aligned} \end{aligned}$$

Derivation of Eq. 8

First, we derive the difference between \(\frac{\dot{W}_{I}}{W_{I}}\) and \(\frac{\dot{W}_{E}}{W_{E}}\) as

$$\begin{aligned} \begin{aligned} \frac{\dot{W}_{I}}{W_{I}} - \frac{\dot{W}_{E}}{W_{E}}&= \left( 1+\rho \right) \left( \frac{r_{I}}{1+r_{I}}\right) - \rho - \left( 1+\rho \right) \left( \frac{r_{I}W + (r_{B}-r_{I})B}{(1+r_{I})W + (r_{B}-r_{I})B}\right) + \rho \\&= (1+\rho )\left( \frac{r_{I} + r_{I}^{2} + \left( \frac{\alpha _{B}}{\alpha _{W}}\right) r_{B}r_{I} - \left( \frac{\alpha _{B}}{\alpha _{W}}\right) r_{I}^{2} - r_{I} - \left( \frac{\alpha _{B}}{\alpha _{W}}\right) r_{B} + \left( \frac{\alpha _{B}}{\alpha _{W}}\right) r_{I} - r_{I}^{2} - \left( \frac{\alpha _{B}}{\alpha _{W}}\right) r_{B}r_{I} + \left( \frac{\alpha _{B}}{\alpha _{W}}\right) r_{I}^{2}}{(1+r_{I})\left( 1+r_{I} +\left( \frac{\alpha _{B}}{\alpha _{W}}\right) (r_{B}-r_{I})\right) }\right) \\&= \frac{(1+\rho )\left( \frac{\alpha _{B}}{\alpha _{W}}\right) (r_{I}-r_{B})}{(1+r_{I}) \left( 1+r_{I}+\left( \frac{\alpha _{B}}{\alpha _{W}}\right) (r_{B}-r_{I})\right) } \end{aligned} \end{aligned}$$

All that remains to be shown is that the difference is positive. First, we consider the numerator. As \(\rho > 0\), the term \(1+\rho\) is positive. Similarly, because \(\alpha _{B}>0\) and \(\alpha _{W}>0\), ratio \(\frac{\alpha _{B}}{\alpha _{W}}\) is positive. Finally, assuming that \(r_{I} > r_{B}\), the difference \(r_{I}-r_{B}\) is positive. This means that the numerator is the product of three positive terms and is therefore positive.

Moving on to the denominator and following an argument similar to that showing that the numerator is positive, we must show that \(1+r_{I} > \left( \frac{\alpha _{B}}{\alpha _{W}}\right) (r_{I}-r_{B})\). Since \(0<(r_{I}-r_{B})<r_{I}\), and since \(0\le \left( \frac{\alpha _{B}}{\alpha _{W}}\right) \le 1\) it follows that \(1+r_{I}>r_{I}\ge \left( \frac{\alpha _{B}}{\alpha _{W}}\right) (r_{I}-r_{B})\) and thus the claim that \(\frac{\dot{W}_{I}}{W_{I}} - \frac{\dot{W}_{E}}{W_{E}} > 0\) is verified.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cary, M. Herding and investor sentiment after the cryptocurrency crash: evidence from Twitter and natural language processing. Financ Innov 10, 142 (2024). https://doi.org/10.1186/s40854-024-00663-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s40854-024-00663-x

Keywords

JEL classification