- Research
- Open Access
- Published:

# DeepPricing: pricing convertible bonds based on financial time-series generative adversarial networks

*Financial Innovation*
**volumeÂ 8**, ArticleÂ number:Â 64 (2022)

## Abstract

Convertible bonds are an important segment of the corporate bond market, however, as hybrid instruments, convertible bonds are difficult to value because they depend on variables related to the underlying stock, the fixed-income part, and the interaction between these components. Besides, embedded options, such as conversion, call, and put provisions are often restricted to certain periods, may vary over time, and are subject to additional path-dependent features of the state variables. Moreover, the most challenging problem in convertible bond valuation is the underlying stock return process modeling as it retains various complex statistical properties. In this paper, we propose *DeepPricing*, a novel data-driven convertible bonds pricing model, which is inspired by the recent success of generative adversarial networks (GAN), to address the above challenges. The method introduces a new financial time-series generative adversarial networks (*FinGAN*), which is able to reproduce risk-neutral stock return process that retains the unique statistical properties such as the fat-tailed distributions, the long-range dependence, and the asymmetry structure etc., and then transit to its risk-neutral distribution. Thus it is more flexible and accurate to capture the dynamics of the underlying stock return process and keep the rich set of real-world convertible bond specifications compared with previous model-driven models. The experiments on the Chinese convertible bond market demonstrate the effectiveness of *DeepPricing* model. Compared with the convertible bond market prices, our model has a better convertible bonds pricing performance than both model-driven models, i.e. Black-Scholes, the constant elasticity of variance, GARCH, and the state-of-the-art GAN-based models, i.e. FinGAN-MLP, FinGAN-LSTM. Moreover, our model has a better fitting capacity for higher-volatility convertible bonds and the overall convertible bond market implied volatility smirk, especially for equity-liked convertible bonds, convertible bonds trading in the bull market, and out-of-the-money convertible bonds. Furthermore, the *Long-Short* and *Long-Only* investment strategies based on our model earn a significant annualized return with 41.16% and 31.06%, respectively, for the equally-weighted portfolio during the sample period.

## Introductions

As an important part of the corporate bond market, convertible bonds with both equity- and debt-like properties have grown in popularity in the financial market. However, compared with the nearly 150-years industry practice of convertible bonds, the development of convertible bond pricing theory lags behind. This is not surprising as convertible bonds cannot simply be considered as a combination of equity and bonds but various terms of realistic convertible bonds, such as the possibility of early conversion, the callability by the issuer, and the putability by the holder, making it difficult to value.

Traditional theoretical research on convertible bond pricing can be roughly divided into three categories. The first pricing approach implies finding a closed-form solution to the valuation equation. It was initiated by Ingersoll (1977) who applies the contingent claims approach to the valuation of convertible bonds. Lewis (1991) develops a closed-form solution for convertible bonds that accounts for more complex capital structures. This kind of valuation model was established with the option pricing models of Black and Scholes (1973) and Merton (1973, 1974) and value the convertible bonds based on their firm value or stock value (McConnell and Schwartz 1986) as the underlying state variable. Later, Nyborg (1996) obtained a closed-form solution for most basic convertible bonds, where conversion is only allowed at maturity, while Zhu (2006) presented an analytical solution for the convertible bonds, which can be converted at any time on or before maturity. Although fast in computation, closed-form solutions are not suitable for empirical studies because they fail to account for several real-world specifications. Especially, dividends and coupon payments are often modeled continuously rather than discretely, early-exercise features are omitted, and path-dependent features are excluded. The second pricing approach values convertible bonds numerically, using lattice-based methods. In general, among lattice-based methods, there are finite difference methods (e.g. Brennan and Schwartz 1977; Tsiveriotis and Fernandes 1998; Takahashi etÂ al. 2001; Ayache etÂ al. 2003; Zhu etÂ al. 2018; Lin and Zhu 2020), finite element method (e.g. Barone-Adesi etÂ al. 2003), tree model (e.g. Hung and Wang 2002; Chambers and Lu 2007; Yagi and Sawaki 2010; Ma etÂ al. 2020); some of the lattice-based models provide sophisticated pricing and calibration solutions. Besides, in the face of practical problems related to real convertible-bond specifications and limited data availability, the proposed approaches turn out to be practicable only in very few cases. The third class of convertible bond pricing method uses least-squares Monte Carlo (LSM) simulation (e.g. Buchan 1997; Ammann etÂ al. 2008; Fan etÂ al. 2017; Batten etÂ al. 2018). LSM proposed by Longstaff and Schwartz (2001) is suitable for modeling discrete coupon and dividend payments, including dynamics of the underlying state variables, taking into account path-dependent option features, and overcoming many of the drawbacks of lattice-based methods. The most important input in LSM is the assumption and generation of underlying stock return process, such as the Black-Scholes (BS) model, the constant elasticity of variance (CEV) model, or the GARCH model. However, building on the limitation of explicit mathematical formulations, the models are difficult to recover all unique statistial properties such as the leverage effects, the coarse-fine volatility correlation, and the gain/loss asymmetry of financial time-series (Malmsten and TerÃ¤svirta 2010; Takahashi etÂ al. 2019; Dogariu etÂ al. 2022).

Considering the difficulties of the existing pricing approaches, it is desirable to develop an alternative data-driven path simulator that has a high reproducibility of stylized facts and can be built without many assumptions. Deep learning, especially generative adversarial networks (GAN), proposed by Goodfellow etÂ al. (2014), which bears the potential of being able to model complicated statistical (perhaps unknown) dynamic may provide such a solution. GAN-based methods have already shown spectacular ability in the generation of data including realistic image (Radford etÂ al. 2015; Karras etÂ al. 2021), audio (Donahue etÂ al. 2018), natural language text (Zhang etÂ al. 2016; Garbacea etÂ al. 2019) and have expanded to sequence generation, such as physics (Farimani etÂ al. 2017; Li etÂ al. (2019), music (Yang etÂ al. 2017; Gan etÂ al. 2020), medical time-series (Esteban etÂ al. 2017; Sun etÂ al. 2020), DNA sequences (Killoran etÂ al. 2017; Gupta and Zou 2019) and financial time-series (Takahashi etÂ al. 2019; Wiese etÂ al. 2020). Moreover, the GAN architectures also show the benefits in the financial time-series representation learning, such as stock price prediction (Zhang etÂ al. 2019; Dogariu etÂ al. 2022), disentangle market behaviors from the price movement of stocks (Hadad etÂ al. 2018), and systematic trading strategies (Koshiyama etÂ al. 2021). As it remains the problem of transiting the real (observable) financial time series to its risk-neutral distribution, the GAN-based approaches have not been successfully applied to classical LSM to generate stock return process and solve the problem of derivatives pricing (Wiese etÂ al. 2020).

In this paper, we propose *DeepPricing*, a novel financial time-series generative adversarial networks (*FinGAN*) based convertible bonds pricing model in the framework of LSM to address the challenges above. Our primary contributions are twofold. First, we extend the literature on derivatives pricing, especially convertible bonds pricing, by showing that GAN-based approaches can be successfully applied to the framework of LSM. The model is more flexible and accurate to capture the temporal structures of financial time-series so as to generate the major stylized facts of the underlying stock returns, including the linear unpredictability, the fat-tailed distribution, the volatility clustering, the leverage effects, the coarse-fine volatility correlation, and the gain/loss asymmetry, and then transit to its risk-neutral distribution. Second, a new mechanism, *FinGAN*, is explicitly constructed to generate stochastic underlying stock return process and transit to its risk-neutral distribution. We introduce an encoder network to provide a mapping between feature and neutral space, thereby separating the features of volatility and drift in the adversarial learning space. This capitalizes on the fact that during the risk-neutral measure transformation process, the volatility features have been retained while the drift features have been changed to the risk-free rate (Shreve 2004). Moreover, the supervised loss is minimized by jointly training both the embedding and generator networks, so that the neutral space not only serves to promote parameter efficiency but is also specifically conditioned to facilitate the generator in learning volatility features.

Empirically, we evaluate the capacity of the *DeepPricing* in the Chinese convertible bonds market in two steps. First, the statistical properties of the generated time-series from *FinGAN* and other baseline generators are analyzed and compared with the real financial time-series, and then the convertible bond pricing performances test is conducted across the *DeepPricing* and other baseline convertible bond valuation models. Several important conclusions can be drawn from the empirical results. First, the *FinGAN* model achieves consistent improvements over both traditional and state-of-the-art benchmarks in generating major statistical properties of stock returns, including the linear unpredictability, the fat-tailed distributions, the volatility clustering, the leverage effects, the coarse-fine volatility correlation, and the gain/loss asymmetry. Second, compared with real convertible bonds market prices, the *DeepPricing* model has a better convertible bonds pricing performance than both model-driven models, i.e. BS, CEV \((\beta =1)\), GARCH (1,1) and data-driven models, i.e. FinGAN-MLP, FinGAN-LSTM. Third, we have broken down convertible bonds according to their equity component levels, market states, and moneyness levels to investigate the ability of the models to fit the structure of the convertible bonds market. The empirical results show that the *DeepPricing* model has a better fitting capacity for the equity-liked, trading in the bull market and out-of-the-money convertible bonds. Generally speaking, the results indicate that due to the *FinGAN* generatorâ€™s higher reproducibility of stylized facts, the *DeepPricing* model is more flexible and consistent to capture the volatility and high-dimensional features of the underlying stock process thus outperforming other models in fitting higher volatility convertible bonds and the overall convertible bond market implied volatility smirk. The results also demonstrate the simulation of the dynamic process of the underlying stock return has a significant effect on the efficiency of convertible bond pricing, which is consistent with the empirical conclusion of Batten etÂ al. (2018) in U.S. convertible bond market.

Furthermore, we analyze the factors affecting the pricing of Chinese convertible bonds. Empirical results show that convertible bonds which are higher-liquidity, riskier (longer time to maturity, lower quality credit rating, higher volatility), equity-liked, trading in the bull market and out-of-the-money tend to be more likely to be mispriced. The results are consistent with the convertible bond pricing performance, and further illustrate the importance of accurately characterizing the volatility and high-dimensional features for the pricing of convertible bonds. Besides, considering the Chinese convertible bond market is a weak efficient market, thus, we propose investment strategies based on the *DeepPricing* model. Both *Long-Short Strategy* and *Long-Only Strategy* earn a significant annualized return with 41.16% and 31.06%, respectively, for the equally-weighted portfolio during January 01, 2018 to June 30, 2021, respectively.

The rest of the paper is organized as follows. "Preliminaries" section gives the basic terms and payoff structure of convertible bonds. "The DeepPricing model" section develops the methods used in the paper. "Data and experiment" section provides the descriptive statistics of the Chinese convertible bond market and empirical results across the *DeepPricing* model and the baseline models. "Investment strategies" section introduces the investment strategies based on the *DeepPricing* model. "Conclusion" section concludes.

## Preliminaries

Convertible bonds are corporate debt securities that give the holder the right to forego future coupon and/or principal payments and receive (i.e., convert to) a prespecified number of shares of common stock instead. In principal, a convertible bond is hybrid security consisting of a straight bond and a call on the underlying equity, but various terms of realistic convertible bonds make it impossible to decouple the stock option from the bond part. Furthermore, due to the existence of these special terms, the payoff structure of convertible bonds is really complicated. Thus, before introducing the pricing model, we first give the basic terms of convertible bonds in "Basic terms" section, and then present the payoff structure of convertible bonds in "Payoff structure" section.

### Basic terms

Generally speaking, taking the Chinese convertible bonds as an example, convertible bonds mainly have the following four types option terms:

*Conversion terms*: Investors can execute conversion rights within a certain period of time (\(t\in \Omega _\text {conv}\)). In case of conversion, the investor receives \(\text {n}_t \text {S}_t\), where \(\text {S}_t\) is the underlying stock price at time *t*, \(\text {n}_t= \text {B}_t/\text {K}_t\) is the conversion ratio (the number of stocks available per unit of bonds exchanged), \(\text {K}_t\) is the conversion price, and \(\text {B}_t\) is the face value of the convertible bond.

*Conditional redemption terms*: This term allows the issuer to demand premature redemption in exchange for the redemption price \(\text {C}_t\) applicable at time *t* under certain conditions (usually, if \(\text {S}_t\) is not less than 130% of the \(\text {n}_t\text {S}_t\) for at least 15 trading days in any 30 consecutive trading days and \(t\in \Omega _{\text {call}}\)). The issuer is obliged to announce his/her intention to redemption a certain period in advance, referred to as the call notice period. If the convertible bond is premature redemption, the investor may want to exercise his/her conversion option at any time during the call notice period to receive the conversion value instead of the redemption price.

*Repurchase terms*: This term entitles the investor to force the issuing firm to prematurely repurchase the convertible bonds for a certain predefined price \(\text {P}_t\) under certain conditions (usually, if \(\text {S}_t\) is lower than 70% of the \(\text {n}_t\text {S}_t\) for 30 consecutive trading days and \(t\in \Omega _{\text {put}}\)).

*Conversion price revision terms*: This term entitles the companyâ€™s board of directors the right to propose a downward revision plan for the conversion price from \(\text {K}_t\) to \(\text {K}_t^*\) to avoid trigger repurchase under certain conditions (usually, if \(\text {S}_t\) is lower than 85% of the \(\text {n}_t \text {S}_t\) for at least 10 trading days in any 20 consecutive trading days and \(t \in \Omega _\text {conv}\)).

### Payoff structure

The payoff of a convertible bond depends on whether and when the investor and the issuer decide to exercise their options and trigger the termination of the convertible bond. Let \(\tau ^*\) be the optimal stopping time, i.e. the time at which it is optimal for either the issuer or the investor to terminate the convertible bond.

The resulting action may either be conditional redemption, forced conversion, voluntary conversion, repurchase, regular redemption when the bond matures or default. Formally, the optimal stopping time of the convertible bond is defined as \(\tau *= \min \{t:\psi (\text {X}_t,t)\ne 0\}\), where \(\psi (\text {X}_t,t)\) is the payoff resulting from the convertible bond in state \(\text {X}_t\) at time *t*, given the optimal option-exercise behavior of both investor and issuer.

The alternatives presented in TableÂ 1 stand for all events that will cause the convertible bond to be terminated and reflect boundary conditions that impede arbitrage opportunities. The optimal exercise decision critically depends on the value of continuation \(\text {V}_t\), i.e. the conditional expected value of the convertible bond if it is not exercised immediately.

In addition to the payoff at the time of termination, the investor receives all coupon payments that occurred prior to the time of termination from his/her convertible bond investment. Formally, the function \(\Psi (\text {X}_{\tau ^*}, \tau ^*)\) represents the total payoff from a convertible bond in state \(\text {X}_{\tau ^*}\) and at time \(\tau ^*\):

where \(\psi (\text {X}_{\tau ^*}, \tau ^*)\) is the payoff from the convertible bond at the optimal stopping time \(\tau ^*\) and \(\eta (\tau ^*)\) is the present value at time \(\tau ^*\) of all coupon payments accumulated during the existence of the bond, i.e. before \(\tau ^*\).

## The DeepPricing model

In this section, we propose a generative adversarial networks (GAN) based model called *DeepPricing* to price convertible bonds. As shown in Fig.Â 1, the *DeepPricing* model mainly contains two components. The first component is a novel Financial time-series Generative Adversarial Networks (*FinGAN*). For each convertible bond *i*, we use the *FinGAN* model to generate several underlying stock return processes \(\text {r}^{(k)}_i\), and then transit them to their risk-neutral distribution \(\tilde{\text {r}}^{(k)}_i\). The second component is the least-squares Monte Carlo (LSM) approach, which was proposed by Longstaff and Schwartz (2001). For each generate and transit underlying stock path \(\tilde{\text {S}}^{(k)}_i\), we use the least-squares to estimate the conditional expected payoff \(\Psi (\text {X}^{(k)}_{\tau ^{(k)*}}, \tau ^{(k)*})\) to the optimal exercise strategy of the investor and the issuer from continuation. Then, the price of convertible bond *i* at time *t* in path *k* can be obtained by discounting all future cash flows under risk-neutral measure. Given a risk-neutral probability space \((\Omega , \mathcal {F}, \mathbb {Q})\) and information filtration \({\mathcal {F}_t}\), the value of convertible bond *i* is given by:

where \(\text {V}^{(k)}_{it}\) is the value of the convertible bond *i* at time *t* in path *k*, \(\tau ^{(k)*}\) is the optimal stopping time taking values in the finite set \(\{0,1,\cdots , T\}\), and the function \(\Psi (\text {X}^{(k)}_{\tau ^{(k)*}}, \tau ^{(k)*})\) represents the total payoff from a convertible bond *i* in path *k* defined in Eq.Â 1. The expectation \(r(\text {X}^{(k)}_s,s)\) is the interest rate of the time interval \([s,s+1]\) in state \(\text {X}_s\), and is also applicable for discounting cash flows from time \(s+1\) to *s*. Finally, we average the value of convertible bond along each path (\(\text {V}^{(k)}_{it}\)) as the final value of convertible bond *i* (\(\text {V}_{it}\)).

### Stock dynamic simulation: *FinGAN*

The most important input in convertible bonds valuation is the assumption of the underlying stock process, especially the volatility of the underlying stock returns under risk-neutral probability space (Ammann etÂ al. 2008, Batten etÂ al. 2018). However, modeling financial time-series is a challenging task as it retains various complex statistical properties such as the linear unpredictability, the fat-tailed distributions, the volatility clustering, the leverage effects, the coarse-fine volatility correlation, and the gain/loss asymmetry, it is desirable to develop an alternative approach that has a high reproducibility of stylized facts and can be built without a number of assumptions. In this section, we propose *FinGAN*, a novel deep neural networks based approach which can capture the temporal structures of financial time-series so as to generate major stylized facts of stock process mentioned above, and then transform them into risk-neutral probability space. We first give the network framework of *FinGAN* model in "Network architecture" section. Furthermore, we introduce the derivation process from *FinGAN* to the stock price process in "Theoretical basis" section to strengthen the theoretical interpretation of our model.

#### Network architecture

*FinGAN* consists of four network components: an encoder function, decoder function, Risk-Neutral Networks (RiNN) generator, and discriminator. The key insight is that the auto-encoding components (first two) are trained jointly with the risk-neutral adversarial components (latter two), such that *FinGAN* simultaneously learns to *encode* features, *generate* and *transfer* representations, and *iterate* across time. The encoder-decoder network provides the latent neutral space, the adversarial network operates within this space, and the latent dynamics of both neutral and synthetic data are synchronized through a supervised loss. The detailed architecture is shown in Fig.Â 2 and we describe each in turn^{Footnote 1}.

*Encoder and decoder functions:* The encoder and decoder functions provide mappings between the real sequence space and the latent risk-neutral sequence space, allowing the adversarial generative network to transfer the real stock return process to its risk-neutral distribution. The encoder function extracts the volatility (\(\sigma _t\)), drift (\(\mu _t\)), and innovation (\(\epsilon _t\)) terms from the real log-return series, and then we use Eq.Â 20 to reconstruct the risk-neutral series (\(\tilde{r}_{t}\)). Specifically, \(\sigma _t\) and \(\mu _t\) at time *t* are generated by \(r_{(t-T):(t-1)}\) through the embedding network \(en_h\) for the volatility and drift features, while \(\epsilon _t\) is generated by \(r_{t}\) through the embedding network \(en_{\epsilon }\) for the innovation features. As shown in the Fig. 3, we choose temporal convolutional networks (TCN) as the basic network compositions of the encoder^{Footnote 2}, which is particularly suited for modeling long-range dependencies, allows for parallelization, and guarantees stationarity. The definition of the encoder is given by Def.Â 1,

### Definition 1

*(Encoder)* Let input \(r_{in,t}\) be \(\mathbb {R}^{n}\)-valued, \(en_h: \mathbb {R}^{n \times \text {T}}\times \Theta _{en}^{(h)} \rightarrow \mathbb {R}^{2n}\) be an encoder network with receptive field size \(\text {T}\) and \(en_{\epsilon }:\mathbb {R}^{n}\times \Theta _{en}^{(\epsilon )}\rightarrow \mathbb {R}^{n}\) be a network. Furthermore, \(\alpha \in \Theta _{en}^{(h)}\) and \(\beta \in \Theta _{en}^{(\epsilon )}\) denote some parameters. The log-return neural process \(\tilde{\text {r}}\) in feature spaces is defined by:

where \(\odot\) denotes the Hadamard product and

Notably, the encoder network separate the log-return process into volatility and drift features and transit them to risk-neutral distribution. The neural process \(\sigma _{t,\alpha }:=(\sigma _{t,\alpha })_{t\in \mathbb {R}}\), \(\mu _{t,\alpha }:= (\mu _{t,\alpha })_{t\in \mathbb {R}}\), \(\text {r}_{\!f (t,\alpha )}:= (\text {r}_{\!f (t,\alpha )})_{t\in \mathbb {R}}\) and \(\epsilon _{t,\beta }:=(\epsilon _{t,\beta })_{t\in \mathbb {R}}\) and are called volatility, drift, risk-neutral and innovation neural process, respectively.

In the opposite direction, the decoder function transforms the risk-neutral series (\(\tilde{r}_{t}\)) back to the estimated original market data (\(\hat{r}_t\)). Notice that, just as the long-range dependencies of the encoder is emphasized before, the network of decoder needs to satisfy such properties as well. Thus, we also implement decoder by TCNs. The definition of the decoder is given by Def.Â 2,

### Definition 2

*(Decoder)* Let \(l \in \mathbb {N}\), and \(de: \mathbb {R}^{n \times l} \times \Theta _{de} \rightarrow \mathbb {R}^{n \times l}\) be a network with parameter space \(\Theta _{de}\). The time-series \(\hat{r}\) out feature spaces is defined by

where \(\theta \in \Theta _{de}\), and the network *de* is the decoder function. The decoder function takes risk-neutral process \(\tilde{r}\) back to estimated realistic time-series \(\hat{r}\).

*Generator and discriminator:* In the adversarial modeling framework two agents, the generator and the discriminator, are contesting with each other in a game-theoretic zero-sum game. Roughly speaking, the generator aims at generating samples such that the discriminator cannot distinguish whether the realizations were sampled from the target or the generator distribution. Different from other GAN-based models, the RiNN generator first outputs \(\hat{\sigma }_{t,\gamma }\odot \hat{\epsilon }_{t,\eta }\) into the latent risk-neutral space instead of producing synthetic sequence directly. To ensure that the produced sequences retain causal ordering (i.e. output at each step can only depend on preceding information), we build the generator with the same structure as the encoder. Here we give the definition of the RiNN generator,

### Definition 3

*(RiNN Generator)* Let \(Z= (Z_t)_{t\in \mathbb {Z}}\) be \(\mathbb {R}^{m}\)-valued *i*.*i*.*d*. Gaussian noise, \(g_h: \mathbb {R}^{m\times T} \times \Theta _g^{(h)} \rightarrow \mathbb {R}^{2n}\) and \(g_{\epsilon }:\mathbb {R}^{m} \times \Theta _g^{(\epsilon )} \rightarrow \mathbb {R}^{n}\) be networks. Furthermore, let \(\gamma \in \Theta _g^{(h)}\) and \(\eta \in \Theta _g^{(\epsilon )}\) denote some parameters. A stochastic process \(\hat{\tilde{r}}\), defined by

and

is called log-return risk-neutral process. The generator architecture defining the log-return risk-neutral process is called RiNN. Consistently, the neural process \(\hat{\sigma }_{t,\gamma }:=(\hat{\sigma }_{t,\gamma })_{t\in \mathbb {Z}}\), \(\hat{\mu }_{t,\gamma }:= (\hat{\mu }_{t,\gamma })_{t\in \mathbb {Z}}\), \(\hat{r}_{f (t,\gamma )}:= (\hat{r}_{f (t,\gamma )})_{t\in \mathbb {Z}}\) and \(\hat{\epsilon }_{t,\eta }:=(\hat{\epsilon }_{t,\eta })_{t\in \mathbb {Z}}\) are called volatility, drift, risk-neutral and innovation neural process, respectively.

Notice that \(Z_{{(t-T)}:(t-1)}\) can be sampled from a distribution of choice, and \(Z_t\) follows a stochastic process, here we assume that \(Z_t\) is a Wiener process. Finally, the discriminator function *d* determine whether the input data is from the risk-neutral space extracted by the encoder (\(\tilde{r}_{t}\)) or synthesized by the RiNN generator (\(\hat{\tilde{r}}\)). Here is our definition of the discriminator,

### Definition 4

*(Discriminator)* Let \(\widetilde{d}\): \(\mathbb {R}^{n}\times \Theta _d \rightarrow \mathbb {R}\) be a network with parameters \(\xi \in \Theta _d\) and \(\sigma :\mathbb {R}\rightarrow [0, 1]\) defined by \(x \mapsto 1/(1+e^{-x})\) be the sigmoid function. A function \(d: \mathbb {R}^{n}\times \Theta _d \rightarrow [0,1]\) defined by \(d:(r, \xi ) \mapsto \sigma \circ \widetilde{d}_\xi (r)\) is called a discriminator.

*Jointly learning to encode, generate, transfer and iterate:Â *First, purely as a reversible mapping between real log-return space and neutral spaces, the encoder and decoder functions should be able to extract neutral representations \(\tilde{r}\) from the real log-return sequence *r* and accurately reconstruct \(\hat{r}\) of the real market sequence from their neutral representations. Therefore, our first objective function is the reconstruction loss \(\mathcal {L}_{R}\) defined by,

In *FinGAN*, the discriminator receives input from the risk-neutral sequence \(\tilde{r}\) extracted by encoder and the risk-neutral sequence \(\hat{\tilde{r}}\) synthesized by generator. First, the discriminator acts as a binary classifier, assigning a probability to each sample as a realization from the real risk-neutral distribution. This is as one would expect-that is, to maximize the likelihood that the discriminator will label \(\tilde{r}\) as training data samples and \(\hat{\tilde{r}}\) as generation samples,

However, relying solely on the discriminatorâ€™s binary adversarial feedback may not be enough to motivate the generator to capture the distribution of risk-neutral sequences. To achieve this more efficiently, we propose an additional loss function \(\mathcal {L}_S\) based on the volatility and innovation terms. The stochastic gradient can now be calculated on the loss of capturing the difference between \(\sigma _t \odot \epsilon _t\) and \(\tilde{\sigma _t} \odot \tilde{\epsilon _t}\) in the distributions, allowing the generator to improve its synthesis capabilities. Thus our third objective function is the supervised loss,

where \(D_{\text {KL}}\) is the Kullback-Leibler divergence.

In sum, at any step in a training sequence, we assess the difference between the actual next-step risk-neutral latent vector (from the encoder function) and synthetic next-step risk-neutral latent vector (from the RiNN generator). While \(\mathcal {L}_{D}\) pushes the RiNN generator to create risk-neutral sequences (evaluated by an imperfect adversary), \(\mathcal {L}_{S}\) further ensures that it produces similar stepwise transitions (evaluated by ground-truth targets).

*Optimization:* The overall training objective is a min-max game played among the encoder, decoder, generator and discriminator. The first two components are trained on both the reconstruction and supervised losses,

where \(\lambda \ge 0\) is a hyperparameter that balances the two losses.

Next, as *FinGAN* receives an error signal from both \(\mathcal {L}_D\) and \(\mathcal {L}_S\), we use another parameter \(\gamma\) to weight the ability to reconstruct vs. fooling the discriminator. That is, in addition to seeking a balancing point in the binary game of generator and discriminator, the generator will follow the encoderâ€™s style. Rather than applying \(\gamma\) to the entire model, we perform the weighting only when updating the parameter of the generator,

where \(\gamma \ge 0\). Therefore, the generator and discriminator networks are trained adversarially as follows,

Notably, \(\mathcal {L}_D\) is the determinant of how effectively the *FinGAN* is trained. If we consider \(\mathcal {L}_D\) as a convex function of \(\theta _g\), then \(\sup _D \mathcal {L}_D(\theta _g)\) has a unique global optima. Consequently with sufficiently small updates of \(\theta _g\), and \(\theta _g\) converges to optima. This is equivalent to computing a gradient descent update for \(\theta _g\) at the optimal discriminator given the corresponding generator. PseudocodeÂ 1 show the pseudocode of *FinGAN*. We use Adam as the optimizer with learning rate of 1e-5 with \(\lambda = 0.1\) and \(\gamma = 0.1\). More architecture details can be found in Appendix.

#### Theoretical basis

Considering a one-dimensional log-return neural process, where the innovation neural process is constrained to represent a standard normal distributed random variable.

where, \(\epsilon _{t,\theta } \sim \mathcal {N}(0,1)\) for all \(t\in \mathbb {Z}\).

Then, the underlying stock prices are defined recursively by Eq.Â 15,

where \(\text {S}_{0,\theta }=\text {S}_{0}\) denotes the current price of the underlying stock.

Moreover, we assume a constant interest rate \(\text {r}_{\!f}\) and denote the discounted stock price \((\text {S}^{(d)}_{t,\theta }) _{t\in \mathbb {N}}\) in Eq.Â 16,

The discounted asset price is given in Eq.Â 17,

As we cannot value options under a log-return neural process, but need to convert it to its risk-neutral distribution. Given a risk-neutral probability space \((\Omega , \mathcal {F}, \mathbb {Q})\) and information filtration \(\{\mathcal {F}_t\}\), the discounted stock price process is a martingale. Therefore, we have,

We denote the conditional expectation given in Eq.Â 18 by \(h(\sigma _{t,\theta },\mu _{t,\theta }):=\text {E}^\mathbb {Q}[\text {exp}(\sigma _{t,\theta }\epsilon _{t,\theta }+\mu _{t,\theta })|\mathcal {F}_{t-1}]\). As the \(\mathcal {F}_{t-1}\)-measurable volatility and drift neural process, \(\epsilon _{t,\theta } \sim \mathcal {N}(0,1)\) and the independent of \(\mathcal {F}_{t-1}\), \(h(\sigma _{t,\theta },\mu _{t,\theta })\) can be calculated explicitly:

Now, we denote the risk-neutral log return neural process by \(\widetilde{\text {r}}_{t,\theta }:=\text {r}_{t,\theta }-\text {ln}(h(\sigma _{t,\theta },\mu _{t,\theta }))+\text {r}_{\!f}\), and the risk-neutral log return neural process is given by:

and the discounted risk-neutral price process is given by:

in particular, the discounted risk-neutral spot price process is given by:

### Interest rate

The interest rate in this study is assumed as a constant \(\text {r}_{\!f}\), and all interest rate data employed is obtained from the China Central Depository and Clearing Co., Ltd.. The time-series of the risk-free interest rates are extracted from the Chinese treasury bond and cover maturities from 3 months to 10 years on a daily basis. We obtain through Hermite interpolation the complete term structure of spot rates at any time.

### Credit risk

We account for credit risk in the spirit of Tsiveriotis and Fernandes (1998) and discount the cash flows subject to credit risk with the appropriate interest rate. We calibrate the probability of default from the spread between the risk-free interest rate and the yield of company bond. If the issuerâ€™s bond yield is not available, we can use the yield for the company bond with the same rating. Coupon payments, the redemption payment \(\kappa \text {N}\), the call price in the event of a conditional redemption \(\text {C}_t\) and the put price in the event of a repurchase \(\text {P}_t\) are subject to credit risk. The stock price \(\text {S}_t\), on the other hand, is not and should therefore be discounted with the risk-free interest rate.

## Data and experiment

In this section, we empirically evaluate our *DeepPricing* model by the data in the Chinese convertible bond market. We examine the Chinese convertible bond market for three main reasons. Firstly, China has the fastest growing convertible bond market in the past decade, the number of convertible issuance has increased from 8 in 2010 to 186 in 2020 and the scale has increased from 71,730.00 CNY million in 2010 to 230,064.20 CNY million in 2020. Secondly, some special terms of the Chinese convertible bond, such as the *Conditional Redemption Term*, *Conversion Price Revision Terms*, cause huge difficulty in the valuation, thus, there is not much literature focusing on the Chinese convertible bond market. Thirdly, due to the unique trading constraints such as price-limit rules and short-sale restrictions in the Chinese stock market, any assumptions about the distribution of the underlying stock return process cannot completely conform to its real return distributions, thus the alternative data-driven path simulator is much more needed.

This section is divided into five parts. "Data description" section gives a brief introduction of the descriptive statistics of convertible bond data. "Baseline methods" section introduces several baseline models of generating dynamic underlying stock return process. "Comparisons with different stock return generators" section reviews major statistical properties of the underlying stock return process and compares the statistical properties of the generated underlying stock return process between the *FinGAN* and other baseline generation models. "Model performance and Robustness test" section provides the main empirical results and robustness tests on convertible bonds pricing performances across *DeepPricing* and baseline valuation models. "Analysis of mispricing" section further analyzes the potential influencing factors on the mispricing of convertible bonds.

### Data description

For the empirical investigations, we obtain daily returns and the basic terms for all Chinese convertible bonds listed on the Shanghai and Shenzhen stock exchanges from the Wind Database. Besides, to ensure the consistency and reliability of the data, we use Bloomberg Database for cross-validation. Within this sample, those that were non-publicly raised and lacked an active underlying common stock were excluded from the sample. Based on these criteria, our data sample covers a total of 579 convertible bonds and 125,306 observations from January 01, 2010 to June 30, 2021.

To provide additional details, the sample is divided into several categories according to either the equity component levels or market states. Following Burlacu (2000), the equity component level is classified by \(\Delta\) (defined in Eq.Â 23), the sensitivity of the convertible bond value to its underlying common stock. And the debt-liked convertible bonds, the balanced convertible bonds, and the equity-liked convertible bonds is determined by its belongingness to the intervals [0, 0.33]; [0.33, 0.66]; and [0.66, 1], respectively.

where \(\text {S}_t\) is the current price of the underlying stock, \(\text {K}_t\) is the conversion price, \(\text {r}_{\!f}\) is the risk-free rate estimated from Chinese treasury bonds on the issue date, \(\sigma\) is the standard deviation of the continuously compounded underlying stock returns, \(\tau\) is the number of years to maturity, \(\delta\) is the continuously compounded dividend yield, and *N*(.) is the cumulative probability under a standard normal distribution function. The market states is classified by the bull market (including subperiod from January 01, 2014 to June 12, 2015 and subperiod from January 01, 2019 to June 30, 2021), the bear market (including subperiod from January 01, 2010 to December 31, 2013; subperiod from June 15, 2015 to January 29, 2016; subperiod from January 01, 2018 to December 31, 2018) and the direction-less market (including subperiod from February 01, 2016 to December 31, 2017).

TableÂ 2 reports some key summary statistics for the sample. Panel A summarizes the statistics of the whole sample, Panels B.1, B.2, B.3 provides the information on the subsamples by the equity component levels of convertible bonds, and Panels C.1, C.2, C.3 provides the information on the subsamples across different convertible bond market states.

Panel A of TableÂ 2 shows that the mean maturity of all convertible bonds at issuance is 5.93 years with 6-years being the longest maturity. Industry, material, and information technology accounted for 145 (25.04%), 129 (22.28%), and 89 (15.37%) of the bonds, respectively. Consumer discretionary, healthcare, and consumer stables accounted for 70 (12.09%), 42 (7.25%), and 37 (6.39%) bonds. Financial corporations accounted for 34 (5.87%) of the bonds, and the remaining 34 bonds were issued by firms from other industries including public utilities (20 or 3.45%), energy (7 or 1.21%), and real estate (6 or 1.04%)^{Footnote 3}. The highest credit rating in our sample of bonds was AAA (15.13% of 125,306 observations) and the lowest was A (0.24% of 125,306 observations). Notice that contrary to the US convertible market^{Footnote 4}, due to the strict issuance conditions, there is almost no credit risk in the Chinese convertible bond market. The average convertible bond can be converted at a conversion price of 14.70 CNY per share and a conversion premium^{Footnote 5} of 26.38%. The mean total issuance is approximately 259,263.94 CNY million. The daily average trading amount and the turnover rate is 103.07 CNY million and 7.66%, respectively, however, the standard deviation of daily average amount and the turnover rate is 153.90 CNY million and 15.23%, which indicate that the liquidity of individual bonds in the Chinese convertible bond market is very different. Panels B.1, B.2 and B.3 show that the equity-liked convertible bonds have relatively lower conversion premium ( Brown etÂ al. 2012), higher underlying stock volatility, higher liquidity (higher daily average amount and turnover rate). Whereas the debt-liked convertible bonds exactly show the opposite features. Moreover, Panels C.1, C.2 and C.3 show that the average conversion premium in the bull market is lower than bear market, which indicates that the convertible bond is more equity-liked in the bull market. Meanwhile, convertible bonds in the bull market are more liquid than in the bear market.

### Baseline methods

As we mentioned in "The DeepPricing model" section, the most important input in convertible bonds valuation framework is the simulation of underlying stock return process. In this section, we first introduce three commonly used traditional models to generate underlying stock return dynamics, i.e., the Black-Scholes (BS) model, the constant elasticity of variance (CEV) model, the GARCH model. In addition, we also consider two state-of-the-art networks, i.e. multilayer perceptron (MLP) and LSTM, to solve the time-series simulation problem. For fair comparison, we use the same architecture as the *FinGAN* model shown in Fig. 2 and the MLP and LSTM are only used to instead TCN networks.

*BS model* BS model assumes the underlying stock price \(\text {S}_t\) follows the geometric Brownian motion in Eq.Â 24. The volatility is constant and are independent of time and the current \(\text {S}_t\).

The volatility \(\sigma\) is measured by the standard deviation defined in Eq.Â 25, estimated on a historical basis, using the time-series data of the underlying stock. The volatility for each convertible bond is calculated using daily individual stock returns for 20 trading days prior to the first real-time trade data reported to WIND and is assumed to be constant.

*CEV model* CEV model extends the BS model to include the observed inverse dependence of volatility and implied volatility skew (Christie 1982; Cox 1996). The CEV model assumes the underlying stock price \(\text {S}_t\) take the following form:

The value of \(\beta\) is estimated via the following equation:

where \(\nu = \ln \sigma\) and \(\kappa = \frac{\beta -2}{2}\). The \(\beta\) for each convertible bondâ€™s issuer is estimated from the daily stock returns of 20 trading days prior to the first real-time trade price reported to WIND, which is similar to the estimation of volatility discussed earlier.

*GARCH (1,1) model* GARCH (1,1) model is the simplest way to extend the BS modelâ€™s constant volatility assumption to GARCH (1,1), in order to capture the volatility patterns present in the data, in particular volatility clustering ( Bollerslev (1986)). Following Bollerslev (1986) and Duan (1995), the conditional variance of the GARCH (1,1) evolves as

where \(\alpha _0>0\), \(\alpha _1\ge 0\), \(\beta _1\ge 0\) and the \(\epsilon _t\) are the return residuals, in which \(\epsilon _t = \sigma _t \text {Z}_t\) with \(\text {Z}_t \sim \text {N}(0,1)\). \(\sigma\) is estimated from the daily underlying stock returns of 20 trading days prior to the first real-time trade price reported to WIND.

*FinGAN-MLP model* In the FinGAN-MLP model, we use the MLP networks to replace the original TCN networks in *FinGAN*. For a neuron in the MLP, the output \(o_i\) is defined by Eq.Â 29:

where *d* is the length of the input \(r_t\), \(r_i\) is the single instance of the input vector, and \(b_j\) and \(w_{ij}\) are the bias and weights associated with each \(r_i\).

*FinGAN-LSTM model* In the FinGAN-LSTM model, we use the LSTM networks to replace the original TCN networks in *FinGAN*. At each time step, an LSTM maintains a hidden vector *h* and a memory vector *c* responsible for controlling state updates and outputs. More concretely, the operations performed by an LSTM unit at time step *t* as follows:

where \(r_t\) denotes the input, \(W_*\) and \(U_*\) are weight matrices, \(b_*\) are the vectors of bias term, \(\sigma\) is the sigmoid function, and the operator \(\odot\) denotes component-wise multiplication. Finally, the output of the memory cell is calculated by

For more details on LSTM processes, see Zhang etÂ al. (2019).

### Comparisons with different stock return generators

In this section, we take *Sun Paper* (stock code: 002078.SZ), the underlying stock of the *Sun Convertible Bond* (convertible bond code: 128029.SZ), as an example to compare the ability of different generation models to characterize the main statistical properties of the underlying stock returns process. First, stylized facts of real financial time-series (Cont 2001; Chakraborti etÂ al. 2011; Takahashi etÂ al. 2019) are reviewed to assess the quality of generated data. Second, the reproducibility of statistical properties of different generators is analyzed and compared with real financial time-series. Finally, the robustness of training process of GAN-based models is reported.

#### Statistical properties of financial time-series

The main statistical properties of real stock returns including the linear unpredictability, the fat-tailed distribution, the volatility clustering, the leverage effects, the coarse-fine volatility correlation, and the gain/loss asymmetry (in the first line of Fig.Â 4 and TableÂ 3), we give a short introduction of each in turn.

*Linear unpredictability (LU)* linear unpredictability is quantified by the diminishing autocorrelation function of stock returns, defined in Eq.Â 36. Empirically, there is no autocorrelation in daily frequency return series (Chakraborti etÂ al. (2011)).

where \(\bar{\text {r}}\) and \(\sigma\) are the mean and the standard deviation of the stock return. FigureÂ 4(i-a) shows the decay of the autocorrelation of the *Sun Paper* in daily scale.

*Fat-tailed distribution (FTD)* fat-tailed distribution is characterized by a higher probability density of outliers than the normal distribution. The probability of distribution \(\text {Pr}(\text {r}_t)\) consistently has a power-law decay in the tails defined in Eq.Â 37 (Liu etÂ al. 1999). Empirically, for normal distribution the power law asymptotic exponent \(\alpha \ge 5\), and real stock return distribution range \(3\le \alpha < 5\). The positive-tail (\(\alpha =3.4924\)) of the *Sun Paper* of 3, 556 data points in Fig. 4(i-b) is observed.

*Volatility clustering (VC)* volatility clustering refers to the fact that the large/small stock return fluctuations tend to cluster together temporally, which indicates the presence of the long-range temporal dependence in financial time-series. Quantitatively, volatility clustering is characterized with the power law decay of the autocorrelation function of the absolute stock returns, defined in Eq.Â 38 (Cont 2007). The slow and power decay up to \(k\approx 10^2\) (\(\beta =1.1194\)) of the *Sun Paper* of 3,556 data points in Fig.Â 4(i-c) is observed.

*Leverage effect (LE)* Leverage effects refer to the tendency that the past stock return has a correlation with future volatility^{Footnote 6}. Quantitatively, leverage effect is characterized with the lead-lag correlation function defined in Eq.Â 39 (Bouchaud etÂ al. 2001; Qiu etÂ al. 2006). In the case of the *Sun Paper*, the positive correlation is found for \(1<k<10\) as shown in Fig.Â 4(i-d).

*Coarse-fine volatility correlation (CFVC)* Coarse-fine volatility correlation refer to fine volatility has the power of predicting coarse volatility. Quantitatively, coarse-fine volatility correlation is characterized with the negative asymmetry of the lead-lag correlation defined in Eq.Â 40 (MÃ¼ller etÂ al. 1997; Gavrishchaka and Ganguli (2003).

where the lead-lag correlation of two different time scales volatility \(\rho _ cf ^\tau (k)\) is defined in Eq.Â 41,

\(v_c^\tau (t)= |\sum _{i=1}^{\tau }r_{t-i}|\) is coarse volatility and \(v_f^\tau (t)= \sum _{i=1}^{\tau }|r_{t-i}|\) is fine volatility.

The negative \(\Delta \rho _ cf ^\tau (k)\) indicates that fine volatility has the power of predicting coarse volatility. FigureÂ 4(i-e) shows the coarse-fine volatility correlation \(\rho _ cf ^\tau (k)\) in blue points and the lead-lag correlation asymmetry \(\Delta \rho _ cf ^\tau (k)\) in orange points. The asymmetry is present in the *Sun Paper* as the value deviates from the zero level indicated by the black dashed line.

*Gain/loss asymmetry (GLA)* Gain/loss asymmetry refers to the speed of the stock price fall is faster than the stock price rise. Quantitatively, gain/loss asymmetry is characterized with the probability distribution of \(T_\text {wait}^t(\theta )\) relating to the speed of stock price movement to reach the certain positive and negative change \(\pm \theta\), defined in Eq.Â 42 (Jensen etÂ al. 2003).

FigureÂ 4(i-f) shows the probability distribution of \(T_\text {wait}^t(\theta =0.1)\) in red and \(T_\text {wait}^t(\theta =-0.1)\) in blue for the *Sun Paper*. The peak of the positive returns comes before the peak of negative returns, indicating the presence of asymmetry in price up/down.

#### Experiment results

The reproducibility of statistical properties with different models are summarized in Fig. 4 and TableÂ 3^{Footnote 7}. The empirical results show that the time-series generated by the *FinGAN* model satisfies all six major statistical properties of the *Sun Paper* return series. In comparison, the traditional generator such as BS, CEV, GARCH outputs time-series that satisfies the stylized facts of the fat-tailed distribution, however, does not successfully reproduce the the leverage effect, the asymmetry in coarse-fine volatility correlation and the gain/loss asymmetry. While for more state-of-art approaches such as FinGAN-MLP and FinGAN-LSTM generators outputs time-series also satisfy major statistical properties, however, the *FinGAN* model shows more high-quality synthetic data in comparison to real data set based on both visualization (in Fig.Â 4) and the parameter scope (in TableÂ 3).

#### Robustness of training

It was reported that GANs are difficult to converge during the training process (Salimans etÂ al. 2016), in this section, the robustness of different GAN-based models are tested. Figure 5 shows the loss of the generator and the discriminator in FinGAN-MLP, FinGAN-LSTM and *FinGAN*, respectively. With the same optimizer and learning rate, the *FinGAN* model converges faster than others and converges toward a minimum as the network trains. In comparison, the loss of FinGAN-MLP and FinGAN-LSTM models do not show a stable trend as iterations increase.

### Model performance and Robustness test

We conduct empirical studies to discuss the convertible bonds pricing results produced by *DeepPricing* and other baseline valuation models. In order to be more comparable with baseline models, we use the uniform pricing framework as the *DeepPricing* model. The main difference among the valuation models are the generation models of the underlying stock returns. Daily model prices are compared against daily convertible bond market prices to determine whether there is fair pricing, overpricing or underpricing. Then, the results are pooled to determine the average mispricing for the sample using the mean absolute percentage error (MAPE) defined in Eq.Â 43:

where *N* denote the number of convertible bonds in the daily sample, \(V_i^{\text {Mkt}}\) is the closed prices of convertible bonds; and \(V_i^{\text {Model}}\) is the model determined prices for the given model.

Moreover, to measure the extent to which one model is better or worse than another, we compute pricing differences between two models. Let \(\Delta \text {MAPE}_{i|j}\) denote the pricing difference of a model *i* over a model *j*. \(\Delta \text {MAPE}_{i|j}\)is defined in Eq.Â 44:

where \(\text {MAPE}_i\) and \(\text {MAPE}_j\) denote the MAPE implied by models *i* and *j*, respectively. A negative (positive) value of \(\Delta \text {MAPE}_{i|j}\) means that model *i* yields lower (higher) pricing errors than model *j*, implying that the pricing performance of the former is better (worse) than that of the latter by a percentage of that value.

Besides, the mean absolute error (MAE) defined in Eq.Â 45 and \(\Delta \text {MAE}_{i|j}\) defined in Eq.Â 46 are used for robustness test.

TablesÂ 4 andÂ 5 detail MAPE and MAE pricing performance and improvements across different models respectively. The main results for all convertible bonds are offered in the first line, moreover, the convertible bonds are respectively broken down by equity component levels, market states and moneyness levels in lines 2-5, lines 6-8 and lines 9-11.

Several important conclusions can be drawn from the MAPE performance metrics^{Footnote 8}^{Footnote 9}. First, for all convertible bonds in the sample, the *DeepPricing* model has a MAPE of 0.0721. Based on \(\Delta \text {MAE}_{i|\text {DeepPricing}}\) metrics, the *DeepPricing* model has a better pricing performance than both model-driven models, i.e. BS, CEV (\(\beta\)=1), GARCH (1,1) and data-driven models, i.e. FinGAN-MLP, FinGAN-LSTM. Secondly, we have broken down convertible bondsâ€™ pricing error according to their equity component levels and market states to investigate the ability of the models to fit the structure of the convertible bonds market. Generally speaking, the equity-liked convertible bonds are more likely to be mispriced for all models, as their underlying stock return processes are more volatile and difficult to capture stylized facts. Compared with BS, CEV (\(\beta\)=1), GARCH (1,1), FinGAN-MLP and FinGAN-LSTM models, *DeepPricing* model much better improves the pricing of equity-liked convertible bonds by 26.51%, 23.90%, 17.71%, 12.15%, and 6.86%; and for bull market by 33.97%, 29.33%, 20.51%, 15.83%, and 6.88%. But for lower underlying stock volatility conditions, such as bond-liked convertible bonds or direction-less market, the results are only by 19.14%, 14.24%, 9.56%, 8.37%, and 4.73%; and 14.56%, 13.45%, 8.42%, 4.92%, and 2.04%. The results give evidence that the better fitting performance of our model stems from the improved modeling of the volatility stylized facts of the underlying stock return process. Lastly, we have broken down convertible bonds pricing error according to moneyness levels^{Footnote 10}. Compared with other baseline models, the *DeepPricing* model better improves the pricing of out-of-the-money convertible bonds by 16.99%, 12.82%, 10.50%, 7.54%, and 4.50%, the evidence indicates that the *DeepPricing* model is more flexible to capture the high-dimensional features of the stock return process, outperforming other models in fitting the convertible bond market implied volatility smirk, especially for out-of-the-money convertible bonds. As the main difference between the *DeepPricing* model and other baseline models is the generation methods of the underlying stock return process, i.e. the *FinGAN* model reproduces more accurately stock return series than other generation models, the results also demonstrate the simulation of the dynamic process of the underlying stock return has a better effect on the efficiency of convertible bond pricing, which is consistent with the empirical conclusion of Batten etÂ al. (2018) in USA convertible bond market.

### Analysis of mispricing

As the overall pricing efficiency of the Chinese convertible bond market is low, we further analyze the factors affecting the pricing of convertible bonds. The empirical results is provided in TableÂ 6. The dependent variable is the average *DeepPricing* model mispricing degree using the MAPE defined in Eq.Â 43. The potential influence factors including time to maturity (\(maturity\)), credit spread (\(credit\)), liquidity, underlying stock volatility (\(volatility\)), the equity component levels, market state, moneyness levels and industry. Notice that we employ two proxies to measure the liquidity of convertible bonds: \(amount\) (the daily average trading amount) and \(turnover\) (daily average turnover).

TableÂ 6 reports the regression analysis. A positive coefficient is observed between \(amount\) and \(turnover\), in which convertible bonds with higher liquidity are more likely to be mispriced. This is likely due to there are large number of sentiment-driven investors in Chinese capital market, especially in even less mature convertible bond market (Zhou etÂ al. 2013), Tan etÂ al. 2021). Convertible bonds with higher liquidity are much easier to be hyped by speculative traders, thus, cause market prices deviate from their fundamentals values (Keynes 2018).

Consistently, riskier convertible bonds are more likely to be mispriced as indicated by positive coefficient with \(maturity\), \(credit\) and \(volatility\). Convertible bonds with a longer time to maturity, higher rating code (lower quality credit rating), and higher volatility are perceived to be riskier by the market and are expected to be mispriced (Batten etÂ al. 2018).

Moreover, the positive sign of \(Dequity\), \(Dbull\) and \(Dotm\) indicates that equity-liked, trading in bull market and out-of-the-money convertible bonds tend to be more likely to be mispriced. The results is consistent with the empirical results presented in TablesÂ 4 andÂ 5, and this further illustrates the importance of accurately characterizing the volatility and high-dimensional features of the underlying stock return process for the pricing of convertible bonds.

## Investment strategies

In this section, we introduce the investment strategies based on *DeepPricing* model. As the regression results in "Analysis of mispricing" section indicate that the Chinese convertible bond market is a weakly efficient market, with the market-wide variations in investor sentiment, the convertible bond prices deviate from their fundamental values temporarily. Therefore, the arbitrageurs will benefit from the arbitrage strategy (*Long-Only Strategy*) - to long underestimated convertible bonds and short overestimated convertible bonds (Keynes 2018). Moreover, as there exits short-sale constraints in Chinese market, we also propose the *Long-Only Strategy* for more practical application. We first introduce the financial concepts used in the process of constructing the strategies, and then the *Long-Short Strategy* and *Long-Only Strategy* are formally proposed. Finally, the evaluation measures have been presented and we show the performance of the strategies in Chinese convertible bond market.

### Financial concepts

Following Wang etÂ al. (2019), we introduce some basic financial concepts before proposing investment strategies.

### Definition 5

*(Holding period)* A holding period is a minimum time unit to invest a convertible bond. In this work, we divide the time axis as sequential holding periods with fixed length - one day. We call the starting time of the *t*-th holding period as the time *t*.

### Definition 6

*(Long (Short) position)* The long (short) position is the trading operation that buys (sells) a convertible bond at time \(t_1\) first and then sells (buys) it at \(t_2\). The profit of a long position during the period from \(t_1\) to \(t_2\) for convertible bond *i* is \(v_i(p^{(i)}_{t_2}-p^{(i)}_{t_1})\), while the profit of a short position is \(v_i(p^{(i)}_{t_1}-p^{(i)}_{t_2})\), where \(v_i\) is the buying (selling) volume of convertible bond *i* and \(p^{(i)}\) is the price of convertible bond *i* at time *t*.

### Definition 7

*(Investment portfolio)* Given a convertible bond pool with *I* convertible bonds, a portfolio is defined as a vector \(\varvec{c} = ( c^{(1)},..., c^{(i)},..., c^{(I)})^{\top }\), where \(c^{(i)}\) is the proportion of the investment on convertible bond *i*, with \(\sum _{i=1}^{I}c^{(i)}=1\).

### Definition 8

*(Zero-investment portfolio)* A zero-investment portfolio is a collection of convertible bonds portfolios that has a net total investment of zero when the portfolios are assembled. Assume we have a collection of convertible bonds portfolios \(\{\varvec{c}^{(1)},..., \varvec{c}^{(j)},..., \varvec{c}^{(J)}\}\). The investment on portfolio \(\varvec{c}^{(j)}\) is \(M^{(j)}\), with \(M^{(j)}\ge 0\) when taking a long position on and \(M^{(j)}\le 0\) when taking a short position. Then, for a zero-investment portfolio containing *J* portfolios, the total investment \(\sum _{j=1}^{J}M^{(j)}=0\).

### Investment strategies

#### Long-short strategy

We execute *long-short strategy* as a zero-investment portfolio consisting of two portfolios: a long portfolio for underestimated convertible bonds and a short portfolio for overestimated convertible bonds. Given a sequential investment with *T* periods, we denote the short portfolio for the *t*-th period as \(\varvec{c}^-_t\) and the long portfolio as \(\varvec{c}^+_t, t=1,...,T\).

At time *t*, we first rank the convertible bonds in ascending order in accordance with the mispricing for the sample using the percentage error (PE) based on the *DeepPricing* model price defined in Eq.Â 47 and partition them into deciles.

where \(V_{it}^{\text {Mkt}}\) is the closed prices of convertible bond *i* at time *t*, and \(V_{it}^{\text {Model}}\) is the prices determined by *DeepPricing* model. Notice that \(\text {PE}_{it}>0\) means the convertible bond *i* is underestimated at time *t*, while \(\text {PE}_{it}<0\) means the convertible bond *i* is overestimated at time *t*.

Then, given a budget constraint \(\bar{M}\), we short the convertible bonds ranked in bottom decile in according to the equal or value weighted investment proportion in \(\varvec{c}^-_t\) from brokers. The volume of convertible bond *i* that we can short is

where \(c^{-(i)}_t\) is the proportion of convertible bond *i* in \(\varvec{c}^-_t\). After that, we use \(\bar{M}\) to long the convertible bonds ranked in top decile in according to the equal or value weighted investment proportion in \(\varvec{c}^+_t\). The volume of convertible bond *i* that we can long at time *t* is

The money \(\bar{M}\) we used to long stocks is the proceeds of short selling, so the net investment on the portfolio \(\{\varvec{c}^+_t, \varvec{c}^-_t\}\) is zero.

Finally, at the end of the *t*-th holding period, we sell convertible bonds in the long portfolio. The money we can get is the proceeds of selling convertible bonds using new prices at \(t+1\) for all convertible bonds, i.e.,

Also, we buy the convertible bonds in the short portfolio back and return them to the broker. The money we spend on buying the short convertible bonds is

The ensemble profit earned by the long and short portfolios is \(M_t = M^+_t -M^-_t\). Let \(z^{(i)}_t = p^{(i)}_{t+1}/ p^{(i)}_t\) denote the price rising rate of convertible bonds *i* in the *t*-th holding period. Then, the rate of return of the ensemble portfolio is calculated as

#### Long-only strategy

We execute *long-only strategy* at a given budget constraint \(\bar{M}\) and long underestimated convertible bonds. Given a sequential investment with *T* periods, the long portfolio is donated as \(\varvec{c}^+_t, t=1,...,T\).

At time *t*, we first rank the convertible bonds in ascending order in accordance with the mispricing for the sample using the percentage error (PE) based on the *DeepPricing* model price defined in Eq.Â 47 and partition them into deciles.

Then, we only long the convertible bonds ranked in top decile in according to the equal or value-weighted investment proportion in \(\varvec{c}^+_t\)s. The volume of convertible bond *i* that we can long at time *t* is

At the end of the *t*-th holding period, we sell convertible bonds in the long portfolio. The money we can get is the proceeds of selling convertible bonds using new prices at \(t+1\) for all convertible bonds, i.e.,

Let \(z^{(i)}_t = p^{(i)}_{t+1}/ p^{(i)}_t\) denote the price rising rate of convertible bonds *i* in the *t*-th holding period. Then, the rate of return is calculated as

### Evaluation measures

We select several important evaluation metrics to evaluate the model performance on a standard back-rest platform, including profitability - annualized return, risk - annualized volatility, the maximum drawdown and downside deviation, and performance ratios - Sharpe Ratio, Sortino Ratio and Calmar ratio.

*Annualized return (AR)* annualized return is an annualized average of return rate. It is defined as \(\text {AR}_T = \text {A}_T \times \text {N}_Y\) , where \(N_Y\) is the number of holding periods in a year.

*Annualized volatility (AVOL)* annualized volatility is an annualized average of volatility. It is defined as \(\text {AVOL}_T = \text {V}_T \times \sqrt{\text {N}_Y}\) and is used to measure the average risk of a strategy during an unit time period.

*Max drawdown (MDD)* max drawdown is the maximum loss from a peak to a trough of a portfolio, before a new peak is attained. It is the other way to measure the investment risk. The formalized definition of MDD is

*Downside deviation (DD)* downside deviation ratio measures the downside risk of a strategy as the average of returns when it falls below a minimum acceptable return (MAR). The formalized definition of DD is given as

*Annualized sharpe ratio (ASR)* Annualized sharpe ratio is a risk-adjusted profit measure based on AR and AVOL.The formalized definition of ASR is \(\text {ASR}_T = \text {AR}_T /\text {AVOL}_T\).

*Sortino ratio (STR)*: Sortino ratio is a risk-adjusted profit measure based on AR and DD. The formalized definition of STR is \(\text {STR}_T = \text {AR}_T /\text {DD}_T\).

*Calmar ratio (CR)*: Calmar ratio is a risk-adjusted profit measure based on AR and MDD. The formalized definition of CR is \(\text {CR}_T = \text {AR}_T /\text {MDD}_T\).

### Strategy performance

FigureÂ 6 are the equally-weighted and value-weighted cumulative returns of *Long-Short Strategy* and *long-only strategy*, respectively. In general, during the sample period from January 01, 2018 to June 30, 2021, *Long-Short Strategy* earns annualized return with 41.16% and 42.52% annualized return for equally-weighted portfolio and value-weighted portfolio, respectively. And *long-only strategy* earns annualized return with 31.06% and 37.09% annualized return for equally-weighted portfolio and value-weighted portfolio, respectively. Moreover, the performances evaluated by other measures are listed in TableÂ 7^{Footnote 11}.

## Conclusion

In this paper, we propose *DeepPricing*, a *FinGAN* based model for pricing convertible bonds. Extending traditional model-driven stock return generators, the method is more flexible and accurate than the baseline methods to capture dynamics of the underlying stock return process by adopting a novel financial time-series generative adversarial networks, as it is able to reproduce risk-neutral stock return process that retains the major stylized facts such as the linear unpredictability, the fat-tailed distributions, the volatility clustering, the leverage effects, the coarse-fine volatility correlation, and the gain/loss asymmetry.

We implement the *DeepPricing* model and conduct an extensive empirical pricing study for the Chinese convertible bond market, covering daily prices from January 01, 2010 to June 30, 2021. Several important conclusions can be drawn. First, the *DeepPricing* model has a much better convertible bonds pricing performance than the traditional model-based generators such as BS, CEV, GARCH and data-driven generators such as FinGAN-MLP and FinGAN-LSTM. Second, we find that due to the higher reproducibility of stylized facts, the *DeepPricing* model substantially improves the pricing of equity-liked convertible bonds, the convertible bonds trading in the bull market and out-of-the-money convertible bonds. The results indicate that the *DeepPricing* model is more flexible to capture the volatility and high-dimensional features of the underlying stock return process, outperforming other models in fitting higher volatility convertible bonds and the overall market implied volatility smirk. Third, we analyze the factors affecting the pricing of convertible bonds. Empirical results show that convertible bonds which are higher-liquidity, riskier (longer time to maturity, lower quality credit rating, higher volatility), equity-liked, trading in the bull market and out-of-the-money tend to be more likely to be mispriced. The results are consistent with the convertible bond pricing performance, and further illustrate the importance of accurately characterizing the volatility and high-dimensional features for the pricing of convertible bonds. Finally, the investment strategies based on the *DeepPricing* model are proposed. Both *long-short strategy* and *long-only strategy* earn a significant annualized return with 41.16% and 31.06% for equally-weighted portfolio and 42.52% and 37.09% for the value-weighted portfolio during the sample period, respectively.

This paper provides a new attempt to apply the GAN-based method for the pricing of convertible bonds. For future research, it can be applied for the pricing of other complex path-dependent derivatives.

## Availability of data and materials

All data were collected from Wind Database and Bloomberg Database. The Wind Database can be accessed at https://www.wind.com.cn, and the Bloomberg can be accessed at https://www.bloomberg.com/professional/.

## Notes

Notice that throughout this section, \(n, m \in \mathbb {N}\). \(r, \tilde{r}, \hat{r}\) and \(\hat{\tilde{r}}\) are \(\mathbb {R}^{n}\)-valued random variables,

*Z*is \(\mathbb {R}^m\)-valued random variable.Empirical results suggest that TCN is able to capture long-range dependencies in sequences more effectively than well-known recurrent architectures (Goodfellow etÂ al. 2016) such as the gated recurrent unit (GRU, Chung etÂ al. 2014) or the long short-term memory (LSTM, Zhang etÂ al. 2019). One of the main advantages of TCN is the absence of exponentially vanishing and exploding gradients through time (Pascanu etÂ al. 2013), which is one of the main issues why recurrent neural networks (RNN) are difficult to optimize. Although LSTM addresses this issue by using gated activations, empirical studies show that TCN performs better on supervised learning benchmarks (Bai etÂ al. 2018).

Not reported in the table.

US convertible market is more accessible for issuers who have difficulty entering the traditional bond market due to restrictive rating requirements ( Batten etÂ al. 2018).

The conversion premium measures the excess of the conversion price over the stock price at issuance as a percentage of the stock price.

Notice that this property is market dependent (Qiu etÂ al. (2006)). While the negative correlation (leverage effect) is observed in German DAX, the positive correlation (anti-leverage effect) is detected in Chinese market.

For fair comparison, the FinGAN-MLP and FinGAN-LASM models have been trained with the same optimizer (Adam) and learning rate of 1e-5 as FinGAN.

The MAE performance metrics for robustness test in TableÂ 5 has also shown the efficiency of the

*DeepPricing*model compared to other baseline models.Notice that all our empirical results are due to the fact that the real daily convertible bonds trading prices are used as a baseline.

Moneyness is defined as the \(\xi =\text {S}_t/\text {K}_t\), where \(\text {K}_t\) is the conversion price and \(\text {S}_t\) is the close price of the underlying stock. A convertible bond is said to be out-of-the-money if its \(\xi <0.99\); at the money if \(\xi \in [0.99,1.01]\); in-the-money if \(\xi > 1.01\).

Notice that strategy returns are computed in the absence of transaction costs.

## References

Ammann M, Kind A, Wilde C (2008) Simulation-based pricing of convertible bonds. J Empir Finance 15(2):310â€“331

Ayache E, Forsyth PA, Vetzal KR (2003) Valuation of convertible bonds with credit risk. J Deriv 11(1):9â€“29

Barone-Adesi G, BermÃºdez A, Hatgioannides J (2003) Two-factor convertible bonds valuation using the method of characteristics/finite elements. J Econ Dynamics Control 27(10):1801â€“1831

Batten JA, Khaw KLH, Young MR (2018) Pricing convertible bonds. J Bank Finance 92:216â€“236

Black F, Scholes M (1973) The pricing of options and corporate liabilities. J Polit Econ 81(3):637â€“654

Bollerslev T (1986) Generalized autoregressive conditional heteroskedasticity. J Econom 31(3):307â€“327

Bouchaud J-P, Matacz A, Potters M (2001) Leverage effect in financial markets: the retarded volatility model. Phys Rev Lett 87(22):228701

Brennan MJ, Schwartz ES (1977) Convertible bonds: valuation and optimal strategies for call and conversion. J Finance 32(5):1699â€“1715

Brown SJ, Grundy BD, Lewis CM, Verwijmeren P (2012) Convertibles and hedge funds as distributors of equity exposure. Rev Financ Stud 25(10):3077â€“3112

Buchan MJ (1997) Convertible bond pricing: theory and evidence. Harvard University, Cambridge

Burlacu R (2000) New evidence on the pecking order hypothesis: the case of French convertible bonds. J Multinatl Financ Manag 10(3â€“4):439â€“459

Chakraborti A, Toke IM, Patriarca M, Abergel F (2011) Econophysics review: I. Empirical facts. Quant Finance 11(7):991â€“1012

Chambers DR, Lu Q (2007) A tree model for pricing convertible bonds with equity, interest rate, and default risk. J Deriv 14(4):25â€“46

Christie AA (1982) The stochastic behavior of common stock variances: value, leverage and interest rate effects. J Financ Econ 10(4):407â€“432

Cont R (2001) Empirical properties of asset returns: stylized facts and statistical issues. Quant Finance 1(2):223

Cont R (2007) Volatility clustering in financial markets: empirical facts and agent-based models. In: Teyssiere G, Kirman AP (eds) Long memory in economics. Springer, Berlin, pp 289â€“309

Dogariu M, Åžtefan L-D, Boteanu BA, Lamba C, Kim B, Ionescu B (2022) Generation of realistic synthetic financial time-series. ACM Trans Multimed Comput Commun Appl (TOMM) 18(4):1â€“27

Duan JC (1995) The GARCH option pricing model. Math Finance 5(1):13â€“32

Fan C, Luo X, Wu Q (2017) Stochastic volatility vs. jump diffusions: evidence from the Chinese convertible bond market. Int Rev Econ Finance 49:1â€“16

Gavrishchaka VV, Ganguli SB (2003) Volatility forecasting from multiscale and high-dimensional market data. Neurocomputing 55(1â€“2):285â€“305

Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press, UK

Gupta A, Zou J (2019) Feedback GAN for DNA optimizes protein functions. Nat Mach Intell 1(2):105â€“111

Hung MW, Wang JY (2002) Pricing convertible bonds subject to default risk. J Deriv 10(2):75â€“87

Ingersoll JE Jr (1977) A contingent-claims valuation of convertible securities. J Financ Econ 4(3):289â€“321

Jensen MH, Johansen A, Simonsen I (2003) Inverse statistics in economics: the gain-loss asymmetry. Phys A Stat Mech Appl 324(1â€“2):338â€“343

Keynes JM (2018) The general theory of employment, interest, and money, 2nd edn. Springer, UK

Koshiyama A, Firoozye N, Treleaven P (2021) Generative adversarial networks for financial trading strategies fine-tuning and combination. Quant Finance 21(5):797â€“813

Lewis CM (1991) Convertible debt: valuation and conversion in complex capital structures. J Bank Finance 15(3):665â€“682

Lin S, Zhu S-P (2020) Numerically pricing convertible bonds under stochastic volatility or stochastic interest rate with an adi-based predictorâ€“corrector scheme. Comput Math Appl 79(5):1393â€“1419

Liu Y, Gopikrishnan P, Stanley HE et al (1999) Statistical properties of the volatility of price fluctuations. Phys Rev E 60(2):1390

Longstaff FA, Schwartz ES (2001) Valuing American options by simulation: a simple least-squares approach. Rev Financ Stud 14(1):113â€“147

Ma C, Xu W, Yuan G (2020) Valuation model for Chinese convertible bonds with soft call/put provision under the hybrid willow tree. Quant Finance 20(12):2037â€“2053

Malmsten H, TerÃ¤svirta T (2010) Stylized facts of financial time series and three popular models of volatility. Eur J Pure Appl Math 3(3):443â€“477

Merton RC (1973) Theory of rational option pricing. Bell J Econ Manag Sci 4:141â€“183

McConnell JJ, Schwartz ES (1986) LYON taming. J Finance 41(3):561â€“576

Merton RC (1974) On the pricing of corporate debt: the risk structure of interest rates. J Finance 29(2):449â€“470

MÃ¼ller UA, Dacorogna MM, DavÃ© RD, Olsen RB, Pictet OV, Von WeizsÃ¤cker JE (1997) Volatilities of different time resolutions-analyzing the dynamics of market components. J Empir Finance 4(2â€“3):213â€“239

Nyborg KG (1996) The use and pricing of convertible bonds. Appl Math Finance 3(3):167â€“190

Qiu T, Zheng B, Ren F, Trimper S (2006) Return-volatility correlation in financial dynamics. Phys Rev E 73(6):065103

Shreve SE (2004) Stochastic calculus for finance II: continuous-time models. Springer, New York

Takahashi A, Kobayashi T, Nakagawa N (2001) Pricing convertible bonds with default risk. J Fixed Income 11(3):20â€“29

Takahashi S, Chen Y, Tanaka-Ishii K (2019) Modeling financial time-series with generative adversarial networks. Phys A Stat Mech Appl 527:121261

Tan X, Zhang Z, Zhao X, Wang C (2021) Investor sentiment and limits of arbitrage: evidence from Chinese stock market. Int Rev Econ Finance 75:577â€“595

Tsiveriotis K, Fernandes C (1998) Valuing convertible bonds with credit risk. J Fixed Income 8(2):95

Wiese M, Knobloch R, Korn R, Kretschmer P (2020) Quant GANs: deep generation of financial time series. Quant Finance 20(9):1419â€“1440

Yagi K, Sawaki K (2010) The valuation of callable-puttable reverse convertible bonds. Asia-Pac J Oper Res 27(02):189â€“209

Zhang K, Zhong G, Dong J, Wang S, Wang Y (2019) Stock market prediction based on generative adversarial network. Proced Comput Sci 147:400â€“406

Zhou M, Huang W, Dong Z, Fang X (2013) Can the pricing efficiency of Chinese convertible bonds be improved? Analysis from the perspective of the cost of arbitrage. China Econ Q 4:1278â€“1298

Zhu S-P (2006) A closed-form analytical solution for the valuation of convertible bonds with constant dividend yield. ANZIAM J 47(4):477â€“494

Zhu S-P, Lin S, Lu X (2018) Pricing puttable convertible bonds with integral equation approaches. Comput Math Appl 75(8):2757â€“2781

Bai S, Kolter JZ, Koltun V (2018) An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv preprint arXiv:1803.01271

Chung J, Gulcehre C, Cho K, Bengio Y (2014) Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555

Cox JC (1996) The constant elasticity of variance option pricing model. 47-98, J Portf Manag 15:79â€“98

Donahue C, McAuley J, Puckette M (2018) Synthesizing audio with generative adversarial networks. arXiv preprint arXiv:1802.04208

Esteban C, Hyland SL, RÃ¤tsch, G (2017) Real-valued (medical) time series generation with recurrent conditional GANs. arXiv preprint arXiv:1706.02633

Farimani AB, Gomes J, Pande VS (2017) Deep learning the physics of transport phenomena. arXiv preprint arXiv:1709.02432

Gan C, Huang D, Chen P, Tenenbaum JB, Torralba A (2020) Foley music: learning to generate music from videos. In: European conference on computer vision. Springer, Cham, pp 758â€“775

Garbacea C, Carton S, Yan S, Mei Q (2019) Judge the judges: a large-scale evaluation study of neural language models for online review generation. arXiv preprint arXiv:1901.00398

Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets. Adv Neural Inf Process Syst 27

Hadad N, Wolf L, Shahar M (2018) A two-step disentanglement method. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 772â€“780

Karras T, Aittala M, Laine S, HÃ¤rkÃ¶nen E, Hellsten J, Lehtinen J, Aila T (2021) Alias-free generative adversarial networks. Adv Neural Inf Process Syst 34

Killoran N, Lee LJ, Delong A, Duvenaud D, Frey BJ (2017) Generating and designing DNA with deep generative models. arXiv preprint arXiv:1712.06148

Li D, Chen D, Jin B, Shi L, Goh J, Ng S-K (2019) MAD-GAN: multivariate anomaly detection for time series data with generative adversarial networks. In: International conference on artificial neural networks. Springer, Cham, pp 703â€“716

Pascanu R, Mikolov T, Bengio Y (2013) On the difficulty of training recurrent neural networks. In: International conference on machine learning. PMLR, pp 1310â€“1318

Radford A, Metz L, Chintala S (2015) Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434

Salimans T, Goodfellow I, Zaremba W, Cheung V, Radford A, Chen X (2016) Improved techniques for training GANs. Adv Neural Inf Process Syst 29

Sun C, Hong S, Song M, Li H (2020) A review of deep learning methods for irregularly sampled medical time series data. arXiv preprint arXiv:2010.12493

Wang J, Zhang Y, Tang K, Wu J, Xiong Z (2019) Alphastock: a buying-winners-and-selling-losers investment strategy using interpretable deep reinforcement attention networks. InProceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, pp 1900â€“1908

Yang LC, Chou SY, Yang YH (2017) Midinet: a convolutional generative adversarial network for symbolic-domain music generation. arXiv preprint arXiv:1703.10847

Zhang Y, Gan Z, Carin L (2016) Generating text via adversarial training. In: NIPS workshop on adversarial training, vol 21. Academia. edu, San Francisco, pp 21â€“32

## Acknowledgements

We are grateful to helpful comments from Gang Kou (the editor) and four anonymous referee. We also benefitted from the discussions with Yeqing Zhang.

## Funding

This work has been supported by the Postdoctoral Science Foundation of China (Project No.2021M700055). Xiaoyu Tan has been supported by Special Fund for Postdoctoral Funding for Shanghai and Science and Technology Development in Shanghai Pudong New Area.

## Author information

### Authors and Affiliations

### Contributions

All authors collaborated closely on the subject. In particular, XYT: Conceptualization, Methodology, Algorithm, Writing-Original Draft. ZLZ: Conceptualization, Methodology, Supervision. XJZ: Conceptualization, Reviewing, Supervision. SYW: Conceptualization, Methodology, Algorithm, Visualization, Editing. All authors read and approved the final manuscript.Â .

### Corresponding authors

## Ethics declarations

### Competing interests

The authors declare that they have no competing interests.

## Additional information

### Publisherâ€™s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Appendix

### Appendix

### The implement details of *FinGAN*

We use TensorFlow to implement *FinGAN*. For all of the components (encoder, decoder, generator, and discriminator networks), we used TCNs with skip connections. Inside the TCN architecture, the block module is composed of temporal blocks each containing two dilated causal convolutions and two PReLUs as activation functions. The TCN architecture is illustrated in Table 8. Table 9 shows the input, hidden and output dimensions of the models. Note that for all models, the hidden dimension was set to 80, the kernel size of each temporal block, except the first block, was 2, and the receptive field size of each TCN is 127.

## Rights and permissions

**Open Access** This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

## About this article

### Cite this article

Tan, X., Zhang, Z., Zhao, X. *et al.* DeepPricing: pricing convertible bonds based on financial time-series generative adversarial networks.
*Financ Innov* **8**, 64 (2022). https://doi.org/10.1186/s40854-022-00369-y

Received:

Accepted:

Published:

DOI: https://doi.org/10.1186/s40854-022-00369-y

### Keywords

- Convertible bonds
- Generative adversarial network
- Time-series simulation
- Pricing
- Investment strategy
- Artificial intelligence

### JEL Classification

- G1
- G12
- C5
- C6
- C63