From: Predicting abnormal trading behavior from internet rumor propagation: a machine learning approach
Attribute name | Notation | Description | Source/Calculation | |
---|---|---|---|---|
Post | ||||
Accumulation | In the preceding 90 days | PA_90 | The summed daily post of a stock in CMoney up to the preceding X days. (X = 30, 60, 90) | CMoney |
In the preceding 60 days | PA_60 | |||
in the preceding 30 days | PA_30 | |||
Variation | The preceding 60th and 90th day | PV_90 | Between the preceding X1th and X2th day, the difference in the volume of posts within a day based on a stock in CMoney. ([X1, X2] = [0,30], [30,60], [60,90]) | CMoney |
the preceding 30th and 60th day | PV_60 | |||
the preceding 30th day | PV_30 | |||
Reply | ||||
Accumulation | In the preceding 90 days | RA_90 | The summed daily reply of all post for a stock in CMoney up to the preceding X days. (X = 30, 60, 90) | CMoney |
In the preceding 60 days | RA_60 | |||
In the preceding 30 days | RA_30 | |||
Variation | The preceding 60th and 90th day | RV_90 | Between the preceding X1th and X2th day, the difference in the volume of replies within a day based on all posts of a stock in CMoney. ([X1, X2] = [0,30], [30,60], [60,90]) | CMoney |
The preceding 30th and 60th day | RV_60 | |||
The preceding 30th day | RV_30 | |||
Like | ||||
Accumulation | In the preceding 90 days | LA_90 | The summed daily like of all post for a stock in CMoney up to the preceding X days. (X = 30, 60, 90) | CMoney |
In the preceding 60 days | LA_60 | |||
In the preceding 30 days | LA_30 | |||
Variation | The preceding 60th and 90th day | LV_90 | Between the preceding X1th and X2th day, the difference in the volume of likes within a day based on all posts of a stock in CMoney. ([X1, X2] = [0,30], [30,60], [60,90]) | CMoney |
The preceding 30th and 60th day | LV_60 | |||
The preceding 30th day | LV_30 | |||
Industry | ||||
Biotechnology and medicine | BM | There are 6 industries including Biotechnology and Medicine, Cultural and Creative Industry, Electronics Manufacturing, Information and Communications Technology, Other Manufacturing and Other Service Industry | TEJ | |
Cultural and creative industry | CC | |||
Electronics manufacturing | EM | |||
Information and communications technology | IC | |||
Other manufacturing | OM | |||
Other service industry | OS | |||
Management shocks | FC | The variable was flagged (“1” = “Yes”, and “0” = “No”) whether the stocks have ever had management shocks such as strikes in the preceding year | ||
Incorporation type | I_type | There are 2 types including General stock, and KY stock which is a stock registered abroad with initial public offering in Taiwan | ||
Shareholdings variation of major stockholders with above 600 lots | ||||
The preceding 30th day | S600_30 | The shareholdings of preceding Xth day for Major stockholders with above 600 lots. The major-stockholders definition is based on the multiple partitioning method. (X = 30, 60, 90) | ||
The preceding 60th day | S600_60 | |||
The preceding 90th day | S600_90 | |||
Shareholdings variation of individual stockholders with 20 lots and below | ||||
The preceding 30th day | S20_30 | The shareholdings of the preceding Xth day for individual stockholders with 20 lots and below. The individual-stockholders definition is based on the multiple partitioning method. (X = 30, 60, 90) | ||
The preceding 60th day | S20_60 | |||
The preceding 90th day | S20_90 |