Data
Data page description
Last updated
Data page description
Last updated
Arcstar provides its data scientist community free curated, high quality and obfuscated data.
There are 6 datasets in the Tournament.
X_train | /data/X_train.csv |
y_train | /data/y_train.csv |
X_test | /data/X_test.csv |
Each id
in X_train and X_test
corresponds to a stock at a specific time Moons
.
The frequency of the Moons depends on the dataset :
gordon-geeko : 30 days interval between each moon
dolly : 90 days interval
e-kinetic : 7 days interval
c-mechanics : 7 days interval
b-volatility : 7 days interval
3b1-signal : 7 days interval
The features
describe specific attributes of a stock at a point in time.
The y_train
file contains 3 targets target_r
, target_g
, target_b
that correspond to the idiosyncratic return of the stock over 3 time horizons : 30, 60 and 90 days respectively.
The overall dataset is splitted in two : train and test. The test set start one moon after the last moon of X_train.
Files might be big (200+MB) so make sure to have enough space before downloading.