Arcstar
  • Welcome page
  • Tournament
    • Getting Started
    • Data
    • Model
    • Submissions
    • Scoring
    • Leaderboard & Payout
    • API Key
    • Profile Page
    • FAQ
    • Support
    • Socials
    • Live Score Computation Process
  • $ARCSTAR
    • How to add the Token
    • How to use your Wallet
    • Tokenomics
    • Audit
  • Roadmap
    • General roadmap
Powered by GitBook
On this page
  • Dataset
  • ID
  • Moons
  • Features
  • Targets
  • Split
  • ​
  1. Tournament

Data

Data page description

PreviousGetting StartedNextModel

Last updated 1 year ago

Dataset

Arcstar provides its data scientist community free curated, high quality and obfuscated data.

There are 6 datasets in the Tournament.

X_train

​/data/X_train.csv​

y_train

​/data/y_train.csv​

X_test

​/data/X_test.csv​

ID

Each id in X_train and X_test corresponds to a stock at a specific time Moons.

Moons

The frequency of the Moons depends on the dataset :

  • gordon-geeko : 30 days interval between each moon

  • dolly : 90 days interval

  • e-kinetic : 7 days interval

  • c-mechanics : 7 days interval

  • b-volatility : 7 days interval

  • 3b1-signal : 7 days interval

Features

The features describe specific attributes of a stock at a point in time.

Targets

The y_train file contains 3 targets target_r, target_g, target_b that correspond to the idiosyncratic return of the stock over 3 time horizons : 30, 60 and 90 days respectively.

Split

The overall dataset is splitted in two : train and test. The test set start one moon after the last moon of X_train.

Files might be big (200+MB) so make sure to have enough space before downloading.

​