Card

About Dataset

This dataset is designed to analyze the impact of complex scheduling algorithms on injury rates and athletic performance in a collegiate sports environment. It provides synthetic but realistic data for athletes, capturing their demographics, training regimes, schedules, fatigue levels, and injury risks.

Features Overview

Athlete Information

  • Athlete_ID: Unique identifier for each athlete (e.g., A001, A002)
  • Age: Athlete's age (18–25 years)
  • Gender: Gender of the athlete (Male/Female)
  • Height_cm: Height in centimeters (160–200 cm)
  • Weight_kg: Weight in kilograms (55–100 kg)
  • Position: Playing position (Guard, Forward, Center)

Training Information

  • Training_Intensity: Average session intensity (1 = low, 10 = high)
  • Training_Hours_Per_Week: Total training hours per week (5–20 hours)
  • Recovery_Days_Per_Week: Recovery days per week (1–3 days)

Schedule Information

  • Match_Count_Per_Week: Matches per week (1–4 matches)
  • Rest_Between_Events_Days: Average rest days between matches (1–3 days)

Derived Features

  • Load_Balance_Score: Score (0–100) indicating the balance between training and recovery
  • ACL_Risk_Score: Predicted risk score (0–100) for ACL injury

Injury Information

  • Injury_Indicator: 1 if athlete sustained an ACL injury, 0 otherwise

Performance Metrics

  • Fatigue_Score: Subjective fatigue level (1 = low, 10 = high)
  • Performance_Score: Composite score (50–100) based on points, assists, etc.
  • Team_Contribution_Score: Overall contribution to team success (50–100)