About Dataset

This dataset is designed for analyzing how complex scheduling algorithms affect injury rates and athletic performance in a collegiate sports environment. The data are synthetic but intended to be realistic, and cover each athlete's demographics, training regime, schedule, fatigue level, and injury risk.

Features Overview

Athlete Information

  • Athlete_ID: Unique identifier for each athlete (e.g., A001, A002)
  • Age: Athlete's age (18–25 years)
  • Gender: Gender of the athlete (Male/Female)
  • Height_cm: Height in centimeters (160–200 cm)
  • Weight_kg: Weight in kilograms (55–100 kg)
  • Position: Playing position (Guard, Forward, Center)

Training Information

  • Training_Intensity: Average session intensity (1 = low, 10 = high)
  • Training_Hours_Per_Week: Total training hours per week (5–20 hours)
  • Recovery_Days_Per_Week: Recovery days per week (1–3 days)

Schedule Information

  • Match_Count_Per_Week: Matches per week (1–4 matches)
  • Rest_Between_Events_Days: Average rest days between matches (1–3 days)

Derived Features

  • Load_Balance_Score: Score (0–100) indicating the balance between training and recovery
  • ACL_Risk_Score: Predicted risk score (0–100) for anterior cruciate ligament (ACL) injury

Injury Information

  • Injury_Indicator: 1 if athlete sustained an ACL injury, 0 otherwise
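
Because Injury_Indicator is a 0/1 flag, its mean within a group is that group's injury rate. The sketch below shows one way to compare rates across a schedule column such as Match_Count_Per_Week. It assumes the rows have already been loaded as dictionaries keyed by column name (for example with `csv.DictReader`); the helper name `injury_rate_by` and the sample rows are made up for illustration and are not part of the dataset.

```python
from collections import defaultdict


def injury_rate_by(rows, group_column):
    """Return {group value: share of rows with Injury_Indicator == 1}."""
    counts = defaultdict(lambda: [0, 0])  # group value -> [injured, total]
    for row in rows:
        tally = counts[row[group_column]]
        tally[0] += int(row["Injury_Indicator"])  # accepts 1 or "1"
        tally[1] += 1
    return {group: injured / total for group, (injured, total) in counts.items()}


# Made-up rows for illustration only; real rows would come from the data file.
sample_rows = [
    {"Athlete_ID": "A001", "Match_Count_Per_Week": 1, "Injury_Indicator": 0},
    {"Athlete_ID": "A002", "Match_Count_Per_Week": 4, "Injury_Indicator": 1},
    {"Athlete_ID": "A003", "Match_Count_Per_Week": 4, "Injury_Indicator": 0},
]

for group, rate in sorted(injury_rate_by(sample_rows, "Match_Count_Per_Week").items()):
    print(f"{group} matches/week: injury rate {rate:.2f}")
```

The same helper works for any categorical or low-cardinality column, such as Position or Recovery_Days_Per_Week.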

Performance Metrics

  • Fatigue_Score: Subjective fatigue level (1 = low, 10 = high)
  • Performance_Score: Composite score (50–100) based on points, assists, etc.
  • Team_Contribution_Score: Overall contribution to team success (50–100)
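
The ranges listed above can double as a simple sanity check when loading the data. The following is a minimal sketch, assuming each row is a dictionary keyed by the column names in this README. The helper `find_violations` and the sample row are hypothetical, and the bounds are copied from the feature list rather than taken from any official schema.

```python
# Inclusive (min, max) bounds copied from the feature list in this README.
NUMERIC_RANGES = {
    "Age": (18, 25),
    "Height_cm": (160, 200),
    "Weight_kg": (55, 100),
    "Training_Intensity": (1, 10),
    "Training_Hours_Per_Week": (5, 20),
    "Recovery_Days_Per_Week": (1, 3),
    "Match_Count_Per_Week": (1, 4),
    "Rest_Between_Events_Days": (1, 3),
    "Load_Balance_Score": (0, 100),
    "ACL_Risk_Score": (0, 100),
    "Fatigue_Score": (1, 10),
    "Performance_Score": (50, 100),
    "Team_Contribution_Score": (50, 100),
}

# Allowed values for the categorical columns, compared as strings.
CATEGORICAL_VALUES = {
    "Gender": {"Male", "Female"},
    "Position": {"Guard", "Forward", "Center"},
    "Injury_Indicator": {"0", "1"},
}


def find_violations(row):
    """Return (column, value) pairs that are missing or outside the documented ranges."""
    violations = []
    for column, (low, high) in NUMERIC_RANGES.items():
        raw = row.get(column)
        try:
            value = float(raw)
        except (TypeError, ValueError):
            violations.append((column, raw))  # missing or non-numeric
            continue
        if not low <= value <= high:
            violations.append((column, raw))
    for column, allowed in CATEGORICAL_VALUES.items():
        if str(row.get(column)) not in allowed:
            violations.append((column, row.get(column)))
    return violations


# Made-up row for illustration only; every value sits inside its documented range.
sample_row = {
    "Athlete_ID": "A001", "Age": "20", "Gender": "Female", "Height_cm": "178",
    "Weight_kg": "70", "Position": "Guard", "Training_Intensity": "6",
    "Training_Hours_Per_Week": "12", "Recovery_Days_Per_Week": "2",
    "Match_Count_Per_Week": "2", "Rest_Between_Events_Days": "2",
    "Load_Balance_Score": "75", "ACL_Risk_Score": "30", "Injury_Indicator": "0",
    "Fatigue_Score": "4", "Performance_Score": "82", "Team_Contribution_Score": "79",
}

print(find_violations(sample_row))
```

Values are parsed with `float` so the check works whether the rows hold strings (as `csv.DictReader` produces) or numbers.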