This section describes main characteristics of the baseball data set and its attributes:
General information
Baseball Salaries data set |
Type | Regression | Origin | Real world |
Features | 16 | (Real / Integer / Nominal) | (2 / 14 / 0) |
Instances | 337 | Missing values? | No |
Attribute description
Attribute | Domain | Attribute | Domain |
Batting_average | [0.063,0.457] | Walks | [0,138] |
On-base_percentage | [0.063,0.486] | Strike-Outs | [1,175] |
Runs | [0,133] | Stolen_bases | [0,76] |
Hits | [1,216] | Errors | [0,31] |
Doubles | [0,49] | Free_agency_eligibility | [0,1] |
Triples | [0,15] | Free_agent | [0,1] |
HomeRuns | [0,44] | Arbitration_eligibility | [0,1] |
Runs_batted_in | [0,133] | Arbitration | [0,1] |
Salary | [109,6100] |
Additional information
This dataset contains the 1992 salaries of the set of Major League Baseball players who played at least one game in both the 1991 and 1992 seasons, excluding pitchers. For each player, some performance measures are provided along with four categorical variables indicate how free each one was to move to other teams.
In this section you can download some files related to the baseball data set:
- The complete data set already formatted in KEEL format can be downloaded from
here.
- A copy of the data set already partitioned by means of a 5-folds cross validation procedure can be downloaded from here.
- The header file associated to this data set can be downloaded from here.
- This is not a native data set from the KEEL project. It has been obtained from the Bilkent University Function Approximation Repository. The original page where the data set can be found is: http://funapp.cs.bilkent.edu.tr/DataSets/.
|