Exploratory data analysis toolkit

Load the required packages
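
The outline does not name the packages, so the sketch below assumes pandas and NumPy, which cover every later step in this toolkit.

```python
# Assumed packages: the source does not list them explicitly.
import numpy as np
import pandas as pd

# Printing the versions makes the analysis easier to reproduce later.
print("pandas", pd.__version__)
print("numpy", np.__version__)
```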

Load the .csv file
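
A minimal sketch of loading a CSV into a dataframe with `pandas.read_csv`. The column names and values are hypothetical, and an in-memory string stands in for a real file so the example runs anywhere.

```python
import io

import pandas as pd

# Hypothetical CSV content standing in for a real file on disk.
csv_text = """id,city,X,score
1,Paris,1.5,10
2,Lyon,2.5,
3,Paris,3.0,30
"""

# With a real file this would be pd.read_csv("data.csv"); that path is an assumption.
df = pd.read_csv(io.StringIO(csv_text))
print(df.shape)
```

The empty `score` field in the second row is read as a missing value (NaN).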

Display the top and bottom rows of the dataframe
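
One way to do this, assuming a pandas dataframe, is with `head` and `tail`. The ten-row frame here is made up for illustration.

```python
import pandas as pd

# Illustrative dataframe; the real one would come from the loaded CSV.
df = pd.DataFrame({"id": range(1, 11), "value": [v * 1.5 for v in range(10)]})

top = df.head(3)      # first three rows (the default is 5)
bottom = df.tail(3)   # last three rows

print(top)
print(bottom)
```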

Inspect data types and identify columns with missing (N/A) values
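
A sketch of this inspection, assuming pandas: `info` and `dtypes` report the column types, and `isna().sum()` counts missing entries per column. The sample data is hypothetical.

```python
import numpy as np
import pandas as pd

# Illustrative dataframe with missing entries in two columns.
df = pd.DataFrame({
    "city": ["Paris", "Lyon", None],
    "X": ["1.5", "2.5", "3.0"],
    "score": [10.0, np.nan, 30.0],
})

df.info()                  # prints column types and non-null counts
dtypes = df.dtypes         # the same type information as a Series

# Number of missing values per column, then the columns that have any.
missing_per_column = df.isna().sum()
columns_with_missing = missing_per_column[missing_per_column > 0].index.tolist()
print(columns_with_missing)
```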

Convert the column labelled X to float
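
A minimal sketch of the conversion, assuming a pandas dataframe in which column `X` was read as text. The values are invented for the example.

```python
import pandas as pd

# Illustrative column X stored as strings.
df = pd.DataFrame({"X": ["1.5", "2", "3.25"]})

# astype(float) raises a ValueError if any entry cannot be parsed as a number.
df["X"] = df["X"].astype(float)
print(df.dtypes)
```

If the column may contain unparseable entries, `pd.to_numeric(df["X"], errors="coerce")` is an alternative that turns them into NaN instead of raising.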

Summary statistics for numeric attributes (count, mean, std, min, quartiles, max), rounded to 2 decimal places
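
In pandas this summary is typically produced with `describe` followed by `round`. The sketch below uses a small hypothetical dataframe.

```python
import pandas as pd

# Illustrative dataframe with two numeric columns and one text column.
df = pd.DataFrame({
    "height": [1.0, 2.0, 3.0, 4.0],
    "weight": [10.0, 20.0, 30.0, 41.0],
    "label": ["a", "b", "c", "d"],
})

# By default, describe() summarises only the numeric columns.
summary = df.describe().round(2)
print(summary)
```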

Correlation between numeric attributes
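
A sketch of a pairwise correlation matrix, assuming pandas. The numeric columns are selected first so that text columns do not interfere; `corr` computes Pearson correlation by default. The data is made up.

```python
import pandas as pd

# Illustrative dataframe: b rises with a, c falls as a rises.
df = pd.DataFrame({
    "a": [1, 2, 3, 4],
    "b": [2, 4, 6, 8],
    "c": [4, 3, 2, 1],
    "label": ["w", "x", "y", "z"],
})

# Keep only numeric columns, then compute the correlation matrix.
corr = df.select_dtypes(include="number").corr()
print(corr.round(2))
```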

Categorical data attributes
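
For non-numeric columns, one possible approach in pandas is to call `describe` on just those columns, which reports count, number of unique values, the most frequent value, and its frequency. `value_counts` gives the full breakdown for a single column. The column names below are hypothetical.

```python
import pandas as pd

# Illustrative dataframe with two categorical columns and one numeric column.
df = pd.DataFrame({
    "city": ["Paris", "Lyon", "Paris", "Nice"],
    "tier": ["a", "b", "a", "a"],
    "score": [1.0, 2.0, 3.0, 4.0],
})

# Select the non-numeric columns and summarise them.
categorical = df.select_dtypes(exclude="number")
cat_summary = categorical.describe()
print(cat_summary)

# Frequency of each category in one column.
city_counts = df["city"].value_counts()
print(city_counts)
```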

Drop null values
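
A minimal sketch using pandas `dropna`, which by default removes every row that contains at least one missing value. The data is invented for the example.

```python
import numpy as np
import pandas as pd

# Illustrative dataframe: rows 1 and 2 each contain a missing value.
df = pd.DataFrame({
    "city": ["Paris", None, "Nice"],
    "score": [10.0, 20.0, np.nan],
})

# dropna() returns a new dataframe; the original is left unchanged.
cleaned = df.dropna()
print(cleaned)
```

Passing `subset=[...]` restricts the check to chosen columns, which avoids discarding rows over gaps in columns that do not matter for the analysis.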

Replace null values with median
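
A sketch of median imputation for a single numeric column, assuming pandas. The `score` column is hypothetical.

```python
import numpy as np
import pandas as pd

# Illustrative numeric column with one missing entry.
df = pd.DataFrame({"score": [10.0, np.nan, 30.0, 50.0]})

# median() skips missing values, so this is the median of 10, 30 and 50.
median_score = df["score"].median()

# Fill the gap with that median.
df["score"] = df["score"].fillna(median_score)
print(df)
```

To treat every numeric column at once, `df.fillna(df.median(numeric_only=True))` follows the same idea.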

Inspect and remove duplicates
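
One way to handle this in pandas: `duplicated` flags rows that repeat an earlier row, and `drop_duplicates` removes them. The sample rows are made up.

```python
import pandas as pd

# Illustrative dataframe where the third row repeats the second.
df = pd.DataFrame({
    "id": [1, 2, 2, 3],
    "city": ["Paris", "Lyon", "Lyon", "Nice"],
})

# Inspect: count the fully duplicated rows and show them.
n_duplicates = int(df.duplicated().sum())
print(df[df.duplicated()])

# Remove: keep the first occurrence of each row and renumber the index.
deduplicated = df.drop_duplicates().reset_index(drop=True)
print(deduplicated)
```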

Obtain overall summary for each variable
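
A possible reading of this step, assuming pandas, is `describe(include="all")`, which combines the numeric and categorical summaries in one table. Cells that do not apply to a column (for example the mean of a text column) are left as NaN. The data is hypothetical.

```python
import pandas as pd

# Illustrative dataframe mixing a categorical and a numeric column.
df = pd.DataFrame({
    "city": ["Paris", "Lyon", "Paris"],
    "score": [1.0, 2.0, 3.0],
})

# include="all" summarises every column regardless of type.
overall = df.describe(include="all")
print(overall)
```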