This page contains an explanation of each data type that is compatible with this website and what kind of analysis is available for each. Every data type's analysis will contain modal analysis (most common value), the five most and least common values and the number of unique items.

- Boolean - a binary logical operator, this program accepts boolean values in many forms: True & False, Yes & No and variations on these. Analysis of boolean columns will include the total number of True, False, Yes and No values as well as the combined total of these. The frequency of these values are visualised through a pie chart.
- Character - a single character, this program recognises all ASCII characters. Analysis of Characters is basic with visualisation done through a histogram.
- Currency - numeric values which are preceded by a currency symbol such as a Pound, Euro or dollar sign. It is compatible with bracketed values which represent negative values. Analysis of currency columns is the same as for the Number data type.
- Date - this program recognises various date formats, including dates separated by dashes or slashes, as well as month first or day first representations. The Analysis of date columns includes tallys of the frequency of dates for each seasonal period. The frequency of each date is represented by a histogram.
- Date Time - a combination of both date and time separated by a space. The Analysis of date time columns is basic with Visualisation unavailable at this time.
- Day of the Week - the seven in a week, ie. Monday, Tuesday etc. Analysis is basic with visualisation done using a pie chart.
- Email Address - any string containing the '@' symbol, emails do not have to be valid to be classified as an email by the program. Analysis of columns of this type is basic with frequency visualised using a histogram.
- Decimals only - numbers which are all decimal numbers, not integers, ie 1.0 not 1. The program will label any whole numbers as errors in decimal columns. The Analysis and visualisation is the same as for the Number data type.
- Hyperlink - any https link to a website, the website address is not checked to be valid. Analysis is basic with frequency visualisation done using a histogram.
- Identification code - fixed length strings of characters or numbers, usually used to identify groups of objects, eg Staff Numbers or Part Numbers. All values in a column are expected to have the same length and any which is not is labelled by the program as an error. The program cannot automatically identity this data type and must be speicified using a template or the customise tool. The length of these values can also be specified. Frequency visualisation of this type is done using a histogram.
- Integers only - numbers which are all whole numbers, no decimals, ie 1 not 1.0. The program will label any decimal numbers as errors in integer columns. The Analysis and visualisation is the same as for the Number data type.
- Number - any real number is accepted. The analysis of these columns extends the basic analysis by adding minimum, maximum, mean, median low (lower median contained in the dataset for even datasets), median, median high (higher median contained in the dataset for even datasets), whether the values are normally distributed, standard deviation and outliers if the column is normally distributed. Frequency visualisation is done using a histogram.
- Scientific Notation - very large or small numeric values using e or E to signify the type. The Analysis and visualisation is the same as for the Number data type.
- String - any number of characters or digits not recognised as any other available type. There is no error detection available for this type and Analysis and Visualisation is basic.
- Time - time in 12 or 24 hour format. Analysis of time columns includes the most and least common hours. Frequency Visualisation is done using a histogram.
- Ignored / not detected - this label is for columns that are either empty or are specified by the user that they should be ignored. The program will automatically assign this type if it deems there are proportionally more empty spaces than filled ones. These columns have no error detection or analysis.

A project for the UWA System Health Lab | License | Contact Us