After loading data from a file or database, you may want to make a few checks to see if the data makes sense and there are no issues with its quality. In case of data quality issues, you may want to perform a more thorough investigation in order to understand the scale and patterns of the problem. This process is frequently called data profiling. EasyMorph provide comprehensive means to profile data:
- The "Cell Metadata" dialog is used to profile cell values.
- The "Column Profiler" dialog is for profiling individual columns.
- Finally, the "Analysis View" is a powerful tool for instant filtering and exploratory data analysis.
Cell Metadata
The "Cell Metadata" dialog is invoked by right-clicking a cell and choosing "Cell metadata". It displays the cell value's data type and additional metadata. For instance, in the screenshot below, with the help of the cell profiler you can see that the cell value is actually a text, not a number.
Note that the dialog is floating — you can keep it open while clicking different cells.
Column Profiler
The Column Profiler is invoked by double-clicking a column header. Alternatively, right-click the column header and choose "Filter/Profile".
The tab "Values" shows a list of unique values in the column. The list is searchable. Also, you can select particular values and create a filter action with them right from the profiler with a single click.
The tab "Profile" shows various counts and metadata that help understand what kind of values are present in the column.
Note that dates are numbers in EasyMorph (the type system of EasyMorph is explained later in the tutorial). Therefore, the Profiler shows counts for possible dates among number counts. Each count/metadata metric has a button for quick filtering.
Hint: The "Column Profiler" dialog is floating too. When the header of another column is clicked, the column metadata is automatically displayed in the Profiler window.