Description
With this function you can change the data type of columns in your dataset.
Application
The importers determine and assign the data type of every column, and this usually works well. If an importer assigns the wrong data type, you can use this function to set it correctly. A wrong data type typically results from an inconsistency in the data (e.g. one row of a numerical column contains a string, which results in the Categorical data type for the entire column). Therefore, you should check the data and fix the underlying issue before changing the data type.
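The inconsistency described above can be reproduced outside the tool. The following is a minimal sketch, assuming the data types correspond to pandas dtypes (the names float64, bool, and object suggest this, but it is not confirmed here). The column name and values are hypothetical.

```python
import pandas as pd

# Hypothetical data: one row of a numerical column contains a string.
df = pd.DataFrame({"temperature": [20.5, 21.0, "n/a", 19.8]})

# The single string causes the entire column to be stored as object
# (shown as Categorical in the table of data types).
print(df["temperature"].dtype)

# Find the offending rows: values that cannot be parsed as numbers
# become NaN when coerced, while genuinely missing values are excluded.
parsed = pd.to_numeric(df["temperature"], errors="coerce")
bad_rows = df[parsed.isna() & df["temperature"].notna()]
print(bad_rows)
```

Inspecting the rows found this way helps you decide whether to correct the source data or to change the data type.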
How to use
- Create the step and assign a tabular dataset to it in the field Data.
- In the field Columns to change select all columns for which you want to change the data type.
- For each selected column a new field appears in which you can set the data type for that column. The current data type of the column is preset in the field.
- Change the data type of all selected columns by means of the new fields according to your requirements.
- Once you have finished the settings, click Apply to execute the step.
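For readers who want to see what such a conversion amounts to, the steps above can be sketched in pandas. This is an illustration only, not the tool's implementation. The column names, the values, and the `new_types` mapping are hypothetical stand-ins for the per-column fields.

```python
import pandas as pd

# Hypothetical tabular dataset.
df = pd.DataFrame({
    "sensor_id": [1, 2, 3],            # numbers that are really labels
    "is_valid": [1, 0, 1],             # 0/1 flags
    "reading": ["0.5", "1.25", "2.0"], # numbers stored as text
})

# One target data type per selected column, analogous to the
# fields that appear for each column chosen in "Columns to change".
new_types = {
    "sensor_id": "object",  # Categorical
    "is_valid": "bool",     # Boolean
    "reading": "float64",   # Float
}

# Apply all conversions at once; unlisted columns would keep their type.
converted = df.astype(new_types)
print(converted.dtypes)
```

In this sketch, a conversion to float64 fails with an error if a value cannot be parsed as a number, which is another reason to fix inconsistent data first.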
Available Data Types
Data type | Description |
---|---|
Float (float64) | Floating point number. Use this for any numerical data. |
Boolean (bool) | Columns with this data type can hold only one of two values (True/False, 0/1). |
Categorical (object) | Use this for any non-numerical columns. Typically, that would be Strings (i.e. text). |