Data Collection and Pre-processing

Abideen Bello
3 min readFeb 9, 2023
Photo by Firmbee.com on Unsplash

Data collection and pre-processing are the unsung heroes of the data science world!

While they may not be as flashy as building models or creating stunning visualizations, they are the foundation upon which all great data-driven insights are built.

The success of any data analysis project largely depends on the quality and accuracy of the data that is collected and prepared for analysis.

Think about it — without clean and well-prepared data, your analysis is like trying to build a skyscraper on shaky ground.

It just won’t work!

However, with the correct data gathering and pre-processing techniques, you can convert a jumbled, unstructured mess of information into a polished, well-organized dataset suitable for analysis.

So, let’s dive in and discover the amazing world of data collecting and pre-processing!

Data collection

These is the process of obtaining information from many sources, such as databases, surveys, and web scraping, among others. The acquired data may be structured data, such as a spreadsheet, or unstructured data, such as text or photos.

Following data collection, pre-processing entails cleaning, converting, and preparing the data for analysis. This is an…

--

--

Abideen Bello
Abideen Bello

Written by Abideen Bello

I’m an IBM Certified Data Scientist with knowledge of Machine learning application, Data Visualization Expertise, and vast knowledge of Tableau, Power BI, etc.

No responses yet