Get data from a Google BigQuery table using Python 3 – a specific task that took me a while to complete due to little or confusing online resources You might ask why a Data Scientist was stuck to solve such a trivial Data Engineering task? Well… because most of the time… there is no […]
Category: Data science
Get data from a GCS bucket using Python 3 – a specific task that took me a while to complete due to little or confusing online resources You might ask why a Data Scientist was stuck to solve such a trivial Data Engineering task? Well… because most of the time… there is no proper
Get data from a GCS bucket using Python 3Read More »
Run automatic EDA (Exploratory Data Analysis) in Python With 2 lines of Python code you’ll get a HTML report with all the important EDA aspects you need to understand your raw data. Install pandas_profiling: pip install pandas_profiling Import pandas_profiling: from pandas_profiling import ProfileReport Create the autoEDA report: profile = ProfileReport(rawdataTbl, title=”Profiling Report”) profile.to_file(“Profiling Report.html”) Check
autoEDA in Python: Pandas Profiling ExampleRead More »
Coaching and Mentoring are very important to me as they guided me throughout my career in Data and more recently, the career of a Data Scientist. They are extremely relatable, but there are still some key differences between the two. In this article I’ll explain the difference between the two and how you can benefit
The Benefits Of Having A Coach / Mentor In The Field Of Data ScienceRead More »
The lifecycles below will guide you from the initial phase of a Data Science project through the project’s successful completion. It will enable you to divide the work within the team, estimate efforts, document all the steps of your project, and set realistic expectations for the project stakeholders. I believe that implementing a standard process model
The Lifecycles of Data Science and MLOpsRead More »
At this stage, you managed to create your first Python project and saved all the packages used in a virtual environment. The next step is to collaborate with other data professionals. For a smooth collaboration with your peers, now you have to create a requirements.txt file with all the Python packages used and their respective
“Freeze” your Python environment by creating the Requirements.txt fileRead More »
Wait, what?? Why would you use R Studio as an IDE for running Python? There’s a simple answer to this question: this is the perfect Data Science IDE when you use R and Python together. You’ll find below the simple steps to help set up a project in R Studio so you can start using
A Data Scientist’s guide to run Python in RStudioRead More »
New job titles are confusing. When dad asked me what I do for a living, I told his that I’m a Data Scientist. By his look, I understood that a job title wasn’t enough to explain what I do, so I added: “I use current and past information to predict the future”. He seemed fascinated
What is a data scientist?Read More »
You are here because you code, but how professional does your code look? Professional programmers think of systems as stories to be told rather than programs to be written. I have listed 17 important clean coding standards into 4 different sections. Make sure you bookmark the page and share it with your colleagues: Naming: The
17 Clean Code standards to adopt NOW!Read More »
Starting with 2019, the interest in Data Science education and getting an accreditation skyrocketed. Have a look below at the Google trend of the two search terms: Data Science course vs Data Science certificate: This shows that many people are looking to get formal training on Data Science. Generally speaking, a technical certification
The perfect Data Science Certification for a Junior Data ScientistRead More »