This analysis is part of my course project for Data Analysis with Python: Zero to Pandas. In this project, I performed a complete exploratory data analysis of the responses to the Stack Overflow Developer Survey 2020.

Stack Overflow’s annual Developer Survey has been one of the largest, if not the largest, surveys of coders and programmers worldwide for almost a decade now. In 2020, the survey aimed to be more representative of the diversity of programmers worldwide, and it was taken by approximately 65,000 people.

This time, I have used a helper library called ‘opendatasets’ to download…

NumPy, short for Numerical Python, is one of the most important foundational packages for numerical computing in Python. One reason it matters so much is that it is designed for efficiency on large arrays of data. It lets you perform complex computations on entire arrays without writing Python for loops. As a result, NumPy code is typically much faster and uses significantly less memory than equivalent code built on plain Python lists.
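To make the "no for loops" point concrete, here is a minimal sketch that computes the same result twice: once with an explicit Python loop and once with a single vectorized NumPy expression. The array size and variable names are assumptions chosen only for illustration.

```python
import numpy as np

# Hypothetical input: 100,000 floating-point values (size chosen for illustration).
values = np.arange(100_000, dtype=np.float64)

# Pure-Python approach: visit every element in an explicit loop.
squares_loop = [v * v for v in values.tolist()]

# NumPy approach: one expression applied to the whole array at once.
squares_vec = values * values

# Both approaches produce the same numbers.
same = np.allclose(squares_loop, squares_vec)

# A float64 array stores exactly 8 bytes per element in one contiguous buffer.
array_bytes = values.nbytes

print(same, array_bytes)
```

The vectorized version pushes the loop down into compiled code, which is where the speed and memory advantages usually come from.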

The following six NumPy functions are especially useful in data analysis:

- numpy.linspace
- numpy.repeat
- numpy.std
- numpy.percentile
- numpy.reshape
- numpy.swapaxes
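
As a quick preview, the snippet below calls each of the six functions once on small, made-up inputs. It is only a sketch: the input values and variable names are assumptions chosen to keep the results easy to check by hand.

```python
import numpy as np

# linspace: 5 evenly spaced values from 0 to 1, both endpoints included.
grid = np.linspace(0.0, 1.0, num=5)

# repeat: repeat each element of the input twice.
repeated = np.repeat([1, 2, 3], 2)

# Small sample used for the two statistics below.
data = np.array([2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0])

# std: population standard deviation (ddof=0 is the default).
spread = np.std(data)

# percentile: the 50th percentile is the median of the sample.
median = np.percentile(data, 50)

# reshape: view six values as a 2 x 3 matrix.
matrix = np.reshape(np.arange(6), (2, 3))

# swapaxes: exchange axis 0 and axis 1, giving a 3 x 2 matrix.
swapped = np.swapaxes(matrix, 0, 1)

print(grid, repeated, spread, median, matrix.shape, swapped.shape)
```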

!pip install jovian --upgrade -q…

As part of my course project on Data Analysis with Python, I first had to find a real-world dataset and perform an exploratory data analysis on it. Without much thought, I decided to work on the most trending topic in today’s world: Covid-19. I downloaded the latest Covid-19 dataset from https://ourworldindata.org/coronavirus-source-data, which provides a complete set of data for all countries starting from February 24, 2020. I also downloaded a second dataset from https://www.kaggle.com/fernandol/countries-of-the-world, which contains general, non-Covid information about each country. I wanted to merge certain columns from both datasets for my analysis.
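
Merging selected columns from two country-level tables might look roughly like the sketch below. The two small DataFrames are hypothetical stand-ins, not the real downloads, and the column names (`location`, `total_cases`, `Country`, `Population`, `Region`) and values are assumptions made purely for illustration.

```python
import pandas as pd

# Hypothetical stand-in for the Covid-19 dataset (made-up names and numbers).
covid = pd.DataFrame({
    "location": ["Alphaland", "Betastan", "Gammaria"],
    "total_cases": [100, 250, 40],
})

# Hypothetical stand-in for the general country dataset (made-up names and numbers).
countries = pd.DataFrame({
    "Country": ["Alphaland", "Betastan", "Deltania"],
    "Population": [1000, 5000, 2000],
    "Region": ["North", "South", "East"],
})

# Keep only the columns of interest from the second table.
selected = countries[["Country", "Population"]]

# Inner merge: keep countries present in both tables, matching on the name columns,
# then drop the now-redundant second name column.
merged = covid.merge(
    selected, left_on="location", right_on="Country", how="inner"
).drop(columns="Country")

print(merged)
```

An inner merge silently drops countries whose names do not match across the two tables, so it is worth checking which rows were lost before analysing the result.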

