Exploratory Data Analysis & Visualization Project
Comprehensive rainfall data analysis for Indian meteorological subdivisions
| Column Name | Description | Data Type | Range/Values |
|---|---|---|---|
SD NO. |
Subdivision ID number | Numeric | 1-36 |
SD_Name |
Name of meteorological subdivision | Text | 36 Indian regions |
YEAR |
Calendar year of measurement | Numeric | 1951-2014 |
| JAN | January rainfall | Numeric | 0-300.8 mm |
| FEB | February rainfall | Numeric | 0-363.7 mm |
| MAR | March rainfall | Numeric | 0-353.9 mm |
| APR | April rainfall | Numeric | 0-551.5 mm |
| MAY | May rainfall | Numeric | 0-973.1 mm |
| JUN | June rainfall | Numeric | 0.4-1432.8 mm |
| JUL | July rainfall | Numeric | 5.6-1884.9 mm |
| AUG | August rainfall | Numeric | 4.0-1664.6 mm |
| SEP | September rainfall | Numeric | 0.1-1034.8 mm |
| OCT | October rainfall | Numeric | 0-669.4 mm |
| NOV | November rainfall | Numeric | 0-583.0 mm |
| DEC | December rainfall | Numeric | 0-500.7 mm |
| ANNUAL | Total annual rainfall | Numeric | 86.5-5553.9 mm |
| JAN-FEB | Winter season rainfall | Numeric | 0-545.7 mm |
| Mar-May | Pre-monsoon rainfall | Numeric | 0-1172.3 mm |
| Jun-Sep | Monsoon season rainfall | Numeric | 79.4-4537.0 mm |
| Oct-Dec | Post-monsoon rainfall | Numeric | 0-1133.4 mm |
The dataset covers all 36 meteorological subdivisions of India, spanning diverse climatic zones:
64 years of continuous rainfall measurements providing a rich historical context:
Comprehensive data preprocessing performed:
Comprehensive Exploratory Data Analysis of Indian Rainfall Patterns (1951-2014)
Data Points
Years of Data
Regions
Variables
pip install jupyter pandas numpy matplotlib seaborn scipy scikit-learn plotly folium
Rainfall EDA Dashboard.html in browserpython -m http.server 8000http://localhost:8000Average Annual Rainfall
Maximum Recorded
Minimum Recorded
Outlier Events
Predicted annual rainfall based on 64 years of historical data trends.
| Metric | Value | Interpretation |
|---|---|---|
| Mean (Annual) | 1,411.2 mm | Overall average central tendency of rainfall |
| Median (Annual) | 1,385.5 mm | Middle value, less affected by extreme events |
| Standard Deviation | 185.4 mm | Measure of rainfall variability over time |
| Skewness | 0.452 | Slightly positive skew (occasional high-rainfall years) |
| Kurtosis | -0.124 | Platykurtic distribution (flatter than normal) |
| Confidence Interval (95%) | ± 45.8 mm | Range of probable future annual values |
No significant difference in rainfall patterns before and after 1980 (p-value: 0.349). Linear regression shows a stable trend with minimal variation over the 64-year period.
13 extreme rainfall events identified, primarily in Arunachal Pradesh and Coastal Karnataka. The highest recorded rainfall was 5,554 mm in Coastal Karnataka (1961).
June-September period (Monsoon) contributes approximately 75% of annual rainfall, with July being the peak month averaging 343 mm.
Coastal Karnataka, Arunachal Pradesh, and Kerala receive the highest rainfall, while Rajasthan and Gujarat regions experience the lowest precipitation.
K-means clustering identified 4 distinct rainfall patterns across regions, grouping areas with similar precipitation characteristics for better regional planning.
Periodic fluctuations observed every decade highlight long-term climatic cycles, indicating distinct wet and dry epochs influencing overall rainfall magnitude.
Loaded rainfall dataset from 1951-2014, handled missing values using median imputation, and performed data validation to ensure quality.
Calculated summary statistics, identified data distributions, and performed initial exploratory analysis to understand data characteristics.
Applied t-tests for comparing time periods, z-score analysis for outlier detection, and correlation analysis for variable relationships.
Utilized linear regression for trend analysis and K-means clustering for regional grouping based on rainfall patterns.
Created interactive charts and plots using multiple visualization libraries to effectively communicate insights and patterns.
Summarized key findings and trends into a final report, providing actionable insights for agricultural planning and water resource management.