Skip to content

Pranav-MSK/cyclistic-bike-share-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cyclistic Bike-Share Analysis 🚲

Project Overview

This project analyzes Cyclistic (Divvy) bike-share data to understand behavioral differences between annual members and casual riders. The goal is to generate data-driven insights and recommendations to help Cyclistic increase annual memberships.

This case study follows the Ask → Prepare → Process → Analyze → Share → Act data analysis framework.


Business Question

How do annual members and casual riders use Cyclistic bikes differently, and how can these insights inform a strategy to convert casual riders into annual members?


Data Sources

  • Divvy Trips Q1 2019
  • Divvy Trips Q1 2020

The data includes ride-level information such as start/end times, station names, and rider type.
The datasets were provided through the Google Data Analytics Professional Certificate curriculum.

Data license: Divvy Data License Agreement


Tools & Technologies

  • R
  • RStudio
  • tidyverse
  • lubridate
  • R Markdown
  • GitHub

Project Structure

cyclistic-analysis/
│
├── data/
│ └── raw/
│ ├── Divvy_Trips_2019_Q1.csv
│ └── Divvy_Trips_2020_Q1.csv
│
├── outputs/
│ └── cyclistic_analysis_report.html
│
├── scripts/
│ ├── cyclistic_analysis.R
│ └── cyclistic_analysis_report.Rmd
│
├── visuals/
│ └── Customer type usage behaviour.png
│ └── Number of rides by days of week.png
│ └── Ride length comparison.png
│
├── .gitignore
└── README.md

Data Cleaning & Processing

Key preparation steps include:

  • Standardizing column names across datasets
  • Converting timestamps to POSIXct format
  • Unifying rider types into member and casual
  • Removing invalid and zero-length rides
  • Calculating ride length in minutes
  • Removing extreme outliers (rides longer than 24 hours)

Key Insights

  • Casual riders take significantly longer trips than annual members
  • Members ride more frequently, especially on weekdays
  • Casual riders peak on weekends, indicating leisure-focused usage

Recommendations

  • Target high-engagement casual riders with membership promotions
  • Run weekend-focused marketing campaigns
  • Position annual membership as a cost-effective lifestyle upgrade

Reproducibility

The full technical analysis is documented in a reproducible R Markdown report:

📄outputs/cyclistic_analysis_report.html


Data License & Attribution

This project uses Divvy bike-share data provided by Lyft Bikes and Scooters, LLC and the City of Chicago under a public data license.

The data is used strictly for non-commercial, analytical, and educational purposes.
Raw data files are not redistributed in this repository.

Original data source:
https://www.divvybikes.com/system-data


Author

Pranav M S Krishnan
Aspiring Data Analyst


Note:
This is a practice case study completed as part of the Google Data Analytics Professional Certificate to apply R programming and data analysis concepts in a real-world scenario.

About

Data analysis of Cyclistic (Divvy) bike-share data using R to compare usage patterns between annual members and casual riders and generate business recommendations.

Topics

Resources

Stars

Watchers

Forks

Contributors