I'm Punya Ira Anand
Software Engineer
LinkedIn Tableau Github Leetcode HackerRank Instagram
Experienced Software Engineer with a 4+ year track record of using data-driven insights to steer business strategy. Specialized in business intelligence, statistical analysis, data visualization, automation, and predictive modeling.
Wipro Technologies is a global technology consulting and digital solutions company, providing end-to-end IT services to clients across various industries.
GPA: 3.8
GPA: 3.7
Below are data analytics projects, each with a brief description of the technology stack used.
Below is the data analytics project developed using AWS and its services.
Steps: Develop a data pipeline in AWS to manage, streamline, and transform structured and unstructured data from 200+ trending YouTube video datasets, then analyze it by category and trending metrics.
Skills: Python, Spark, AWS (S3, IAM, Glue, Athena, Lambda, QuickSight, AWS CLI)
• Project Learnings - Data Ingestion, Data Lake, ETL Operations, Cloud Reporting
• Description: Developed a data pipeline in AWS to manage, streamline, and transform structured and unstructured data from 200+ trending YouTube video datasets, then analyzed it by category and trending metrics.
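As a rough illustration of the transform step, the sketch below flattens a nested category JSON document into a lookup table and joins it to tabular video rows. It runs locally on inlined sample data rather than on S3, Glue, or Lambda, and the field names (items, snippet, category_id) and the sample rows are assumptions made for the example, not the project's actual schema.

```python
import json
from collections import Counter

# Hypothetical category reference file (semi-structured JSON), inlined for the sketch.
RAW_CATEGORIES = json.dumps({
    "items": [
        {"id": "10", "snippet": {"title": "Music"}},
        {"id": "24", "snippet": {"title": "Entertainment"}},
    ]
})

# Hypothetical structured rows, as they might arrive from a CSV of trending videos.
VIDEOS = [
    {"video_id": "a1", "category_id": "10", "views": 1200},
    {"video_id": "b2", "category_id": "24", "views": 300},
    {"video_id": "c3", "category_id": "10", "views": 800},
]


def flatten_categories(raw_json):
    """Turn the nested JSON into a flat {category_id: title} lookup."""
    doc = json.loads(raw_json)
    return {item["id"]: item["snippet"]["title"] for item in doc["items"]}


def videos_per_category(videos, categories):
    """Count videos per category title; unknown ids are grouped as 'Unknown'."""
    counts = Counter()
    for row in videos:
        counts[categories.get(row["category_id"], "Unknown")] += 1
    return dict(counts)


categories = flatten_categories(RAW_CATEGORIES)
summary = videos_per_category(VIDEOS, categories)
print(summary)
```

In a cloud setup, the same flattening logic would typically sit between the raw and cleansed layers of the data lake, so that the query layer only sees tabular data.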
The project involves setting up an ETL (Extract, Transform, Load) pipeline to extract data from Twitter using Tweepy, store the data in AWS S3 storage, and then load the formatted data into a Redshift data warehouse for sentiment analysis.
Steps: Setting up AWS infrastructure with S3 storage and Redshift, creating a Python script using Tweepy to extract 1 million tweets and store them securely in S3, formatting and loading the data into Redshift for sentiment analysis, and automating the entire process with Airflow DAGs to ensure orderly task execution.
Skills: S3, Redshift data warehouse, Python, DAGs
• Project Learnings - By leveraging Airflow and AWS services, the project streamlines the process of data ingestion, processing, and analysis, making it easier and faster for analysts to access and analyze Twitter data.
• Description: To automate and run scheduled jobs, the project uses Airflow's DAG (Directed Acyclic Graph) concepts to orchestrate the pipeline.
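The ordering guarantee that a DAG provides can be sketched without Airflow itself. The example below uses Python's standard-library graphlib module instead of Airflow, purely to show how declared dependencies determine run order. The task names and the placeholder callables are hypothetical stand-ins for the real extract, upload, format, and load steps.

```python
from graphlib import TopologicalSorter

# Hypothetical task names; each task maps to the set of tasks it depends on.
PIPELINE = {
    "extract_tweets": set(),
    "upload_to_s3": {"extract_tweets"},
    "format_data": {"upload_to_s3"},
    "load_to_redshift": {"format_data"},
}


def execution_order(dependencies):
    """Return a run order in which every task follows its dependencies."""
    return list(TopologicalSorter(dependencies).static_order())


def run_pipeline(dependencies, tasks):
    """Call each task in dependency order and collect the results."""
    results = {}
    for name in execution_order(dependencies):
        results[name] = tasks[name]()
    return results


# Placeholder callables standing in for the real pipeline steps.
TASKS = {name: (lambda n=name: f"{n}: done") for name in PIPELINE}

order = execution_order(PIPELINE)
results = run_pipeline(PIPELINE, TASKS)
print(order)
```

Because the graph must be acyclic, a circular dependency is rejected up front (graphlib raises CycleError), which is the same property that lets a scheduler run tasks in a well-defined order.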
Below is the data analytics project developed using Tableau.
Steps: Data loading, data cleaning and preprocessing, exploratory data analysis (EDA), and building a Tableau dashboard.
Skills: Calculated fields, LOD expressions
• Analyzed COVID-19 data using Tableau to identify pandemic trends, geographical spread, and healthcare resource allocation.
• Description: The dataset contains records of COVID-19 cases, deaths, and vaccinations by country for 2020-2021.
Below is the data analytics project developed using Python.
Skills: Data cleaning, data analysis, data visualization, predictive analysis
Technology: Python, Jupyter Notebook, Pandas, Seaborn, Matplotlib
• Description: Developed a data visualization project using Python to analyze job application trends, providing actionable insights.
• Converted raw data from a pickle file into structured CSV format, leveraging object-oriented programming.
• Used the pandas library for data manipulation and the Matplotlib and Seaborn libraries for visualizations highlighting key metrics such as application volume, job types, and temporal trends.
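The pickle-to-CSV conversion described above might look roughly like the sketch below. The JobApplication class, its fields, and the sample records are hypothetical, since the real pickle's structure is not given here, and the example builds its pickle in memory so it runs without any files.

```python
import csv
import io
import pickle
from dataclasses import asdict, dataclass, fields


@dataclass
class JobApplication:
    """Hypothetical record shape; the real pickle's fields are not given."""
    company: str
    job_type: str
    applied_on: str  # ISO date string


def load_applications(pickled_bytes):
    """Unpickle a list of dicts and wrap each one in a JobApplication."""
    rows = pickle.loads(pickled_bytes)
    return [JobApplication(**row) for row in rows]


def to_csv(applications):
    """Serialize the records to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=[f.name for f in fields(JobApplication)],
        lineterminator="\n",
    )
    writer.writeheader()
    for app in applications:
        writer.writerow(asdict(app))
    return buffer.getvalue()


# Made-up sample data, pickled in memory for the sketch.
sample = pickle.dumps([
    {"company": "Acme", "job_type": "Full-time", "applied_on": "2023-01-15"},
    {"company": "Globex", "job_type": "Internship", "applied_on": "2023-02-01"},
])
csv_text = to_csv(load_applications(sample))
print(csv_text)
```

Note that pickle.loads can execute arbitrary code, so it should only be used on files from a trusted source.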
Below is the SQL project for data querying and analysis.
Skills: MySQL, MS SQL Server, pgAdmin 4
• Description: Designed a database schema in PostgreSQL and normalized the data for storage, retrieval, and manipulation.
• Developed SQL queries on Oracle Database to provide insights into café performance and customer behavior.
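To give a flavor of a normalized schema and an insight query, here is a small sketch using Python's built-in sqlite3 module in place of PostgreSQL or Oracle. The table names, columns, and sample rows are all hypothetical and are not taken from the project's actual database.

```python
import sqlite3

# In-memory SQLite database, used here only as a stand-in for the real server.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customers (
    customer_id INTEGER PRIMARY KEY,
    name TEXT NOT NULL
);
CREATE TABLE products (
    product_id INTEGER PRIMARY KEY,
    name TEXT NOT NULL,
    price REAL NOT NULL
);
-- Orders reference customers and products by key instead of repeating their details.
CREATE TABLE orders (
    order_id INTEGER PRIMARY KEY,
    customer_id INTEGER NOT NULL REFERENCES customers(customer_id),
    product_id INTEGER NOT NULL REFERENCES products(product_id),
    quantity INTEGER NOT NULL
);
INSERT INTO customers VALUES (1, 'Asha'), (2, 'Ben');
INSERT INTO products VALUES (1, 'Latte', 4.0), (2, 'Muffin', 3.0);
INSERT INTO orders VALUES (1, 1, 1, 2), (2, 1, 2, 1), (3, 2, 1, 1);
""")

# Revenue per product, highest first.
revenue_by_product = conn.execute("""
    SELECT p.name, SUM(o.quantity * p.price) AS revenue
    FROM orders AS o
    JOIN products AS p ON p.product_id = o.product_id
    GROUP BY p.name
    ORDER BY revenue DESC
""").fetchall()

# Number of orders per customer, as a simple behavior metric.
orders_by_customer = conn.execute("""
    SELECT c.name, COUNT(*) AS order_count
    FROM orders AS o
    JOIN customers AS c ON c.customer_id = o.customer_id
    GROUP BY c.name
    ORDER BY order_count DESC
""").fetchall()

print(revenue_by_product)
print(orders_by_customer)
```

Keeping prices in the products table and names in the customers table means each fact is stored once, and the joins reassemble it at query time.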
Below is the HDFS project for data querying and analysis.
Skills: Hive, Impala, HDFS, Tableau
• For the fleet manager of AZ National Trucking, the foremost challenge is ensuring adherence to company regulations in order to reduce insurance risk. This means tackling speeding, unsafe following distances, lane departure incidents, and other hazardous driving practices among fleet drivers.
• Description: The dataset contains 7 data files, including Truck, Truck mileage, Geolocation, and more.
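One way such driving events could be turned into a comparable risk measure is sketched below in plain Python rather than Hive or Impala. The event labels, driver ids, mileage figures, and the "risky events per 1,000 miles" metric are all assumptions made for illustration and are not drawn from the dataset.

```python
from collections import defaultdict

# Hypothetical event rows: (driver_id, event_type).
EVENTS = [
    ("D1", "Overspeed"),
    ("D1", "Lane Departure"),
    ("D1", "Normal"),
    ("D2", "Normal"),
    ("D2", "Unsafe Following Distance"),
]

# Hypothetical total miles driven per driver.
MILES = {"D1": 1000, "D2": 2000}

# Event types counted as hazardous in this sketch.
RISKY = {"Overspeed", "Lane Departure", "Unsafe Following Distance"}


def risk_per_1000_miles(events, miles):
    """Return risky events per 1,000 miles driven for each driver."""
    counts = defaultdict(int)
    for driver, event in events:
        if event in RISKY:
            counts[driver] += 1
    return {driver: 1000 * counts[driver] / total for driver, total in miles.items()}


scores = risk_per_1000_miles(EVENTS, MILES)
print(scores)
```

Normalizing by mileage keeps drivers with long routes from looking riskier simply because they spend more time on the road.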
Below is the project for which UML diagrams were designed.
Skills: Figma, Jira, Confluence, Microsoft Visio
• Simplify urban parking with a seamless platform for discovering and booking spaces while boosting revenue for space owners.
• Park In transforms parking from a stressful daily chore into a convenient and profitable experience. Our solution not only enhances individual lives but also contributes to better urban living. We ensure full compliance with technological and regulatory standards, leveraging cutting-edge digital tools to keep our platform ahead of the competition.
Dallas, TX, USA