My Portfolio
Data Engineer with 4+ years of experience specializing in cloud-based data solutions and ETL pipeline development. Expert in Azure, AWS and Databricks technologies with multiple current certifications.
Email: parthnvalani@gmail.com
Phone: (807) 357-6372
Location: Toronto, ON
LinkedIn | Medium
👨‍💼 Work Experience
Deloitte
Consultant - Data Engineer
Jan 2022 - Present (Remote)
- Designed and implemented an end-to-end data solution in Azure Databricks with PySpark and SQL for a major telecom company’s marketing campaign, resulting in an increase of up to $2 million in cross-selling revenue.
- Developed an AWS-based ETL pipeline that enabled cleanroom data quality checks, profiling, processing, and modeling to produce KPI-driven metrics, contributing to a 30% growth in client sales.
- Created and orchestrated data validation checks for external vendor data within Dynamics 365 CE using an Azure Synapse pipeline, ensuring seamless integration and robust discrepancy logging.
- Generated interactive visualizations in Power BI, revealing key insights on location proximity and customer overlap across provinces, leading to a 25% improvement in strategic marketing decisions.
Tata Consultancy Services
Software Developer
Oct 2021 - Dec 2021 (Remote)
- Collaborated on designing and launching real-time data streaming pipelines with Kafka and Airflow, achieving a 40% reduction in data processing latency.
- Reduced expenses by 95% by managing and auto-scheduling a Databricks cluster to match Collibra’s demands.
- Diagnosed critical scheduling conflicts within Control-M workflows and implemented targeted optimizations that reduced manual intervention time to less than five hours per week for the entire team.
Bluebits Technologies
Data Scientist (Intern)
Nov 2018 - Jun 2019 (Surat, Gujarat, India)
- Built an ML model with 97% accuracy, together with fellow data scientists, to predict flight ticket sales from seasonal data using Azure components.
- Wrote complex PostgreSQL queries to extract and analyze flight ticket sales data from various relational databases; identified key trends and patterns that resulted in a $10K monthly increase in ticket sales.
🛠 Skills & Interests
- Cloud & Data Engineering: Microsoft Azure (Data Lake Storage, Synapse Analytics, Data Factory, Data Share, DevOps, PowerApps, Dynamics 365), AWS (S3, Glue, CLI), Databricks (Delta Lake, Unity Catalog), Snowflake, Apache Kafka, Apache Airflow, Control-M, Data Modeling, Data Warehousing, Data Governance
- Programming & Databases: Python, SQL, PySpark, SSMS, PostgreSQL, Git, Docker
- Data Analytics & Visualization: Pandas, NumPy, Scikit-learn, Power BI
- Interests: Reading non-fiction books, traveling, playing badminton, listening to EDM and film score music
📜 Certifications
- Databricks Certified Associate Developer for Apache Spark 3.0
- Microsoft Certified Azure Data Engineer Associate (DP-203)
- AWS Certified Machine Learning - Specialty
- Microsoft Certified Azure AI Fundamentals (AI-900)
- AWS Certified Cloud Practitioner
🎓 Education
Lakehead University
MSc, Computer Science
Sep 2019 - Apr 2021 (Thunder Bay, Ontario)
- Directed an orientation team introducing 200+ international students to social and academic aspects of campus life; increased student engagement by 40% and retention rates by 20%.
L.D. College of Engineering
BE, Information Technology
Aug 2015 - Jun 2019 (Ahmedabad, India)
- Ranked 215th in the ‘Indian Engineering Olympiad’, a national-level engineering examination.
Thank you for visiting my GitHub profile! Feel free to reach out if you have any questions or collaboration opportunities.