About
Data Scientist & Backend Engineer
I am Chia-En Wu, a data engineer passionate about distributed systems, cloud computing, and hands-on problem-solving. I enjoy applying programming to optimize data processing, enhance system efficiency, and tackle real-world challenges through practical implementation.
- Birthday: 8 February 2002
- Email: wu28hk@gmail.com
- Phone: 0966488012
- City: Taipei, Taiwan
- Age: 23
- Degree: B.B.A. in Big Data Management
- University: Soochow University
Skills
Python 70%
PHP 50%
SQL 40%
HTML 70%
CSS 60%
JavaScript 40%
Resume
Contact Information
CHIA-EN WU
TEL: (+886)966488012
Email: wu28hk@gmail.com
GitHub: github.com/Terrywu0208
Education
B.B.A., Major in Big Data Management (2020.09 - 2024.06)
Soochow University, Taiwan
GPA: 3.9/4
Exchange Program (2023.02 - 2023.06)
School of Data Science, Fudan University
Related Courses: Neural Networks and Machine Learning, Time Series Analysis, NLP
Professional Experience
Data Engineering Intern at Tencent Technology Co. Ltd. (2023.07 - 2023.09)
- Assisted in structuring ClickHouse with data layering, source analysis, and hot/cold analysis. Collaborated across departments to optimize data usage, reducing waste and improving platform efficiency by 35%.
- Developed a monitoring system for real-time tracking of model operations, server tasks, and databases. Built its key features on Kafka to provide low latency, low resource use, and automatic alerting.
- Engineered a big data dashboard with RESTful APIs and several control panels, allowing managers to monitor tasks, execute commands, check timeout events, and track failures, detecting issues early and reducing their impact.
Data Analysis Intern at Advant Analytics Tactics Ltd. (2022.06 - 2023.06)
- Implemented GPU-based data parallelism (DP) to optimize genetic algorithms for scheduling a large volume of assignments, achieving a 65% speedup in runtime.
- Developed a scalable and reliable parallel web crawler using Python, Docker, and RabbitMQ on AWS EC2. The system monitors crawler activity to improve evasion of anti-bot measures and to resolve errors quickly.
- Trained NLP models that speed up pattern searching through IBM Watson Discovery.
- Built a dashboard with Tableau and MicroStrategy to analyze telecom equipment logs, identify outliers and errors, and improve equipment maintenance efficiency.
Project Research Assistant at Brinno Inc. (2021.06 - 2021.09)
- Automated news extraction using Python scraping, scheduled tasks on Heroku, and stored data in Google BigQuery. Linked Google Analytics to a dashboard to visualize product relevance and media impact.
- Maintained the company website with PHP Laravel, optimizing shopping cart, order, and warranty pages. Integrated Google Analytics to gather data, enhance performance, and inform marketing strategies.
- Collected product-related data, including launches and reviews, through the YouTube API, and analyzed customer behavior with Google Analytics and BigQuery to plan new marketing tactics.
Extracurricular Experience
Taipei 101 Boutique Marketing Project - Research Assistant (2022.03 - 2023.01)
- Created a marketing analytics platform for Taipei 101 as a full-stack developer, using Bootstrap, Node.js with Express, and MariaDB to analyze department store consumption data and generate insights.
- Deployed a marketing analytics dashboard using Nginx as a reverse proxy and Docker for consistency. Configured a Linux server for security and performance, enabling scalability and efficient resource utilization for real-time insights.
- Analyzed consumption and customer information by combining RFM analysis with K-means clustering using the Elbow method for segmentation. Assessed CLV (Customer Lifetime Value) to develop tailored membership programs.
KPMG Digital Audit Platform Project - Full-Stack Developer (2022.01 - 2022.07)
- Developed a digital audit platform that improved data analysis efficiency by 60%. Built responsive web pages with Bootstrap and jQuery, exposed RESTful APIs via AWS Lambda and API Gateway, hosted the platform on AWS EC2, and managed data with MariaDB.
- Extracted tables from financial PDFs using pdfplumber and performed OCR with Tesseract. Collaborated with accountants to apply audit logic for improved accuracy and efficiency.
- Built a Python Scrapy web scraper to automate retrieval of financial reports from the Taiwan Stock Exchange, scheduled with AWS Lambda and CloudWatch for efficient data collection.
Innovation and Research
2022 Intelligent Innovation and Interdisciplinary Creation Contest - Honorable Mention (2022.06 - 2022.10)
- Architected an iOS app that detects falls and activates a drone to verify incidents. The drone captures and sends video to the app while alerting family members, enhancing response times and ensuring timely assistance.
- Designed a home care monitoring system with video recognition and a mobile app, using MediaPipe Holistic for training data, a CNN for fall posture detection, and gyroscope-based fall alerts from three-axis acceleration data.
- Developed an iOS app using Swift and SwiftUI for fall recognition alerts, creating models from gyroscope data. Utilized APNs (Apple Push Notification service) to send real-time alerts to designated family members.
Research Publications
Practical Impact of ChatGPT in Introduction to Computer Science Course (2024)
Proceedings of the 28th Global Chinese Conference on Computers in Education (GCCCE)
- Co-authored with Chia-Hao Chiu
- Examined exam performance and real learning outcomes
- Available at: Conference Proceedings