I'm a final-year computer science student based in Selangor, Malaysia, with 3 years of computer science education. My focus is on data analytics and machine learning. I'm skilled in Python, SQL, and data processing and visualization tools.
Currently learning 🦜️🔗 LangChain
About Me?
Currently pursuing a Bachelor's in Computer Science, passionate about data and AI ❤️. Leveraging my strong math and programming skills to deliver data insights. Eager to apply my analytical capabilities through an internship. A dedicated student, I've been named to the Dean's List 6 times for strong academic performance. I recently started blogging on Medium. So far I've completed over 22 courses and the Data Scientist track on DataCamp!
Experience
Data Steward
Maven Advanced Ventures Sdn Bhd, Petaling Jaya, Selangor – (Aug 2022 - Feb 2023)
- Scripted and implemented data processing/cleansing and validation procedures to identify and correct errors and inconsistencies in census and electoral data sets
- Ensured that election and demographic data was maintained to a high standard of quality, accuracy, and completeness, and that data was consistent and reliable across the organization
- Utilized Microsoft Excel and Python to manipulate and analyze large data sets, identify errors and inconsistencies, and develop data cleansing and validation procedures
- Maintained an understanding of data-related regulations and compliance requirements and ensured that data was managed accordingly
Education
Bachelor of Computer Science (Honors)
2021 - present
International Islamic University Malaysia (IIUM), Gombak, Malaysia - Expected 2025*
- Majoring in Data Science and Computational Intelligence
- Relevant Coursework: Data Structures, Algorithms, Probability & Statistics, Machine Learning, Deep Learning
- Awards: Dean's List (6 semesters), Best Group Project in Probability & Statistics
Skills
Problem Solving
I have strong problem-solving abilities and make effective use of online resources like Google, Stack Overflow, and ChatGPT. I excel at breaking down complex issues into discrete steps and using critical thinking to develop effective solutions. My research skills allow me to quickly find relevant information to tackle technical blockers. I believe the ability to solve problems efficiently is a crucial skill in today's fast-paced world.
Python
My primary programming language for 3 years. I'm proficient in using Python for data analysis and machine learning, including NumPy, Pandas, Matplotlib, Scikit-Learn, and TensorFlow. I've leveraged these libraries to build predictive models, neural networks, and other algorithms to extract meaningful insights from data.
Data Visualization
I'm skilled in communicating key insights through intuitive charts, graphs, and dashboards using tools like Tableau, Power BI, and Matplotlib. I focus on effective visualization design to transform complex data into digestible, actionable information for stakeholders.
SQL
I have extensive experience with PostgreSQL, MySQL, and other SQL dialects. I'm adept at writing optimized queries to wrangle large datasets, creating normalized relational databases for analytics, and implementing best practices for performance and scalability.
Machine Learning
I have hands-on experience building, training, and evaluating machine learning models using Python's Scikit-Learn and TensorFlow. I've created models like logistic regression, random forests, and neural networks to solve problems including classification, prediction, and clustering.
Note Taking
I maintain excellent project and research notes using an iPad Pro, Google Drive docs, and Notion. My notes are organized and concise, and they capture key details, analysis, action items, and takeaways. Strong note-taking skills allow me to efficiently document information, sketch ideas, retain knowledge, and share context across teams. I leverage tools that keep notes searchable and easy to reference later. This enables me to trace back timelines, ensure important information is saved for future use, and streamline collaboration.
Projects
- Automated the extraction of data from financial documents into Google Sheets.
- Analyzed the patterns and characteristics of the spread of misinformation related to COVID-19.
- Developed machine learning models to group crime cases by their characteristics with high accuracy.
- Predicted the rate of university student dropouts using supervised machine learning algorithms.
- Constructed the data model and dashboard on quality of care for rural populations for a datathon competition.
- Applied and compared KNN and ANN algorithms to identify the impacts of protein expression levels on mice.
- Classified multilingual news articles by main topic using NLP and the KNN algorithm.
Articles and Blogging
Achievement
- Phishing URL Detection Using KNN Model
- RiverRescue
- Easy Clinic System
- Parking Lot Automation Project
- Social Anxiety Disorder's Impact on Academic Performance
- Sentiment Analysis for Antipodean Cafe Reviews
- Data Integration Project
- Predicting Heart Disease
- Multilingual News Article Classification