Chong Dang
1725 Bay Ridge Pkwy, Brooklyn, NY, 11204 | 1-(631)590-0688 | rickydangc@yahoo.com
CORE QUALIFICATIONS
• Extensive working experience in Data Science, Machine Learning, Deep Learning, Data Mining, Predictive
Modeling, Recommendation Systems, ETL Development and Data Visualization
• Comprehensive programming skills in Python 2/3, R, MATLAB, SQL, Scala, Bash, JavaScript, HTML5, CSS3, C, C# and Java
• Expertise in Supervised Machine Learning algorithms for Regression and Classification, such as Decision Tree, AdaBoost,
Gradient Boosting, XGBoost, Random Forest, Naïve Bayes, KNN, SVM, LDA and Deep Learning methods. Proficient in
Unsupervised Learning methods such as K-Means Clustering and PCA (Principal Component Analysis)
• Skilled in Deep Learning Frameworks: TensorFlow, Keras and PyTorch; Familiar with Deep Learning Models like Neural Networks,
CNN and RNN (LSTMs, GRU)
• Experienced in building Data Warehousing and Extract Transform Load (ETL) pipelines using Spark, Airflow and cloud tools
• Experience in defining project scope across Data Science and Data Analytics projects in collaboration with senior management and clients
• Strong experience in Software Development Life Cycle (SDLC), including Requirements Analysis and Design Specification, in both
Waterfall and Agile methodologies
• Adept in using Python libraries such as Pandas, NumPy, SciPy, Seaborn, Matplotlib, Scikit-learn, Keras, TensorFlow and NLTK
• Experience in using Anaconda Navigator (Jupyter Notebook), PyCharm and RStudio for Python and R programming
• Working knowledge of Big Data technologies like Hadoop, MapReduce, Spark, SparkSQL, HDFS, Hive and HBase
• Expert in designing visualizations using Tableau 10.3, Dash, R-Shiny, Power BI and D3.js
• Experience in using A/B testing, hypothesis testing and ANOVA to evaluate model accuracy
• Professional experience in handling Structured and Unstructured data (Social Media, Texting, Photographs and Videos)
using relational databases like MySQL 5.x and Oracle 11g
• Expert in dealing with big data on NoSQL databases like Cassandra 3.0 and MongoDB 3.2
• In-depth knowledge of Cloud Infrastructure like AWS, GCP and Azure
• Experience in working with version control systems like Git, using source code management client tools like Git Bash and GitHub
• Excellent communication, analytical, interpersonal, and presentation skills; expert at managing multiple projects simultaneously
• Familiar with current industry standards, such as ISO, Six Sigma, and Capability Maturity Model (CMM)
• Good knowledge of JIRA, Microsoft Project, Microsoft Office, WordPress, Photoshop etc.
TECHNICAL SKILLS
• Machine Learning Essentials:
- Regression Models: Logistic Regression, Polynomial Regression, Stepwise, Ridge, Lasso, ElasticNet
- Classification Models: Naive Bayes, Decision Trees, Random Forests, XGBoost, GBDT, AdaBoost, SVM, KNN, LDA
- Unsupervised Learning: Hierarchical Clustering, K-means Clustering, PCA and SVD (Dimensionality Reduction)
- Deep Learning: Neural Networks, CNN, RNN, Graph Neural Network (GNN)
• Packages: NumPy, Pandas, SciPy, Seaborn, Matplotlib, Plotly, Keras, Scikit-learn, NLTK, PyTorch, Beautiful Soup, WordCloud,
TensorFlow, Flask, SQLAlchemy
• BI Tools/Big Data Tools: Tableau, Microsoft Power BI, MicroStrategy, Dash, R-Shiny, Spark (SparkSQL, Spark MLlib, Spark
Streaming, Spark GraphX), Hadoop, MapReduce, Hive
• Report/Document Tools: MS Office 2016, MS Project, JIRA
• Languages: Python 2.7/3, R, SQL, Scala, Pig, HTML, CSS, Linux Shell (CentOS), Markdown
• Database: MySQL, SQL Server, Oracle, PostgreSQL, MongoDB
• Infrastructure: Docker, AWS, Microsoft Azure, Git, Bitbucket, Databricks
EXPERIENCE
Bethpage, NY, Altice USA – Data Scientist 03/2019 – Present
• Project Development: Designed and developed scalable production-level recommendation systems leveraging Machine Learning,
Deep Learning, Natural Language Processing and Statistical Modeling in Python to solve real-world business problems; collaborated