Job Title: Data Scientist for Research Platform
Strong command of the English language in reading, speaking, and writing.
Fluent in R/Python with over 2 years of proven experience.
Job Description:
As a Data Scientist for the Research Platform, you will be an integral part of a team responsible for leveraging data science techniques and methodologies to enhance the capabilities of our research platform. Your expertise in data analysis, machine learning, and statistical modeling will be crucial in extracting valuable insights from complex data sets, improving data quality, and developing innovative tools to support research efforts across various domains.
Key Responsibilities:
-
Data Analysis and Exploration: Conduct exploratory data analysis on diverse datasets to identify patterns, trends, and anomalies. Collaborate with researchers and domain experts to understand data requirements and uncover potential research opportunities.
-
Machine Learning Model Development: Design, develop, and implement machine learning models to support predictive analytics, classification, clustering, and other research-driven tasks. Utilize both supervised and unsupervised learning techniques to extract meaningful information from data.
-
Data Cleaning and Preprocessing: Preprocess and clean raw data to ensure data quality and consistency. Address missing data, handle outliers, and apply appropriate data transformation techniques to prepare data for analysis.
-
Feature Engineering: Identify and engineer relevant features from raw data to improve the performance of machine learning models. Leverage domain knowledge to create meaningful and informative features for specific research needs.
-
Research Tool Development: Build data analysis tools, algorithms, and pipelines that streamline research workflows and enable researchers to efficiently analyze and interpret data. Focus on creating scalable and reusable solutions.
-
Performance Evaluation: Develop methodologies for evaluating the performance of machine learning models and research tools. Conduct rigorous testing and validation to ensure the accuracy and reliability of the outputs.
-
Collaboration and Communication: Collaborate with researchers, data engineers, and other stakeholders to understand their needs and provide data science expertise. Communicate findings and results effectively to both technical and non-technical audiences.
-
Data Visualization: Create informative and visually appealing data visualizations to present research findings, trends, and insights. Use interactive visualizations to facilitate data exploration by researchers.
-
Continuous Learning: Stay up-to-date with the latest advancements in data science, machine learning, and related fields. Explore and implement cutting-edge methodologies and techniques to improve the research platform.
-
Data Privacy and Security: Ensure compliance with data privacy and security regulations when handling sensitive or confidential data. Implement measures to protect data integrity and confidentiality.
Requirements:
- Master’s or Ph.D. in Data Science, Computer Science, Statistics, or a related field.
- Strong background in data analysis, machine learning, and statistical modeling.
- Proficiency in programming languages such as Python, R, or similar, and experience with data manipulation and machine learning libraries.
- Experience in handling and analyzing large-scale and complex datasets.
- Familiarity with research platforms, data repositories, and data integration processes.
- Knowledge of data visualization techniques and tools to communicate complex findings effectively.
- Strong problem-solving skills and the ability to derive meaningful insights from diverse data sources.
- Excellent communication and teamwork skills to collaborate effectively with interdisciplinary teams.
- Experience with cloud computing platforms and distributed computing is a plus.
- Knowledge of research methodologies and practices across various domains is advantageous.