Qualifications
- Bachelors degree in computer science, or data science
- Minimum 3-month internship in a non-academic setting utilizing Python and SQL.
- Strong interpersonal and collaboration skills, including oral and written communication and presentations as evidenced by publications or cover letter.
- Experience with Python and its data analysis ecosystem (pandas, scikit-learn, NumPy)
- Experience with data ETL processes (e.g. SQL queries, reading flat files (CSV/JSON), data cleaning).
- To qualify, applicants must be legally authorized to work in the United States and should not require now, or in the future, sponsorship for employment visa status.
Desired Skills/Experience:
- Ability to access, extract, integrate, create pipelines, and analyze data from a wide variety of sources (e.g., relational databases, text and unstructured data, sensor data, social media data, image and video data, streaming data).
- Experience with any of these: text analytics, social network analysis, natural language processing
- Experience with machine learning, deep learning, and/or predictive modeling.
- Ability to quickly and easily learn new open-source software.
- Experience with static and/or interactive data visualization methods.
- Experience navigating and scripting in a Unix command-line environment.
- Experience with containerization technologies (Docker).
- Collaborative software development experience, including familiarity with version control, testing, code reviews, and/or Agile software development.
- Experience with a compiled programming language (e.g., C, C#, C++, Java).
- Experience with web front-end technologies, including HTML, CSS, JavaScript, and frameworks, such as React and Vue.js.
- Experience with web back-end technologies, including frameworks such as Node.js/Express, Flask, and/or Django.
- Familiarity with SQL and NoSQL databases, including any of these: MySQL, PostgreSQL, SQLite, MongoDB, and Neo4j.
- Experience in cluster and cloud computing environments.