Infinia ML

Data Engineer

Location: Durham, North Carolina
Employment Type: Full-Time
Minimum Experience: Mid-level

Infinia ML is seeking a Data Engineer with a desire to join a new and growing company focused on cutting-edge machine learning and deep learning technology. We provide our customers with immediate business impact through technology solutions that are tailored to their data and business needs. You will work with a dynamic team of researchers, engineers, and data scientists, collaborating on the delivery of new machine learning algorithms and their application to multiple fields, including bioinformatics, natural language processing, image recognition, and business intelligence. We are looking for individuals who are excited about being exposed to new problems and challenges while working in a flexible and versatile environment.

Typical activities will include:
  • Data analysis and pre-processing of client data in Python
  • Work with and process data from many sources and in many formats
  • Work with Data Scientists to ensure data supports building and evaluating machine learning algorithms
  • Create reusable and reproducible experiments based on client data with meticulous tracking and organization
  • Communicate with clients to understand their needs, the data, its schema, inconsistencies, disparities, and quirks
  • Present findings, visualizations, recommendations, and issues to clients to collaborate around solving problems
  • Secure and anonymize data to ensure the privacy and security of our clients' assets
  • Process, characterize, and manage business-critical data
  • Develop tools and libraries that support processing and analysis of datasets
Candidates must have the following characteristics:
  • Design and create scalable and reusable tools for data processing
  • Quickly distill data, its characteristics, and issues into client-ready presentations
  • Quickly catalog fast-moving product requirements into tangible engineering tasks and designs
  • Be instrumental in the future development of our organization's products and infrastructure
  • Work closely with our ML specialists to build applications with a strong focus on data-driven decisions
  • Shape our engineering culture by coming up with ideas, tools, and infrastructure wherever you see a problem to be solved
  • Own projects end to end: from design, through development, to production
Preferred candidates will meet many of the following qualifications:
  • 2+ years of experience with data management tools and formats, including MySQL, PostgreSQL, HDF5, and CSV
  • 2+ years of experience with data management platforms, including S3 and BigQuery
  • 2+ years of development in Python, including tools and libraries such as Jupyter Notebook, NumPy, SciPy, pandas, and matplotlib
  • 1+ years working with AWS and/or other cloud-based infrastructure
  • Exposure to libraries such as scikit-learn, TensorFlow, NLTK
  • Exposure to data management pipeline tools such as Luigi or Airflow
  • Experience deploying TensorFlow, PyTorch, and/or other deep-learning frameworks is a plus
  • Experience discussing ideas with diagrams, code, math, formulas, and matrices 
  • BA/BS in a relevant field (CS, Engineering, Mathematics, etc.) or equivalent hands-on experience
  • (Bonus) Well-informed about big data processing tools and when to use them
And of course:
  • Curiosity. You are committed to understanding how and why things work the way they do
  • A desire to carve out your own role in a fast-moving, agile environment
  • A team player with strong communication skills