You will help implement and optimize NLP and machine learning algorithms for near real-time performance over terabytes of data, and contribute to integrating them into our NLP pipeline. We expect you to work heavily with our app dev team and coordinate research work with product development.
Strong, hands-on Java skills
Familiarity with Hadoop, Maven, Git, and Spark
Experience in distributed systems and/or high-performance algorithms
Masters in CS or equivalent
Bonus: Python, experience in processing very large noisy datasets