Careers

Research Openings

dMetrics develops machine learning and natural language processing (NLP) techniques
to meet both high-precision and high-recall information needs. Staffed with three
research scientists and supported by software engineers, we offer opportunities
for applied research on a variety of language understanding projects: solving
classification and structure prediction problems for fine-grained information
extraction; building robust NLP architectures that integrate syntactic and
semantic components to process very large, noisy data sets; mining and
validating information extracted from natural language data; and annotating
and curating text data.

Successful candidates will initiate and execute applied research projects,
develop innovative algorithms to analyze noisy data, coauthor patents, and stay
current with the academic state of the art.

The candidates should have a strong background in machine learning, data mining,
and/or NLP. A solid research publication record in these areas is a plus. Experience
in processing very large noisy data sets and strong hands-on Java development skills
are both desirable.

Applicants to the Research Scientist position: please include a resume, a short
summary of research lines of interest, and a description of the contributions you
can make to these lines of research.

Note: If you are an NLP, ML, or data mining researcher interested
in joining our team and your profile does not perfectly fit the positions listed below,
don't hesitate to contact us anyway. We have tons of extraordinary challenges! We look
forward to hearing from you!

WHY SHOULD YOU JOIN DMETRICS?

  • Be part of a team that is revolutionizing how computers analyze text.
  • Build algorithms, SaaS, and APIs to analyze terabytes of online chatter to generate the most accurate, unbiased analysis of decisions people make about products.
  • Explore large-scale content aggregation, data processing, and storage. Provide leadership, code discipline, and project design by example.
  • Work closely with others to build a solid product stack. We believe in giving each employee ownership over an important part of our technology, product, and service.
  • Get to work with top-notch researchers in machine learning and natural language processing.
  • Be a true partner in our project (i.e. meaningful equity).
  • Earn a reasonable salary, bonuses, medical, and other benefits.
  • Work from NYC or Tel Aviv. Our NYC headquarters overlooks McCarren Park, a convenient 4-minute walk from the Bedford L and a 5-minute train ride from the East Village.
  • Enjoy an informal and friendly office atmosphere.
  • Note: we are primarily a Java shop (plus some Python, Scala and R). Our infrastructure is EC2-based and we rely on Maven, Jetty, Solr, MongoDB, and MySQL. We hope that you are well-versed in these technologies, but are comfortable changing if a better alternative comes along. Our development environment supports both OS X and Ubuntu.

CONTACT

Send your resume to: jobs@dmetrics.com. We accept internship applications. Refer a friend: they get hired, you get $500.


POSITION DESCRIPTIONS

SOFTWARE ENGINEER, FRONTEND / UX

Engineer sleek solutions for visualization, presentation, and interaction with high volumes of semantically-rich longitudinal business intelligence data.

Develop beautiful and easy-to-use web applications that render consistent output.

Your users will range from Fortune 100 companies to individuals using their iPhones. Educate us on the possibilities. We do not need to support older browsers, so you are free to unleash your creative knowledge.

You are passionate about UX, love the challenge of designing elegant UIs for non-tech-savvy users, and are creative with both code and markup.

Experience with:

  • Microframeworks such as Flask or web.py, or Django
  • (X)HTML, HTML5, CSS, JavaScript, cross-browser techniques
  • iOS, GWT (bonus).

SOFTWARE ENGINEER, BACKEND

Use your expert Java skills to solve a wide variety of engineering challenges, ranging from data flow, to storage, to aggregation, to supporting APIs and the presentation layer.

Design and build high-load web applications and service-oriented systems for storing, processing, and searching a very large volume of unstructured text.

Implement a SaaS information delivery architecture. Deploy to and manage apps in the cloud.

BONUS:

  • Knowledge of machine learning, statistics, natural language processing.
  • Designing and consuming RESTful web service APIs.
  • Hands-on experience with Hadoop / MPI or equivalent.
  • Experience with cloud deployment.
  • Experience with Python, Ruby, bash/unix tools.

COMPUTER SCIENCE, LINGUISTICS, AND HUMANITIES INTERNS

Want to get experience while avoiding the corporate monkey position? We have a few
projects that would love your attention, in data exploration, analytics, annotation,
and more.

ALGORITHMS ENGINEER

We take great pride in running our machine learning and NLP algorithms on real-world datasets at real-world speeds. A successful candidate will:

  • Help researchers implement and optimize their algorithms for near real-time performance over terabytes of data, and integrate these algorithms into our overall NLP pipeline.
  • Work heavily with our application development team and coordinate research work with product development.
  • Have a Master's in CS (or equivalent), with experience in distributed systems and/or high-performance algorithms.

RESEARCH SCIENTIST

We offer research scientist and internship positions to address challenging problems including but not limited to:

DISCOURSE PROCESSING

Our research targets complex semantic tasks spanning multiple sentences and involving first-order as well as higher-order relations. Research work in discourse processing will focus on enlarging the scope and improving the recall of information extraction by modeling discourse constructs. The ideal candidate will be an expert in structure prediction for relation extraction, with a working knowledge of syntactic and shallow semantic parsing and experience in weakly supervised structure prediction.

DATA MINING

Validating pieces of information extracted from our data is business-critical. Research work in data mining will focus on assessing both the predictive power and the novelty of extracted information. In addition to validating extracted information, the ideal candidate will provide feedback to the natural language processing research team as to which pieces of information to extract in order to gain further statistical insights.

DATA ANNOTATION AND CURATION

Achieving the consistent annotation needed to train our target statistical models is one of our core research tasks. The ideal candidate will implement annotation procedures, possibly relying on crowd or community sourcing. Additional tasks include curating linguistic resources relevant to information extraction tasks. A background in linguistic analysis of real-world text data is required. Working knowledge of NLP tools is a plus.

SEMANTIC PARSING

Mapping textual data to representations of meaning suitable for data aggregation is one of our core objectives. The ideal candidate will design broad-coverage semantic representations that are deep enough to capture linguistic phenomena relevant to data aggregation, and build statistical models to learn these representations given limited supervision.