Stash Talent Services

Stash Talent Services

Website

AI/ML Engineer-W2 Only

Role

AI/ML Engineer-W2 Only

Location

US

Job type

Full-time

Posted

1 month ago

Share this job

Salary

$70 - $80/HOUR

Job description

Title: AI/ML Engineer

Duration: 12+ months

Location: 4 days onsite NYC

Job Summary

We are seeking a highly skilled Senior Developer to lead the development of a Python-based platform that ingests internal data sources (e.g., Hadoop, REST APIs) and applies locally deployed language models (LLMs) for text analysis, including issue classification and summarization. This role requires a blend of strong engineering expertise in building scalable data systems and good communication skills.

Key Responsibilities

Design & Development

  • Architect and implement a robust data ingestion pipeline using Python.
  • Integrate with Hadoop and/or internal APIs for sourcing structured and unstructured data.
  • Design modular components for data transformation, enrichment, and routing to downstream NLP models.

LLM Integration

  • Incorporate local LLM models for classification and summarization tasks.
  • Use prompt orchestration, chaining, and context-aware techniques to improve NLP accuracy and consistency.
  • Ensure performance and stability of LLM-based components in production environments.

Collaboration & Engineering Practices

  • Work closely with data engineers, product owners, and ML researchers to refine use cases and deliver high-quality solutions.
  • Follow modern software engineering best practices including testing, CI/CD, and code documentation.
  • Participate in design reviews and knowledge-sharing sessions.

Required Qualifications

  • 4+ years of professional experience in Python software development.
  • Proven experience working with big data systems, particularly Hadoop, PySpark, or related technologies.
  • Practical experience using LLMs, vector databases, embedding pipelines, and retrieval-augmented generation (RAG) architectures.
  • Familiarity with NLP tasks such as classification, summarization, and information extraction.
  • Experience building or maintaining APIs and microservices.
  • Experience with Model Context Protocol (MCP) for managing prompts and contextual data across LLM applications.

Preferred Qualifications

  • Familiarity with LLMOps tools and scalable inference strategies.
  • Prior work with LangChain, Hugging Face Transformers, or vLLM runtime environments.
  • Data scientist–related experience, such as collaborating on model training and evaluation, aligning data processing logic to ML objectives, or working on feature engineering and experimentation pipelines.
  • Background in financial services, enterprise software, or regulated environments.
Resume ExampleCover Letter Example

Explore more