JobTarget Logo

Senior AI Systems Engineer (Edge & Inference) in Brazil, Indiana at Jobgether

NewJob Function: Engineering
Jobgether
Brazil, Indiana, 47834, United States
Posted on
New job! Apply early to increase your chances of getting hired.

Explore Related Opportunities

Job Description

Senior AI Systems Engineer (Edge & Inference)

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior AI Systems Engineer (Edge & Inference) in Brazil.

This role is focused on bringing advanced AI models into high-performance production environments, with a strong emphasis on efficiency, scalability, and ultra-low latency. You will work at the intersection of machine learning engineering and systems engineering, transforming cutting-edge LLMs, multimodal, NLP, and computer vision models into robust, production-ready services. The position involves close interaction with specialized hardware and optimized inference stacks to ensure maximum computational efficiency. You will be responsible for building and tuning inference pipelines that operate at scale in enterprise and edge environments. This is a highly technical role suited for someone who thrives in performance-critical systems. You will collaborate with multidisciplinary teams to design and deploy impactful AI solutions in real-world scenarios.

Accountabilities:
  • Lead the deployment and productionization of AI models in enterprise-grade environments, ensuring stability, scalability, and performance.
  • Optimize inference pipelines through quantization, pruning, and tuning techniques to balance accuracy, latency, throughput, and energy consumption.
  • Design, implement, and maintain inference services using tools such as Triton Inference Server and ONNX Runtime.
  • Integrate AI models with specialized hardware accelerators to maximize execution efficiency.
  • Develop monitoring, telemetry, and health-check systems for AI workloads in production.
  • Perform profiling, benchmarking, and performance analysis to continuously improve system efficiency.
  • Architect and support advanced AI use cases such as LLM serving, retrieval-augmented generation systems, copilots, and real-time video analytics.
  • Build APIs and inference services using Python and C++, collaborating closely with data, ML, and engineering teams.

Requirements:

  • Strong experience deploying and optimizing machine learning models in production environments.
  • Solid expertise with PyTorch and TensorFlow, especially model export workflows.
  • Deep understanding of ONNX and ONNX Runtime.
  • Hands-on experience with inference servers such as Triton Inference Server.
  • Strong knowledge of inference optimization techniques including INT8, FP16, and mixed precision.
  • Advanced Python programming skills.
  • Intermediate to advanced C++ skills with a focus on performance optimization.
  • Experience working in Linux environments, Docker, and containerized infrastructures.
  • Background in performance profiling, benchmarking, and system optimization.
  • Familiarity with LLMs, vision models, NLP architectures, and multimodal AI systems.
  • Experience in at least one of the following domains: GenAI at scale, real-time computer vision, edge AI systems, or large-scale NLP applications.
  • Strong analytical thinking, autonomy, and ability to work in complex technical environments.
  • Clear communication skills and ability to collaborate with cross-functional teams.

Benefits:

  • Fully remote work model with occasional on-site visits in São Paulo when needed.
  • Indefinite contract and long-term engagement.
  • Opportunity to work on cutting-edge AI systems involving high-performance and edge computing.
  • Exposure to advanced hardware acceleration technologies and large-scale AI deployments.
  • Collaborative and technically challenging environment focused on innovation and impact.
  • Strong engineering culture with emphasis on ownership, autonomy, and performance excellence.
  • Competitive conditions aligned with senior-level technical expertise.
How Jobgether works:
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
#LI-CL1

Job Location

Brazil, Indiana, 47834, United States

Frequently asked questions about this position

Similar Jobs In Brazil, Indiana

Computer Engineer III/IV

WARRANT TECHNOLOGIES
Crane, Indiana
New

PiMS Coordinator

Jobgether
Brazil, Indiana
New

AI Automation Engineer

Jobgether
Brazil, Indiana

Model-Based Systems Engineer (MBSE)

American Technology Solutions International Corp.
Crane, Indiana
Continue to apply
Enter your email to continue. You’ll be redirected to the employer’s application.
By clicking Continue, you understand and agree to JobTarget's Terms of Use and Privacy Policy.