Expert Systems Engineer in India at Jobgether
Explore Related Opportunities
Job Description
This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for an Expert Systems Engineer based in India.
This role sits at the heart of enterprise application reliability and performance, acting as a key technical escalation point for complex infrastructure and application issues. You will be responsible for ensuring stability, observability, and continuous improvement across critical client environments operating at scale. The position involves deep interaction with clients, acting as their trusted technical advisor for performance-related challenges and incident resolution. You will work closely with cross-functional technical teams to diagnose, troubleshoot, and resolve high-impact system issues across applications, databases, and infrastructure layers. The environment is highly dynamic, requiring strong analytical thinking and the ability to operate effectively in 24/7 support operations. You will also contribute to improving monitoring frameworks, operational workflows, and documentation standards. This is a high-ownership role where your expertise directly influences system performance, client satisfaction, and operational excellence.
- Serve as the primary technical point of contact for clients, managing communication and resolution of complex application and database performance issues.
- Diagnose, troubleshoot, and resolve high-severity incidents across application, infrastructure, and database layers in coordination with internal teams.
- Monitor system performance, identify recurring issues, and drive long-term remediation through structured problem management.
- Develop and maintain technical and operational documentation, ensuring clarity of procedures and support workflows.
- Deliver regular client updates, reports, and performance readouts to stakeholders.
- Identify monitoring gaps and implement alerts, dashboards, and automated workflows to improve operational efficiency.
- Collaborate with engineering, infrastructure, and support teams to ensure timely issue resolution and service stability.
- Participate in on-call rotations and 24/7 support coverage, including shift-based operations.
- Support continuous improvement initiatives across monitoring, reporting, and incident management processes.
- 7–8+ years of experience in application support, systems engineering, or production support environments.
- Strong understanding of enterprise monitoring and observability tools such as AppDynamics, LogicMonitor, Azure Monitor, SentryOne, or similar platforms.
- Hands-on experience with Windows Server environments and .NET-based applications, including IIS, worker processes, and event logs.
- Strong knowledge of system performance metrics (CPU, memory, I/O, queues, clustering, etc.) and troubleshooting methodologies.
- Solid SQL expertise, including query analysis, job management, blocking issues, and high availability setups such as Always On.
- Familiarity with Azure environments, networking fundamentals, and infrastructure monitoring concepts.
- Experience with ITSM tools such as ServiceNow or similar platforms.
- Working understanding of ITIL practices (certification is a plus).
- Strong analytical and reporting skills using Excel, Power BI, pivot tables, and dashboards.
- Excellent communication skills with experience supporting international clients (US/Europe exposure preferred).
- Ability to work in shift-based and 24/7 operational environments.
- Competitive compensation aligned with experience and industry benchmarks.
- Shift allowances and additional compensation for on-call responsibilities.
- Remote/hybrid flexibility depending on project requirements.
- Comprehensive health coverage and employee wellness programs.
- Opportunity to work on large-scale, complex global enterprise systems.
- Continuous learning exposure across Azure, monitoring, and performance engineering tools.
- Structured career growth within systems engineering and SRE-style pathways.
- Collaborative, high-impact environment focused on operational excellence and reliability.