Senior AI Engineer - 220097

Full Time
Remote

Pune, Maharashtra, India | Gurugram, Haryana, India | Bengaluru, Karnataka, India | Telangana, India

Posted 1 day ago

TERADATA

Software Engineer – Data Catalog & AI Query Services

Full-Time | Remote / Hybrid | Engineering – Cloud Platform

OUR COMPANY

Teradata empowers companies to achieve high-impact business outcomes through analytics. With a powerful combination of industry expertise and leading hybrid cloud technologies for data warehousing and big data analytics, Teradata unleashes the potential of great companies. Partnering with top organizations around the world, Teradata helps improve customer experience, mitigate risk, drive product innovation, achieve operational excellence, transform finance, and optimize assets.

Teradata is recognized by media and industry analysts as a future-focused company for its technological excellence, sustainability, ethics, and business value. The Teradata culture isn't just about one kind of person — our diversity is what makes us unique and drives the dynamic, collaborative environment we're proud of.

OUR TEAM

This position sits within the Data Intelligence Platform team, a group focused on building next-generation AI-assisted data services as part of Teradata's core platform. Our team operates at the intersection of cloud infrastructure, data engineering, and applied AI — shipping highly available, multi-tenant services that power intelligent query routing and data discovery at scale.

Our platform responsibilities include:

  • Designing and operating highly available microservices for data catalog ingestion and serving
  • Building AI-assisted query generation and routing services across heterogeneous data sources
  • Deployment and lifecycle management of services on Kubernetes (K8s) across AWS, Azure, GCP and on-prem.
  • Data pipeline development for catalog extraction, normalization, and semantic enrichment
  • Centralized observability: monitoring, alerting, and distributed tracing for all platform services
  • Providing DevOps tooling and CICD pipelines to support continuous delivery

THE OPPORTUNITY

We are building a new service to collect and normalize data catalogs from diverse data sources — including relational databases, data lakes, data warehouses, and streaming systems — and expose them to an AI agent that dynamically constructs and routes queries to the appropriate source. This is a greenfield initiative that requires strong engineering judgment, a systems-thinking mindset, and experience shipping production-grade services.

You will be a core contributor on this project, working from architecture to implementation — designing ingestion pipelines, building the catalog API layer, and collaborating with the AI/ML team to surface the right metadata signals for intelligent query generation.

RESPONSIBILITIES

  • Design, build, and operate a highly available data catalog collection service that ingests schema and metadata from heterogeneous data sources (RDBMS, data lakes, streaming platforms, APIs)
  • Develop robust data pipelines for catalog extraction, normalization, lineage tracking, and semantic tagging to power AI-driven query routing
  • Build and maintain RESTful and/or gRPC APIs that expose catalog data to an AI query agent
  • Deploy and manage services on Kubernetes (K8s), including helm chart authoring, autoscaling configuration, and multi-cluster operations
  • Ensure service reliability through SLO definition, circuit breakers, retry logic, and distributed tracing
  • Integrate with open-source and cloud-native technologies including Apache Kafka, Spark, dbt, Apache Atlas, or OpenMetadata
  • Collaborate with AI/ML engineers to design and iterate on the metadata schema and query routing interface
  • Participate in on-call rotations and contribute to incident response, postmortems, and reliability improvements
  • Contribute to CICD pipelines, infrastructure-as-code (Terraform / Helm), and automated testing frameworks

QUALIFICATIONS

Required

  • 3+ years of software engineering experience building and operating production services
  • Proficiency in one or more of: Rust, Go, Python, Java— with a preference for Go or Python for backend services
  • Hands-on experience with data pipeline development: ingestion, transformation, and metadata management at scale
  • Solid understanding of RESTful API design principles and service-to-service communication patterns
  • Experience deploying and operating services on Kubernetes (K8s) in production cloud environments
  • Familiarity with at least one major public cloud platform: AWS, Azure, or GCP
  • Strong knowledge of relational and non-relational database systems and their schema/catalog semantics
  • Experience with distributed messaging systems such as Apache Kafka or AWS Kinesis
  • Proficiency with Git, code review workflows, and agile development practices
  • Excellent troubleshooting skills and comfort operating in Linux environments

Preferred

  • Experience with data catalog or metadata management tools such as Apache Atlas, OpenMetadata, DataHub, or Collibra
  • Familiarity with semantic search, vector databases, or LLM-based query generation systems
  • Experience designing or integrating AI/ML model APIs into production backend services
  • Knowledge of data governance, lineage tracking, and schema registry patterns
  • Experience with infrastructure-as-code tools
  • Background in multi-tenant SaaS platform engineering
  • Contributions to open-source data or infrastructure projects

WHY TERADATA

  • Work on a greenfield AI-powered data intelligence product with direct business impact
  • Collaborate with world-class engineers across cloud infrastructure, AI/ML, and data platform teams
  • Competitive compensation including equity, comprehensive benefits, and flexible working arrangements
  • Commitment to engineering excellence: investment in tooling, learning, and technical growth
  • Inclusive and diverse culture with a strong sense of shared mission

 

Teradata is an Equal Opportunity / Affirmative Action Employer and commits to hiring returning veterans.

Why We Think You’ll Love Teradata We prioritize a people-first culture because we know our people are at the very heart of our success. We embrace a flexible work model because we trust our people to make decisions about how, when, and where they work. We focus on well-being because we care about our people and their ability to thrive both personally and professionally. We are committed to actively working to foster an inclusive environment that celebrates people for all of who they are.

.

© 2026 Teradata. All Rights Reserved. | Privacy | Terms of UseTracking Consent | www.teradata.com