Data Analytics Engineer (AI/ML Focus)
Job Description
Tata Consultancy Services is seeking a Data Analytics Engineer with an AI/ML focus to work onsite in Sunnyvale, CA. The role centers on building scalable data pipelines, preparing AI-ready data, deploying ML models using MLOps practices, developing analytics dashboards, and enforcing data governance in cloud environments.
Responsibilities
- Data Pipeline & Infrastructure Development: Build, maintain, and scale ETL or ELT pipelines using Apache Spark, Airflow, and Kafka to support AI and ML workloads.
- AI-Ready Data Preparation: Convert unstructured data such as text, images, and video into structured datasets suitable for model training, including feature engineering and vector database ingestion.
- ML Model Productionization: Collaborate with data scientists to deploy models, expose them through APIs, and implement MLOps practices, including monitoring for data drift.
- Analytics and Visualization: Develop dashboards using Tableau, Power BI, or Looker and write SQL queries to deliver actionable business insights.
- Data Governance & Quality: Ensure data quality, reliability, and security for AI systems, with adherence to regulations like GDPR or HIPAA and management of PII/PHI.
- Cloud and Data Management: Operate within cloud environments such as AWS, Azure, and Google Cloud, leveraging services like S3, Redshift, Glue, and Databricks.
Requirements
- Programming Languages: Expert-level Python and advanced SQL are mandatory; Java or Scala is preferred for large-scale distributed systems.
- ML Frameworks: Familiarity with PyTorch, TensorFlow, or Scikit-learn for data manipulation and model interaction.
- Data Engineering Tools: Experience with Apache Spark, Kafka, Airflow, dbt, and vector databases such as Pinecone or Milvus.
- Cloud Platforms: Hands-on experience with AWS (Glue, SageMaker) or Google Cloud Platform.
- Analytical Skills: Strong ability to perform exploratory data analysis and interpret complex datasets.
- Soft Skills: Effective communication to bridge technical data engineering with business stakeholders.
Technologies
- Apache Spark
- Airflow
- Kafka
- Tableau
- Power BI
- Looker
- Python
- SQL
- Java
- Scala
- PyTorch
- TensorFlow
- Scikit-learn
- dbt
- Pinecone
- Milvus
- AWS
- SageMaker
- S3
- Redshift
- Glue
- Databricks
- Google Cloud
Location
Sunnyvale, CA
Job Function
Technology
Role
Engineer
Job ID
406121
Desired Skills
- Artificial Intelligence
- Data Analytics
- Kafka
- Machine Learning
- Python
- SQL
- Apache Spark
Salary Range
$70,000 to $125,000 per year