You're using an older version of Internet Explorer that is no longer supported. Please update your browser.

Data Engineer, AWS Fraud Prevention

Full Time
5 days ago
Job summary
Are you interested in taking your skills and career to the next level, while having fun and fighting fraud in the cloud? How would you like to be the driving force for developing the data and insights strategy for our global AWS Fraud team? You will be part of the Analytics Team (Business Intelligence Engineer; Business Analysts) that is central in shaping the definition and execution of the long-term data and insights strategy for AWS Fraud team.

We are looking for an exceptional Data Engineer who is passionate about data and the insights that large amounts of data sets can provide. The ideal candidate will possess both a data engineering background and a strong business acumen that enables him/her to think strategically. He/she will experience a wide range of problem solving situations requiring extensive use of data collection and analysis. The successful candidate will work in lock-step with BI Engineers, Data scientists, ML scientists, Business analysts, Product Managers and other stakeholders across organization. He/she will:

Key job responsibilities
Develop and improve the current data architecture, data quality, monitoring and data availability.
Collaborate with Data Scientists to implement advanced analytics algorithms that exploit our rich data sets for statistical analysis, prediction, clustering and machine learning
Partner with BAs across teams to build and verify hypothesis to improve the AWS Support business.
Help continually improve ongoing reporting and analysis processes, simplifying self-service support for customers
Keep up to date with advances in big data technologies and run pilots to design the data architecture to scale with the increased data sets of customer experience on AWS.


Bachelor's degree in Computer Science or related technical field, or equivalent work experience.
4+ years of work experience with ETL, Data Modeling, optimizing ETL, and Data Architecture.
2+ years of scripting experience in Python, Scala or other languages.
2+ experience with AWS services including S3, Redshift, EMR, Kinesis and RDS.
2+ years of work experience with Big Data Technologies (Hadoop, Hive, Hbase, Pig, Spark, etc.)
Knowledge of distributed systems as it pertains to data storage and computing


Experience with ETL optimization, designing, coding, and tuning big data processes using Apache Spark or similar technologies.
Experience with building data pipelines and applications to stream and process datasets at low latencies.
Experience handling data - tracking data lineage, ensuring data quality, and improving discoverability of data.
Knowledge of distributed systems and data architecture (lambda)- design and implement batch and stream data processing pipelines, knows how to optimize the distribution, partitioning, and MPP of high-level data structures.
Experience with native AWS technologies for data and analytics such as Redshift Spectrum, Athena, S3, Lambda, Glue, EMR, Kinesis, SNS, CloudWatch, etc.

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, disability, age, or other legally protected status. If you would like to request an accommodation, please notify your Recruiter.
Law Enforcement and Security Quality Assurance