Amazon's Profit Intelligence team is seeking a talented Big Data Engineer. We develop software solutions that are revolutionizing Amazon Business Intelligence through advanced algorithms running on big data technologies. The ideal candidate thrives in a fast-paced environment and relishes working with petabytes of extremely complex and dynamic data. In this role you will join a team of high-caliber data and software engineers building data pipelines with big data technologies such as Apache Spark, Hive/Hadoop, and distributed query engines. You should be passionate about working with big data and have the aptitude to incorporate new technologies and evaluate them critically. You must possess excellent business and communication skills and be able to work with business owners to analyze requirements and build solutions. You are a self-starter with a proven track record of dealing with ambiguity and working in a fast-paced, highly dynamic environment. Working experience with at least one programming language such as Java, C#, C++, or Scala is a big plus.
Interface with PMs, business customers, and software developers to understand requirements and implement solutions
Collaborate with both Retail Finance and IN business teams to understand the interdependencies and deliverables
Design, develop, and operate highly scalable, high-performance, low-cost, and accurate data pipelines in distributed data processing platforms with AWS technologies
Recognize and adopt best practices in data processing, reporting, and analysis: data integrity, test design, analysis, validation, and documentation
Keep up to date with big data technologies, and evaluate and decide on the use of new or existing software products when designing the data architecture
Bachelor's degree in computer science, engineering, mathematics, or a related technical discipline
5+ years of industry experience in software development, data engineering, business intelligence, data science, or related field with a track record of manipulating, processing, and extracting value from large datasets
5+ years of experience in designing and developing data processing pipelines using big data technologies (Hadoop, Hive, HBase, Spark, EMR, etc.)
5+ years of experience in designing and developing analytical systems
Experience building large-scale applications and services with big data technologies
Experience providing technical leadership and mentoring other engineers on data engineering best practices
Expertise in SQL, database and storage internals, SQL tuning, and ETL development
Ability to work and communicate effectively with developers and business users
Strong organizational and multitasking skills with ability to balance competing priorities
Working knowledge of scripting languages such as Python or Perl
5+ years of experience as a Data Engineer, BI Engineer, Business/Financial Analyst, or Systems Analyst in a company with large, complex data sources
Experience building/operating highly available, distributed systems of data extraction, ingestion, and processing of large data sets
Experience with full software development life cycle, including coding standards, code reviews, source control management, build processes, and testing
Experience with AWS technologies (EMR, DynamoDB, RDS, Redshift, Athena, S3)
Demonstrated strength in data modeling, ETL development, and data warehousing
Proven success in communicating with users, other technical teams, and senior management to collect requirements and to describe data modeling decisions and data engineering strategy