Job description
Tasks and duties:
- Working with the Data Engineering and Machine Learning team to build custom data pipelines.
- Working with external clients, teams, data owners, and solution architects to build data flows in a reliable way.
- Building transformations, scripts, and migrations to multiple specifications and standards.
- Applying a data-driven mindset: our clients require PoCs, data exploration/normalization, and expertise.
- Monitoring data flows and making continuous improvements to data pipelines.
We want you on board if you:
- Are advanced in the Python programming language (understanding of iterators, generators, exceptions, OOP, and popular libraries for data engineering).
- Have advanced SQL knowledge.
- Have practical knowledge of DevOps, i.e. CI, CD, Terraform, and observability.
- Have experience with Apache Spark / AWS Glue, or similar solutions.
- Have experience with ETL (Airflow) or other data processing automation approaches.
- Have experience with Snowflake.
- Have a very good command of written and spoken English (B2+). Polish is not required.
It would be a big plus if you have:
- Experience with AWS Redshift or GCP BigQuery.
- Experience with coding in Scala.
- Hands-on experience with Hadoop technologies or their equivalents in a cloud environment.
- Experience optimizing data storage in HDFS/Parquet/Avro.
- Experience with cloud technologies (AWS, GCP, Azure or other).
- Experience working with large volumes of data (ideally TB+).
- The ability to debug complex data infrastructures.
Perks and benefits:
- Sponsored AWS Certified Solutions Architect training and certification exam,
- Access to the WorkSmile platform offering benefits adapted to your preferences:
  - Multisport card,
  - Private health insurance package,
  - Life insurance,
  - And hundreds of other options to choose from across 15 categories (shopping, leisure, travel, food, etc.),
- Support for your growth - a book budget and a head/manager’s budget available to every employee,
- Discounts on Apple products,
- One-time 1000 PLN home office bonus,
- Various internal initiatives: webinars, knowledge sharing sessions, internal conferences.
If you want to read more, check out our 7 reasons to work at Netguru.
Must have
- SQL
- Python
- Spark
- Glue
- Airflow
- Snowflake
- English
Nice to have
- AWS
- GCP
- Hadoop