Michał Dura

Data Engineer focused on Big Data and Cloud

Remote from Berlin


Location and geographical scope

Berlin, Germany
Remote only
Works remotely most of the time


Project length
  • ≤ 1 week
  • ≤ 1 month
  • Between 1-3 months
  • Between 3-6 months
  • ≥ 6 months




Skills (18)

  • BigData
  • Testing

Michał in a few words

I am passionate about tools that help companies create real business value and profit from data. I've worked on various projects where Hadoop, Spark and AWS were used for that purpose. I am a well-organized and highly motivated person who always carries out responsibilities with great commitment and loves to share knowledge with others.

Always open to new ideas and technologies.



Digital Agency and IT company

Co-founder and Data Engineer

Wrocław and surrounding area

April 2021 - Today

Data Engineer and co-founder focused on building a community of talented Data Engineers in Poland who are ready to work on challenging, multinational projects.

Self-Employed Contractor

Digital Agency and IT company

Big Data Engineer

Wrocław, Lower Silesian Voivodeship, Poland

January 2019 - Today

I've worked on multiple projects in the area of Big Data and Cloud. My main roles:

- Big Data Developer in an Autonomous Driving project. I worked on the design, implementation and performance tuning of ETL pipelines and data flows built on top of Apache Spark, Kafka, HBase and Airflow.

- DevOps engineer setting up Apache Airflow on top of an OpenShift / Kubernetes cluster using the Celery Executor. I was responsible for initial installation, environment preparation, security design, performance tuning and scaling.

- Big Data Developer in a project related to GPS data processing and visualization using Apache Spark and the ELK stack (Elasticsearch, Kibana).

- Data Engineer focused on building ETL pipelines in Apache Spark and Databricks / AWS EMR, Redshift, Glue, orchestrated with Airflow.


Senior Big Data Engineer

Wrocław, Lower Silesian Voivodeship, Poland

January 2018 - December 2018

- Big Data Platform to process data from test vehicles for a client from the automotive industry:
  - Analyzing signals from different modules in cars to find answers to science questions via HiveQL queries.
  - Preparing data flows on top of the Hadoop environment (Spark, Sqoop, Oozie, Hive, Pig, Shell).
  - Creating and performance tuning Spark applications.

- Near real-time ETL solution (Apache Kafka, Apache Spark Structured Streaming, Apache Hive LLAP).

- Conducting multiple trainings for Capgemini employees (Big Data Architecture, Apache Hive, Apache Spark).

- Anomaly detection for a client from the Aerospace & Defence industry:
  - Cleaning and processing data using the Python Pandas library.
  - Applying clustering algorithms (DBSCAN, HDBSCAN).
  - Building an autoencoder neural network model in TensorFlow and Keras on a GPU virtual machine.


Consulting & Auditing

BI, Big Data Engineer

Wrocław, Lower Silesian Voivodeship, Poland

November 2015 - January 2018
