Summary
Overview
Work History
Education
Skills
Interests
Timeline
Generic
ATTILA JÁNKFALVI

ATTILA JÁNKFALVI

Data Engineer/Scientist
Budapest,BU

Summary


Results-focused data professional equipped for impactful contributions. Expertise in designing, building, and optimizing complex data pipelines and ETL processes. Strong in SQL, Python, Power BI, ensuring seamless data integration and robust data solutions. Known for excelling in collaborative environments, adapting swiftly to evolving needs, and driving team success.

Overview

10
10
years of professional experience
1
1
Language

Work History

Data Engineer | Data Architect

Hungarian Central Statistical Office
01.2021 - 01.2024
  • I designed and operated a Python-based batch ETL pipeline between Oracle source and target systems, using control tables, idempotent loading. Managed the full data lifecycle from data collection, cleaning, and database optimization to model training and on-premise deployment. Focused on GDPR compliance, high-level data security, and clear, actionable visualizations.


  • Engineering – Design and build ETL pipelines using Python to integrate diverse governmental datasets. High volume data migration.
  • Leadership – Led a data team of 3 members, coordinated workflows, and ensured data quality.
  • Model developing – Simulate the household behavior with Markov chains; applied different clustering and time series forecasting (ARIMA, SARIMA) to socioeconomic indicators.


  • Python (pandas, numpy, scikit-learn, statsmodels, tensorflow, keras, jupyter, DataBricks, Spark), Oracle SQL, Power BI, Linux, Windows, Git

Independent Data Scientist

Self-Employed
01.2019 - 01.2021
  • Used Car Market Analysis & Price Prediction:
  • Build a daily web scraping pipeline (Python + BeautifulSoup), performed feature engineering and regression modeling to estimate vehicle prices. Deployed automated SQL updates and live Power BI dashboard.(Linux, Bash, Python, Power BI)
  • Behavior Segmentation for a Wellness Studio:
  • Clean and restructure MySQL booking data to detect churn risk, identify high-value customers, and model for user funnel behavior. Delivered interactive Power BI reports for business decisions.

Healthcare Data Analyst

Spicy Analitics ltd.
01.2019 - 01.2020
  • Analyzed large-scale healthcare datasets (500GB+) to uncover relationships between neurological psychiatric conditions. Deliver insights for medical researchers and pharmaceutical industry stakeholders.
  • Data Management – Managed complex, multi-source datasets from national healthcare records.
  • Processing – data cleaning/feature engineering, transformation, and harmonization for advanced analysis. EDA to identify trends, anomalies, and correlations.
  • Insight – Data-driven insights to support research and strategic planning. Visualisation with Power BI.
  • Collaboration – Collaborated with academic and industry stakeholders.

Geophysical Data Analyst & GIS Project Lead

Hungarian Geophysical Institute
01.2014 - 01.2018
  • Led a national-scale geospatial data project to register abandoned mining sites and assess geohazard risks.
  • Developed GIS-based spatial database and designed risk scoring system for surface movement threats.- Processed seismic, geo-electric and GPR signal data, using domain-specific tools

Education

Udemy

Earth Science Engineer - Geophysics specialization

University of Miskolc
01.2010

Skills

Python (pandas, NumPy, scikit-learn, beautifulsoup, statmodel, tensorflow, networkx, hdbscan, oracledb, geopandas, table-baseline3)

Interests

Event-driven architecture, kafka, airflow

Reinforcement learning

Timeline

Data Engineer | Data Architect

Hungarian Central Statistical Office
01.2021 - 01.2024

Independent Data Scientist

Self-Employed
01.2019 - 01.2021

Healthcare Data Analyst

Spicy Analitics ltd.
01.2019 - 01.2020

Geophysical Data Analyst & GIS Project Lead

Hungarian Geophysical Institute
01.2014 - 01.2018

Earth Science Engineer - Geophysics specialization

University of Miskolc

Udemy
ATTILA JÁNKFALVIData Engineer/Scientist