A. Johnson πŸ‡¦πŸ‡·

Data Engineer

Summary

I'm a Data Engineer with over 3,5 years of experience in Software and Data Engineering. I'm a Data passionate and enthusiast.&nbsp;<div>I'm comfortable with challenges and working side by side with interdisciplinary teams.</div>

Skills

Proficient in: Python, AWS, SQL, NoSQL, Postgresql, Apache Spark, Bash, PowerBI.

Experienced with: Apache Airflow, Apache Kafka, GCP, C#, Apache Avro, Docker, Apache Cassandra.

Familiar with: Azure, Airbyte, JavaScript, Databricks, Apache Hive.

Experience

Business Intelligence/Data Engineer

G****** ***** ****** **** ******** - Cordoba, Argentina (fullTime)

The company is an IT Consulting firm around the world.

I contributed to two main projects:

  • Internal BI Team project:

    • Working side by side with other Business Intelligence colleagues, Functional Analysts, and other departments.

    • Reducing complexity and performance of SQL Queries to speed up dashboard loading.

    • Creating PowerBI dashboard for the Purchase Management team to get insights about purchases around the world, their status, and delays.

    • Improving manual tasks to fully automate them via Power Automate. RPA (Robot Process Automation) helped me to achieve this milestone.

    • Creating ETLs to build Dimensional Modelling in the Data Warehouse to support Back-End Dashboards.

    • Refactoring ETLs/ELTs pipelines to meet performance and reusability specifications using Pentaho Data Integration and SQL to extract, transform and load into the Data Warehouse.

    • Giving workshops to the client to give guidance on how to build PowerBI dashboards using the Self-Service Principle.

  • Google Internal Project: Working on a Supply chain project for Google as part of their team:

    • Giving insights about errors in the system through SQL.

    • Creating a dashboard to explore and analyze data formats in columns to help the Google team to make decisions to improve validations rules.

    • Developing workflow to execute SQL Script and populate temporary tables to serve dashboards.

Technologies used: Python, PowerBI, Bash, SQL, PostgreSQL, SQLServer, Pentaho Data Integration, Power Automate.

.Net Developer

G****** - Cordoba, Argentina (fullTime)

The company is an IT Consulting firm around the world.

My main achievements were:

  • Created a Back-End API using ASP.NET, and Patterns like MVC and Repository, alongside my team to meet client specifications.

  • Created Unit Tests to ensure the API is meeting the specifications and is working properly.

  • Fixed bugs or reported issues.

Technologies used: Angular, C#, PostgreSQL, MVC Pattern, ASP .NET, JSON.

Projects

Immigration Data Modelling

Source code

This is a Data Pipeline Project for my Data Engineering Nano-degree Program. The idea of this project was to look at the Form I-94 immigration process at the EEUU to get insights into how many immigrants came in 2016. Thanks to Dimensional Modelling we can find how many immigrants came and give more context: Country of origin, Gender, Airport, Temporary State Residence, etc.

Technologies used: AWS, Python, Apache Spark.

Education

Data Engineeering Nano-Degree

Udacity

Online

System Engineering

Universidad Tecnologica Nacional (UTN)

Cordoba, Argentina

Data Engineer with Python - Career track

DataCamp

Online

Languages

English, Spanish.

Work experience

Business Intelligence/Data Engineer

Company
G****** ***** ****** **** ********
Cordoba, Argentina
Full-time

The company is an IT Consulting firm around the world.

I contributed to two main projects:

  • Internal BI Team project:

    • Working side by side with other Business Intelligence colleagues, Functional Analysts, and other departments.

    • Reducing complexity and performance of SQL Queries to speed up dashboard loading.

    • Creating PowerBI dashboard for the Purchase Management team to get insights about purchases around the world, their status, and delays.

    • Improving manual tasks to fully automate them via Power Automate. RPA (Robot Process Automation) helped me to achieve this milestone.

    • Creating ETLs to build Dimensional Modelling in the Data Warehouse to support Back-End Dashboards.

    • Refactoring ETLs/ELTs pipelines to meet performance and reusability specifications using Pentaho Data Integration and SQL to extract, transform and load into the Data Warehouse.

    • Giving workshops to the client to give guidance on how to build PowerBI dashboards using the Self-Service Principle.

  • Google Internal Project: Working on a Supply chain project for Google as part of their team:

    • Giving insights about errors in the system through SQL.

    • Creating a dashboard to explore and analyze data formats in columns to help the Google team to make decisions to improve validations rules.

    • Developing workflow to execute SQL Script and populate temporary tables to serve dashboards.

  • Python
  • PowerBI
  • Bash
  • SQL
  • PostgreSQL
  • SQLServer
  • Pentaho Data Integration
  • Power Automate

.Net Developer

Company
G******
Cordoba, Argentina
Full-time

The company is an IT Consulting firm around the world.

My main achievements were:

  • Created a Back-End API using ASP.NET, and Patterns like MVC and Repository, alongside my team to meet client specifications.

  • Created Unit Tests to ensure the API is meeting the specifications and is working properly.

  • Fixed bugs or reported issues.

  • Angular
  • C#
  • PostgreSQL
  • MVC Pattern
  • ASP .NET
  • JSON

Personal projects

Immigration Data Modelling

Links

This is a Data Pipeline Project for my Data Engineering Nano-degree Program. The idea of this project was to look at the Form I-94 immigration process at the EEUU to get insights into how many immigrants came in 2016. Thanks to Dimensional Modelling we can find how many immigrants came and give more context: Country of origin, Gender, Airport, Temporary State Residence, etc.

  • AWS
  • Python
  • Apache Spark

Education

  1. Data Engineeering Nano-Degree

    Udacity
    Online
  2. Data Engineer with Python - Career track

    DataCamp
    Online
  3. System Engineering

    Universidad Tecnologica Nacional (UTN)
    Cordoba, Argentina

Audio Screening

πŸŽ₯

What's the most challenging technical project or feature I’ve worked on recently?

Technical interview

YEH
Youssef El HassaniGemography's assessment team
Yes

He stood out as an avid self-learner. He likes challenging tasks and is currently looking for an opportunity to grow his skillset. He is mainly interested in working with AWS. Although he has some issues expressing himself fluidly, I think he can be valuable due to his tech level.

Skills
System design
Yes

The candidate was given the "CSV file processing" below:

"Suppose we have a csv file in a location A (e.g SFTP server) and we want to move it to a location B (e.g internal AWS S3 bucket), while doing so, wanna change the file format from csv to parquet."

He proposed the following solution:

  • Access the CSV.

  • Use Spark to consume the data.

  • Access the S3 bucket.

  • Write down as parquet using SPARK ( Use AWS EMR).

  • Save file to S3.

  • Using  Cron to schedule the jobs.

  • Using Hadoop and Spark to scale the infrastructure.

Dealing with loose data:

  • Save all the data sources without modification in a temporary bucket.

Solving dependencies conflicts

  • Not relevant in his solution

Database systems
Yes
  • Specialized in the creation of databases, and data warehouses

  • Familiar with Snowflake, AWS, Azure DB services

  • He created several data pipelines

  • His strongest skill is Pyspark

Version control
Yes
  • Familiar with pull requests

  • Participate in PR review sessions

  • Uses git extensively

Networking (ex : HTTP protocol)
Yes
  • Worked extensively with REST APIs

  • Familiar with HTTP status code

Automated Testing
Yes

Build Automation
Yes
  • Basic knowledge of Docker and Kubernetes.

  • Experience with Terraform to manipulate automated deployment.

Debugging errors
Neutral
  • He uses logging library to see information for each executed line

Soft-skills interview

KF
Kyle FitzPatrickGemography's assessment team
Strong yes
  • A really nice person, very positive, humble and self aware. 

  • Great communication, adding a lot of details and examples to his answers. 

  • Conversational English - currently preparing for the IETLS exam and understands that he needs to work on his pronunciation. 

  • Clearly wants to learn and grow, not only in tech but as a person. Very open to others ideas and opinions. Asked about what the most important soft skills are, especially working with people from other cultures. 

  • Although he doesn't have a lot of work experience, he seemed mature and professional. 

  • Seems like someone that I would enjoy working with and could count on. 

  • He understands his flaws and gave concrete examples of what he needs to work on: intercultural communication, work/life balance.

  • He is hard working and puts in a lot of extra effort. Currently spends 12 hours a day on work and developing his skills. 

  • He seems to be a team player giving a concrete example of taking on extra work so that another employee could meet their deadline. 

Skills
English
Neutral

English level with sufficient structural accuracy and vocabulary to effectively hold a conversation but with a strong accent.

Ability to stay positive
Strong yes
Startup mindset and product sense
No
No experience with a startup so far.
Remote Experience
Yes
Self-Awareness
Yes
Humbleness and eagerness to learn
Strong yes
Autonomy and ownership
Yes
Clear communication and idea structure
Yes