Hi
I'm David Castillo

Hello world! I'm a Data Engineer, DevOps Specialist, and Creator all rolled into one. I've got a serious passion for data analysis, coding, and capturing life through the lens of my camera. My love for these fields means I'm always open to conversations, collaborations, or simply answering any questions you might have. Let's keep learning, growing, and creating together - feel free to reach out anytime. Here's to the exciting journey of discovery ahead!

Email me

Certificates

Databricks Certified Data Engineer Associate

AWS Certified Solutions Architect – Associate (AWS-SAA)

Certified Kubernetes Administrator (CKA)

Databricks Certified Associate Developer for Apache Spark 3.0

Academy Accreditation - Databricks Lakehouse Fundamentals

BAMM.102x: Data, Models and Decisions in Business Analytics

Projects

Airflow Production Cheap Deployment


Helm configuration to deploy airflow in Kubernetes with production level using the minimun resources. (Demo)

Stack
  • Airflow
  • Python
  • Kubernetes

Portfolio Optimization mean-variance


A rational investor, only take desitions based on objective information.

Stack
  • Python
  • cvxopt

OpenCovidETL


ETL to handle data from the goberment public API's and websites to our own database to create the reports, every steps code inside a function and every task was coded in side a Django command mangger.

Stack
  • python
  • django
  • pandas
  • postgresql

Forecast Server


Django server created to retrieve forecast data using live demo data, to forecast the amount of sellings of the products in a store.

Stack
  • django
  • python
  • pandas
  • forecast
  • Docker

Path Planning | Reinforcement Learning


We will create a map from the reality and put a diferential robot in there with the aim to use an path planning algorith through reinforecement learning (PPO)

Stack
  • python
  • tensorflow
  • keras
  • plotly

Birds sounds Analysis [competition]


Notebooks where i analysis birds sound as a part of my data science challenge, the dataset consist of the audio of the birds in the nature. I create a custom model with CNN and LSTM neural networks to classify to which kind of bird the audio belongs.

Stack
  • pandas
  • tensorflow
  • python
  • plotly
  • sklearn

Cassava Leaf [competition]


Transfer learning using eficientNet + xception and aumentation tecniques to clasify the leafs of cassava.

Stack
  • python
  • tensorflow
  • keras
  • plotly

Vlogs

¿How to begging in Machine Learning? | Neural networks


I've created a simple explination to undestand machine learning, focusing on neural networks, from the perceptron to a multi-layer neural network application.

Stack
  • python
  • numpy
  • matplotlib

PID Control + Incremental encoder + Arduino


I've created a simple PID control system with an incremental encoder and an ESP8266 microcontroller. Python to extract sample data from the motor behavior and Matlab to system modeling.

Stack
  • python
  • C++
  • matlab

Iot (ESP32) with Node-Red server


This tutorial shows how to configure an ESP32 in conjunction with the Node-Red framework of IBM, generating an IoT application ready for production.

Stack
  • node-red
  • c++
  • javascript

Photos

Stockholm at summer night
Stockholm at summer night
2023-06-03.jpg
Paris, near Panthéon
Paris, near Panthéon
2023-06-16
David in Stockholm
David in Stockholm
2023-06-05
Cats at Stockholm bay
Cats at Stockholm bay
2023-06-16
Electronics (arduino)
Electronics (arduino)
2022-08-03
Coffee
Coffee
2022-09-15
Water Park, Lima
Water Park, Lima
2021-10-15
Cheap Canon Lens
Cheap Canon Lens
2022-07-01
Beautiful Lima Night
Beautiful Lima Night
2022-11-10
Nuts
Nuts
2022-12-10
Carabayllo at the afternoon
Carabayllo at the afternoon
2023-01-10

Technologies

I've worked with a wide range of technologies in both web development and data engineering fields, specializing in web application architecture and data analytics. My experience spans from designing the architecture to providing data services.

  • Analytics

    Apache Spark
    Google Analytics
    Data Studio
    PowerBI

  • Data-Engineering

    Databricks
    Apache Spark
    Apache Airflow
    Big Query
    Delta Lake

  • Back-End

    Apache Spark
    Postgreql
    Firebase
    Django
    Golang
    javascript

  • Architecture

    Docker
    Kubernetes
    Helm
    CloudRun
    CI/CD
    Google cloud Platform
    AWS

About Me

During my professional journey, I have worked with diverse teams to share my expertise and learn from others in the data industry. As a skilled data engineer and DevOps specialist, I possess the knowledge to design and manage the architecture of companies of all sizes worldwide, as well as the data flow between services. I am a curious and passionate learner, and my interest in data science has persisted since my college days, driving me to constantly develop and enhance my skills.

Personal Achievements

4+

WorldWide Certificates

5+

Kaggle Notebooks

13+

Open Source Projects

22+

Github Followers

200+

CI/CD deployments

12000+

GitHub Contributions

Innovating one project at a time