Question-and-answer system in a custom database (OpenAI API + SQL project)
Building a question-and-answer system that respond as ChatGPT but fetch a reply from an internal organization database using the OpenAI API (OpenAI Function Calling). App written in Python (back-end) and Streamlit (front-end).
Phytogeographic image recognizer (TensorFlow project)
Convolutional Neural Network (CNN) model to classify images of common botanic species in Rio Blanco - Lima - Peru. Trained from a own search dataset of 3619 Google Image photos divided into 17 botanic classes. The application of this code is for speed up the preliminary classification in field, especially in students where botanic species recognition is limited. The model use the Keras API, TensorFlow platform, and the Nunpy and Matplotlib libraries.
Machine Learning Operations (MLOps)
Resolution of the first individual project of Henry's BootCamp of Data Science. This work comprehensively explores the role of an MLOps Data Engineer, covering the key phases of data engineering, exploratory data analysis and processing, and model creation using machine learning methods. The ETL, EDA, Machine Learning Model and API Deployment tasks were carried out on a database from the Steam video game platform. Project only available in Spanish.
Data analysis of road accidents in the city of Buenos Aires - Argentina (Data Analyst)
Resolution of the second individual project of Henry's BootCamp of Data Science. The main objective of this project is to analyze the information related to deaths in traffic accidents in Buenos Aires - Argentina during the period 2016-2021. The purpose is to produce relevant data and conclusions that help authorities establish strategies that efficiently reduce the number of deaths and injuries in incidents. The ETL, EDA and KPI were developed on the open data of the Government of CABA. Project only available in Spanish.
Reviews and recommendations in Yelp and Google Maps databases
Comprehensive analysis of the US market within the restaurant and related sectors. Main focus on developing a restaurant recommendation system, using data from platforms such as Yelp and Google Maps. This system provides valuable information to investors, helping them make strategic decisions. Work carried out in a group where all the typical Data analyst and Data scientist tasks were carried out such as ETL, EDA, Machine Learning Model, cloud survey, dashboard design, etc. Project only available in Spanish.
Iβm a passionate enthusiastic of the world of Data Science, Machine Learning and Artificial Intelligence. I am a very studious and hard-working person who advances harmoniously in a group and learns new concepts in a self-taught way.
My main skills are Python, NumPy, Pandas, Matplotlib, Seaborn, SQL, Power BI and TensorFlow with some knowledge of front-end development with HTML, CSS and JavaScript, and desktop development with .NET C# and WPF.
My academic background is Environmental Engineering, and Mining and Environmental Management specifically in geomatics modeling with the software ArcGIS, ArcPy, QGIS, AutoCAD and AutoCAD Civil 3D.
Data analyst and Data scientist enthusiast π
Geographic Information System (GIS) and Computer-Aided Design (CAD) specialist πΊ
Lima, Peru
Graduated in bootcamp Data science at Henry π
Graduated in training program Data scientist at Platzi π
Graduated in training program Data analyst at Platzi π
Technician degree in Surveyor at SENCICO π
College degree in Environmental Engineering at UNFV π
Graduated in Master Degree in Mining and Environmental Management at UNMSM π
Python, Jupyter Notebook and Visual Studio Code π
NumPy, Pandas, Matplotlib, Seaborn, SQL and TensorFlow β
Web scraping πΈ
Power BI π
.NET C# and WPF π₯οΈ
ArcGIS and AutoCAD πΊ
Spanish πͺπΈ (Native)
English πΊπΈ (Intermediate B1 TOEFL)
French π«π· (Basic)