Gabriel Lima

I'm

About

Data Science & AI Research

A computer engineer passionate about statistics, artificial intelligence, big data, algorithms, cloud, and much more.

  • Current: Data Scientist at Stefanini
  • Phone: +55 (27) 99600-1277
  • Degree: M.Sc., Applied Computing
  • E-mail: gabriel.mota.b.lima@gmail.com

Resume

Education

Master of Applied Computing (Artificial Intelligence)

2023 - 2025

Federal Institute of Espírito Santo, Serra, ES

  • Pre-project: Towards a Transformer-based Architecture for Brazilian Portuguese Image Captioning
  • Research interests: deep learning, computer vision, natural language processing, multimodal machine learning, large language models, generative artificial intelligence.

Bachelor of Computer Engineering

2017 - 2022

Federal University of Espírito Santo, São Mateus, ES

  • Bachelor Thesis: Recommender Systems - An Approach with Graph Neural Networks
  • President Director, Computer Engineering Academic Center, 2021
  • Event Organizer, CompTalk, 2021
  • Operational Director, Orienta Covid ES HUCAM UFES, 2020

Research

Undergraduate AI Researcher

2020 - 2021

University of São Paulo, São Paulo, SP

Mental Health Research:

  • Developed a script for patients segmentation with form-based data with clustering techniques, using Scikit-Learn and Python. Researched sentiment analysis through images, texts and audios. Researched diagnostic analysis based on socioeconomic data. Designed an architecture that makes use of machine learning and ontologies.

Undergraduate AI Researcher

2019 - 2021

Federal University of Espírito Santo, São Mateus, ES

Smart Grids Research:

  • Developed a script for load detection in smart grids with Python. Applied dimensionality reduction in household appliance data with Keras, NumPy, Scikit-Learn, Pandas, Python, and Tensorflow. Collected voltage and (electric) current data from loads in smart grids simulated at the Laboratory of Renewable Energy I. Created a database with load data and its characteristics with MongoDB.
  • Designed a successful extraction of load characteristics with (electric) current data only. Simple estimator training with only the load characteristics with 98% accuracy (considering all classification metrics and the balance of the data).

Face Recognition Research:

  • Designed convolutional neural networks for face identification using with Databricks, Keras, OpenCV, Python, and Tensorflow. Developed scripts for extraction of facial features (embeddings) using convolutional autoencoders made with Keras. Designed a software for face recognition by the similarity of cosines between the faces feature vectors with NumPy and Scikit-Learn. Created a dataset using the loads data and its embeddings with NumPy files.
  • The final software results: 100% accuracy (also 100% precision and recall) achieved in specific cases and over 80% accuracy in another cases. Facial data extraction process takes less than a second, as well as storing and searching it.

Professional Experience

Data Scientist

2023 - Present

Stefanini, Brasília, DF

As a consultant, I work closely with an international bank.

Data Scientist

2022 - 2023

Tok&Stok, São Paulo, SP

  • Developed tables, views, partitions, and pipelines to feed the main tabular model from Tok&Stok, goals metrics and other destinations. Done with Analysis Services, AWS S3, Azure Automation, dbt, GitHub, Oracle, Pentaho, Power BI, SQL, and Snowflake.
  • Built market basket analysis to deliver insights about products organization and afinity, and continued sales forecasting project. Done with AWS S3, dbt, GitHub, MLflow, mlxtend, NumPy, Pandas, Python, SQL, Scikit-Learn, and Snowflake.
  • Built Slack bot features to generate alerts about Analysis Service database overloads, to publish Power BI reports by typing commands in Slack and to do chat actions. Done with APIs, AWS (S3, Kinesis, EKS, ECR, EC2, SQS, and Secrets Manager), Docker, GitHub, Kubernetes, Poetry, Power BI, and Python.
  • Helped maintain the data warehouse big data architecture and somes data pipelines. Done with AWS (S3, Glue, EC2), dbt, GitHub, Prefect, Python, Snowflake, and Terraform.

Data Science Intern

2021 - 2022

Tok&Stok, São Paulo, SP

  • Developed tables, views, and pipelines to feed some reports and an app to ranking sellers performance. Done with dbt, Oracle, Pentaho, and Snowflake.
  • Built reports and analyzed data to optimize decision making about customer budgets, sales performance, store performance, and product discounts. Done with SQL, Power BI, and Python.
  • Applied regression strategies to solve sales forecasting and to analyze goals metrics. Done with dbt, GitHub, NumPy, Pandas, Prophet, Python, Scikit-Learn, and Snowflake.

Portfolio

  • All
  • Data Engineering
  • Data Science
  • Analytics Engineering
  • ML Engineering
  • Data Analysis
process of data engineering, analytics engineering, web scraping, data preparation, data analysis for retail database organization and insights

Scraping to Analysis - Extra Store

Data Engineering and Analytics

solutions for reducing churn rate in telecom industry

Churn Solutions - Telecom Client

Machine Learning and Analytics

Services

What I am able to do.

Business Understanding

Capture business issues and translate it to techonology solutions.

Analytics

Transformations to prepare data to be used. Incredible reports using data visualization techniques.

Pipelines

Plan and build both essential and complex data pipelines.

Machine Learning

Machine learning modeling, tuning and validation.

MLOps

Machine learning operations for model registrying, serving and productizing.

Storytelling

Explanations and presentations which shows the solution and analysis story, easily and accurately.