Joao Diogo de Oliveira, Developer in Fortaleza - State of Ceará, Brazil

Joao Diogo de Oliveira

Verified Expert in Engineering

Machine Learning Engineer and Developer

Location
Fortaleza - State of Ceará, Brazil
Toptal Member Since
October 20, 2022

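Joao is an AI/ML engineer with more than 14 years of experience at Fortune 100 companies such as Procter & Gamble and Hearst, as well as startups in the healthcare, energy, and finance industries. Joao holds a master's degree in computer engineering from the University of Porto and multiple certifications in machine learning and deep learning.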

Portfolio

Hearst - Technology
Python, Artificial Intelligence (AI), Generative Pre-trained Transformers (GPT)...
Peyton & Greyson Solutions Inc,
Artificial Intelligence (AI), AI Design, Generative Adversarial Networks (GANs)...
Freelance Clients
Python 2, Python 3, Deep Learning, Statistics, Data Analytics, Python...

Experience

Availability

Part-time

Preferred Environment

Python 3, PyTorch, TensorFlow, R, Machine Learning, Google Cloud Platform (GCP), Amazon Web Services (AWS)

The most amazing...

...project I've led was the power generation forecast for more than 300 wind and solar farms, delivered in a record 1.5 months.

Work Experience

MVP Developer

2023 - PRESENT
Hearst - Technology
  • Successfully developed an MVP, proving that a legacy system could be replaced easily within 3-4 weeks.
  • Used generative AI (GPT-3.5, GPT-4) and other frameworks and libraries (LangChain and LlamaIndex) to extract structured data from unstructured data. Achieved up to a 98% success rate.
  • Researched and promoted the implementation of the latest generative AI trends to a broad audience. These included, but were not limited to, the newest models, like GPT-4 Turbo, Gemini, Claude, and multimodal models, and the newest frameworks, like LlamaIndex, LangChain, and AutoGPT.
  • Planned and elaborated working pipelines for training and inference so that they could be used seamlessly.
Technologies: Python, Artificial Intelligence (AI), Generative Pre-trained Transformers (GPT), AgentGPT, Generative Artificial Intelligence (GenAI), Google Cloud Platform (GCP), Azure, Gemini, AI Agents, Information Extraction, Generative AI, Large Language Models (LLMs), Data Science, Natural Language Processing (NLP), GPT, Amazon Web Services (AWS), OpenAI

AI Developer

2022 - PRESENT
Peyton & Greyson Solutions Inc,
  • Developed an AI application for writing proposals automatically, saving at least 20% of a specialized employee's time.
  • Designed and architected the entire IT solution: a) database choice and detail; b) AWS serverless services; c) choice and setup of the web app back-end implementation; d) API configuration; e) complete AI model development and deployment with AgentGPT.
  • Tracked team members' development and ensured that milestones were met, from demos to critical development deliverables.
  • Tailored the GPT-3 model to a specific business case successfully.
Technologies: Artificial Intelligence (AI), AI Design, Generative Adversarial Networks (GANs), Language Models, OpenAI, APIs, Backendless, Amazon Web Services (AWS), AWS Lambda, Amazon RDS, Python, DaVinci, Large Language Models (LLMs), Models, AI Programming, Natural Language Understanding (NLU), Matplotlib, Natural Language Processing (NLP), GPT, Generative Pre-trained Transformers (GPT), Information Extraction, GitHub, Cloud Platforms, Data Pipelines, Early-stage Startups, Data Processing, Data Transformation, Back-end, ChatGPT, OpenAI GPT-3 API, Generative Pre-trained Transformer 3 (GPT-3), DevOps, Amazon SageMaker, Jupyter Notebook, OpenAI GPT-4 API, Kubernetes, Scraping, Analytics, Keras, Sentiment Analysis, Generative AI, Data Structures

IT Engineer | Artificial Intelligence Engineer

2019 - PRESENT
Freelance Clients
  • Developed AI projects for energy forecasting of solar and wind farms, totaling 2.6 GW of installed power.
  • Built a model for computer vision that did face recognition.
  • Created a model using computer vision to ease pneumonia detection through X-ray.
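  • Provided wind power certification consulting services for two offshore wind projects with an expected total installed capacity of 2 GW.
  • Maintained over 20 distributed Linux servers, updating and securing them and creating key performance indicators (KPIs).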
Technologies: Python 2, Python 3, Deep Learning, Statistics, Data Analytics, Python, Data Science, Deep Neural Networks, Big Data Architecture, Linux, Datasets, Pandas, Machine Learning Operations (MLOps), Image Processing, Hardware, Large Language Models (LLMs), Models, AI Programming, GPT, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Data Processing Automation, Artificial Intelligence (AI), Image Generation, ARIMA, ARIMA Models, LSTM, SARIMA, R, Matplotlib, Information Extraction, GitHub, Cloud Platforms, Data Pipelines, Energy, Neural Networks, Regression Modeling, Data Processing, Data Transformation, CSV, Data Analysis, Back-end, DevOps, Amazon SageMaker, Jupyter Notebook, Speech Recognition, Scraping, Analytics, FFmpeg, Keras, Sentiment Analysis, Image Recognition, TensorFlow, PyTorch, Computer Vision, Generative AI, OpenAI, Speech to Text, Speech to Intent

Product Owner | Country Manager

2017 - PRESENT
Prewind
  • Developed AI models, including deep learning, weather forecast, and energy prediction for multiple markets.
  • Performed business and data analytics for customers.
  • Led the successful establishment of a European institute in Brazil.
  • Managed a portfolio of clients with a total generation capacity of more than 3 GW.
Technologies: Deep Learning, Artificial Intelligence (AI), Machine Learning, Data Analytics, Data Science, Data Visualization, Linux, Datasets, Pandas, Amazon Web Services (AWS), Python, Hardware, Models, Matplotlib, Information Extraction, GitHub, Early-stage Startups, Energy, Neural Networks, Data Transformation, CSV, Data Analysis, Back-end, DevOps, Workshop Facilitation, Analytics, Sentiment Analysis, Image Recognition

Managing Director

2013 - PRESENT
Niway Group
  • Managed daily operations of the group's investments, including a shopping mall, business towers, and representation before official government bodies.
  • Turned seven years of losses into profit through a large number of steady changes.
  • Supervised the financial control of the construction of three towers, 12 floors each, with a total cost of R$ 43 million.
Technologies: Team Leadership, Finance, Data Science, Data Visualization, Python, Real Estate, CSV, Data Analysis, CTO, Workshop Facilitation, Analytics

Machine Learning Developer

2023 - 2023
EIS - Main
  • Did a feasibility study and implemented a POC on capturing, counting, and geolocating valves in point cloud projects scanned at oil and gas plants.
  • Developed an AI model that identifies valves in batches of images from plant scans.
  • Implemented a method to process and slice point cloud data automatically, extracting images and transforming them into 2D.
Technologies: Machine Learning, Computer Vision, Deep Learning, Convolutional Neural Networks (CNN), Artificial Intelligence (AI), Point Clouds, Point Cloud Data, Image Processing, Natural Language Processing (NLP), Python, TensorFlow, PyTorch, Audacity

Team Leader

2023 - 2023
Stop the Traffik
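  • Analyzed the most fundamental technical problems in the volunteer organization and proposed a plan to solve them with a team of 11 volunteers spread across 9 countries.
  • Led a team of ML/AI specialists in developing a sentiment analysis AI model that automatically analyzes and classifies trafficking articles, removing the manual labor currently applied.
  • Mentored and led a team of ML/AI specialists in improving a legacy model that classifies the organization's articles as relevant or irrelevant.
  • Steered the project's success and engagement through meetings, delivering the proposed results to the organization. Participated in all parts of development (AI, DevOps, Python) to make sure that commitments were met and delivered.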
Technologies: IBM Cloud, Amazon SageMaker, Kubernetes, Data Science, Python, Artificial Intelligence (AI), IBM Cloud Platform

NLP Engineer

2023 - 2023
Mercatus Center at George Mason University - Main
  • Developed a long-text classifier for documents across 96 labels. The goal was to use different NLP techniques to obtain probabilities for three-digit NAICS codes.
  • Explored the literature on the most advanced techniques for text and long-text classification and applied them, combining the different techniques to improve the F1 score by 15%.
  • Used AWS SageMaker to deliver effective and insightful training and inference pipelines.
  • Achieved F1 scores of up to 0.95-0.98 on some categories; on others, using different techniques increased the F1 score from 0.4 to 0.7.
Technologies: Natural Language Processing (NLP), Python, GPT, Generative Pre-trained Transformers (GPT), NLPP, Deep Neural Networks, Amazon SageMaker, Transformers, Data Science, Artificial Intelligence (AI), TensorFlow
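
For readers unfamiliar with the per-category F1 scores mentioned above, the following is a minimal sketch of how such scores can be computed. It is not the project's actual evaluation code; the function name `f1_per_label` and the sample labels are hypothetical, chosen only for illustration.

```python
def f1_per_label(y_true, y_pred):
    """Return a dict mapping each label to its F1 score.

    F1 = 2*TP / (2*TP + FP + FN), computed one label at a time.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    scores = {}
    for label in sorted(set(y_true) | set(y_pred)):
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        scores[label] = 2 * tp / denom if denom else 0.0
    return scores


# Hypothetical sample: true and predicted three-digit codes for four documents.
y_true = ["311", "311", "325", "325"]
y_pred = ["311", "325", "325", "325"]
scores = f1_per_label(y_true, y_pred)
```

Reporting the score per label, rather than a single average, is what makes it possible to see that some categories score far higher than others.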

Engineering Manager

2012 - 2013
Procter & Gamble
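  • Implemented multiple production line update projects at plants in France, Italy, and Spain.
  • Developed cost-saving solutions and deployed them across multiple factories.
  • Led technical discussions with suppliers, ensuring they could meet the requirements.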
Technologies: Agile, Project Design & Management, Process Management, APIs, Linux, Hardware, Supply Chain Management (SCM), Supply Chain Optimization, SARIMA, Data Processing, Data Analysis, Workshop Facilitation

Supply Chain Leader

2009 - 2012
Procter & Gamble
  • Led the design and implementation of a global pilot project to transform the company's logistics department.
  • Found a solution to complex problems of inventory costs, achieving a reduction from $12 million to $7 million.
  • Participated in creating an internal cross-docking supply chain prototype, resulting in yearly savings of $2 million.
  • Coached, guided, and coordinated the work of multiple team members.
Technologies: Project Design & Management, Logistics, Agile, Forecasting, Data Science, Datasets, Supply Chain Management (SCM), Supply Chain Optimization, Data Processing, Data Analysis, Workshop Facilitation

NLP in Healthcare | Score Clinical Patient Notes

http://www.kaggle.com/c/nbme-score-clinical-patient-notes
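A project to classify each patient's possible conditions based on actual notes taken by physicians during clinical trials. My task was to build a natural language processing (NLP) model on top of the RoBERTa base framework.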

CV: X-ray Pneumonia Detection

http://github.com/joao-d-oliveira/X-Ray_PneumoniaDetection
A computer vision model that receives an X-ray image, detects the presence of foreign tissue, and predicts whether the image belongs to a patient with pneumonia. The model performed similarly to a trained physician, with a precision of 86% (no pneumonia) and 19% (pneumonia).

Power Generation Forecast for Wind and Solar Farms

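A power generation forecast for more than 300 wind and solar farms in Portugal. I performed data analysis on the plants' geographic locations and their wind and solar profiles, structuring all the data, building an ensemble of around five models per farm, and training and deploying the models.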
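
As an illustration of what an ensemble of several models per farm can look like, here is a minimal sketch that averages the forecasts of the individual models step by step. Simple averaging is an assumption made for this example; the source does not say how the models were combined, and the function name and sample values are hypothetical.

```python
from statistics import mean


def ensemble_forecast(model_forecasts):
    """Combine per-model forecasts by averaging them at each time step."""
    if not model_forecasts:
        raise ValueError("need at least one model forecast")
    horizon = len(model_forecasts[0])
    if any(len(forecast) != horizon for forecast in model_forecasts):
        raise ValueError("all forecasts must cover the same horizon")
    # zip(*...) groups the predictions of all models for the same time step.
    return [mean(step) for step in zip(*model_forecasts)]


# Hypothetical forecasts (in MW) from five models for one farm, three time steps.
forecasts = [
    [10.0, 12.0, 14.0],
    [11.0, 13.0, 15.0],
    [9.0, 11.0, 13.0],
    [10.0, 12.0, 14.0],
    [10.0, 12.0, 14.0],
]
combined = ensemble_forecast(forecasts)
```

Averaging is only the simplest combination rule; weighted averages or a stacked model are common alternatives when some models are known to perform better for a given farm.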

Computer Vision - Face Detection

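A computer vision model, built with machine learning techniques, that performs video-based face recognition. I played an important role from the start in building the model and the necessary pipelines. Additionally, I achieved a false acceptance rate (FAR) of around 10^-5, meeting clients' needs.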
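
For context on the FAR figure above, the following minimal sketch shows how a false acceptance rate is commonly computed: the share of impostor comparisons (two different people) whose similarity score is still accepted. The function name, the threshold, and the scores are hypothetical and are not taken from the project.

```python
def false_acceptance_rate(impostor_scores, threshold):
    """Fraction of impostor comparisons wrongly accepted (score >= threshold)."""
    if not impostor_scores:
        raise ValueError("need at least one impostor comparison")
    accepted = sum(1 for score in impostor_scores if score >= threshold)
    return accepted / len(impostor_scores)


# Hypothetical data: 100,000 impostor comparisons, one of which is accepted.
impostor_scores = [0.1] * 99_999 + [0.9]
far = false_acceptance_rate(impostor_scores, threshold=0.8)
```

With one false accept in 100,000 impostor comparisons, the sketch yields a FAR of 10^-5. Measuring a rate that low requires at least that many impostor comparisons in the evaluation set.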

Developing AI Automated Proposal Generation

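The application automates proposal writing. The idea was to develop a model and a supporting web application that save at least 20% of a specialized employee's time. I completed the development of a working AI model based on GPT-3. I also designed and developed the structure and architecture of the web application, building most of the back-end functions and the entire database architecture.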

CV: Image Captioning - Identifying Objects and Writing Caption

Developed a machine learning model that, through deep learning networks, analyzes images, identifies objects, and captions the images accordingly. The project achieved a BLEU-1 score of 0.679 for image captioning; a score of 0.6-0.7 is considered best in class.

Email NLP/NLU/NER Analysis

Used advanced NLP techniques to extract insights from emails: classifying them into a predefined set of categories (reaching an overall accuracy of over 83%), extracting important information from the text, and performing data analysis, summarization, and other relevant tasks.

Surgery Assistance Software

A piece of AI software that could do voice recognition, interpret commands, and recognize the tools needed for the specific surgical moment. On top of that, the AI predicts the order of tools used in the surgery based on historical information.

I designed and implemented the architecture of the software, achieving an MVP.

2003 - 2009

Master's Degree in Computer Science

University of Porto - Porto, Portugal

2007 - 2008

Exchange Program Coursework Toward Master's Degree in Computer Science

Delft University of Technology - Delft, Netherlands

AUGUST 2022 - PRESENT

Quantum Excellence Certificate

IBM | Qiskit Global Summer School 2022

JULY 2022 - PRESENT

AI for Healthcare

Udacity

JULY 2021 - PRESENT

Machine Learning

Stanford University

JULY 2021 - PRESENT

Deep Reinforcement Learning

Udacity

JUNE 2021 - PRESENT

Advanced Computer Vision - Machine Learning

Udacity

Libraries/APIs

PyTorch, TensorFlow, Scikit-learn, Pandas, LSTM, Matplotlib, Keras, OpenCV, PyTorch Lightning, FFmpeg

Tools

GitHub, Amazon SageMaker, ChatGPT, You Only Look Once (YOLO), NLPP, Audacity

Languages

Python 3, SQL, Python, R, Python 2, C++

Paradigms

Data Science, Agile, DevOps, Anomaly Detection

Platforms

Linux, Amazon Web Services (AWS), Jupyter Notebook, Google Cloud Platform (GCP), Kubernetes, Docker, Azure, Backendless, AWS Lambda, IBM Cloud Platform

Storage

Data Pipelines, PostgreSQL, MySQL

Other

Machine Learning, Deep Learning, Data Structures, Artificial Intelligence (AI), Algorithms, Team Leadership, Project Design & Management, Computer Vision, BERT, Natural Language Processing (NLP), Deep Neural Networks, Datasets, Language Models, OpenAI, Image Processing, Hardware, Large Language Models (LLMs), Models, AI Programming, Data Processing Automation, Real Estate, ARIMA, ARIMA Models, Supply Chain Management (SCM), Supply Chain Optimization, Forecasting, Information Extraction, Energy, Neural Networks, Regression Modeling, Data Processing, Data Transformation, CSV, Data Analysis, GPT, Generative Pre-trained Transformers (GPT), Back-end, Generative Pre-trained Transformer 3 (GPT-3), OpenAI GPT-4 API, Workshop Facilitation, Analytics, Convolutional Neural Networks (CNN), Sentiment Analysis, Generative AI, Data Analytics, Process Management, Logistics, Statistics, Computer Vision Algorithms, Data Visualization, Big Data Architecture, Machine Learning Operations (MLOps), Generative Adversarial Networks (GANs), DaVinci, SARIMA, Natural Language Understanding (NLU), Hugging Face, Cloud Platforms, Early-stage Startups, Generative Artificial Intelligence (GenAI), Web Development, Word Embedding, OpenAI GPT-3 API, API Integration, Speech Recognition, Scraping, Facial Recognition, Image Recognition, Speech to Text, Finance, Quantum Computing, Healthcare IT, Deep Reinforcement Learning, APIs, Object Detection, Generative Models, AI Design, Amazon RDS, Image Generation, CTO, Transformers, IBM Cloud, Prompt Engineering, Qiskit, AgentGPT, Point Clouds, Point Cloud Data, Gemini, AI Agents, Speech to Intent
