Patrick Kamnang Wanko, PhD

Senior Data Scientist

Data Science
Computer Science
Data Engineering
Cloud - AWS
AI
Experiences
  • Development and integration of NLP/LLM/AI features in an AWS environment
    * Automatic question answering by querying a large document database
    * Timeline of publication summaries based on user-defined parameters (topic and location)
    * Document deduplication
    * Named entity recognition
    * Document classification
  • Translation of customer needs into data science problems
  • Abstraction and modeling of business problems
  • Prototyping of features and models
  • tools: Python, Elasticsearch, PostgreSQL/OpenSearch, Spark, MapR, AWS, Pandas, Linux, Shell, Git, SageMaker, Hugging Face
  • Sodexo (6 months)
    * Construction of a data processing pipeline in a distributed environment
    * Construction of a restaurant attendance forecasting model to reduce waste
    tools: Dataiku, PySpark, Azure, time series, LSTM, Prophet, SARIMA
  • BNP Paribas (6 months)
    * Development of a disaster clustering model
    * Topic extraction
    * Text classification
    * Text translation
    tools: Python, Scikit-Learn, NLP
  • Airbus (one year)
    * Setting up a data processing pipeline in Palantir's Foundry environment
    * Data engineering
    tools: Palantir Foundry, PySpark, Hive, Scala, Spark
  • Optimization, in time and space, of Skyline query computation in relational databases
  • Estimation of query result size
  • Approximate computation
  • Identification of relationships (in particular functional dependencies) between columns
  • Pre-computation and data structures
  • Multidimensional data analysis and correlation detection
  • tools: Java, C++, Big Data
  • Introduction to statistics with SPSS
  • Student assessment
Education

Engineer Statistician

Ecole Nationale de la Statistique et de l'Analyse de l'Information (ENSAI - Rennes)

September 2011 to November 2013
Data processing and analysis
Statistical Information System
Skills

Data Science

  • Data processing
  • Data analysis
  • Decision support models
  • Classification, Clustering
  • Machine Learning
  • Data Mining
  • AI, LLMs

Tools

  • SAS (certification)
  • R, SPSS, Matlab, Spad
  • Scikit-learn, TensorFlow, PyTorch, Keras, MLOps
  • Tableau, PowerBI
  • AWS (EC2, EBS, Sagemaker, OpenSearch, ...)
  • GCP, Microsoft Azure, Dataiku, Palantir
  • Jupyter Notebook, JupyterLab, PyCharm
  • ElasticSearch

Computer Science

  • Python, Java, C++, C, VBA
  • HTML5, JavaScript, PHP, CSS
  • Databases, SQL, NoSQL, PostgreSQL, MySQL, Oracle
  • Spark, PySpark, Hadoop, Scala
  • Docker, VMware, CI/CD
  • Bitbucket, GitLab, GitHub, Subversion
  • Code review

Languages

  • French
  • English

Management

  • Project management
  • Agile, Scrum, Kanban
  • Jira, Confluence, Notion
Certifications

Sequence Models

2019

Convolutional Neural Networks

2019

Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization

2018

Structuring Machine Learning Projects

2018

Neural Networks and Deep Learning

2017

SAS Certified Base Programmer for SAS 9

2013
BP031276v9