Paris
I am currently a Data Scientist / Senior Developer at PriceMinister/Rakuten, working on machine learning and data mining, recommendation engines, and record linkage.
I develop mostly in Python (pandas, scikit-learn), using big-data technologies (such as Hadoop, Cassandra, GlusterFS, and Couchbase, among others), and work on statistical analysis and machine learning.
Formerly, I worked as a Data Scientist / R&D engineer at Logilab, mainly on Semantic Web projects. I developed tools for the CubicWeb framework, and I contributed to scikit-learn, a Python module for machine learning.
I hold a PhD in Computer Science, specializing in machine learning and data mining applied to neuroimaging. I am a graduate engineer of ESPCI (Ecole Supérieure de Physique et de Chimie Industrielles de Paris), and I hold a Master 2 in Applied Mathematics, Computer Vision and Machine Learning from the Ecole Normale Supérieure de Cachan (Cachan). I also completed the teaching unit in Anatomy and Imaging of the Central Nervous System at the Faculty of Medicine Pitié-Salpêtrière (Paris).
Software skills:
* Python (development, training), with more than 6 years of experience.
* Hadoop/HDFS, parallel computing, data streaming (ZMQ).
* SQL databases (PostgreSQL, PostGIS, SQLite).
* NoSQL databases (Redis, Cassandra, Couchbase).
* Agile development and software engineering.
* Web (HTML, CSS, JavaScript, CubicWeb framework).
* Mercurial, Git.
* Debian/CentOS, Windows, iOS.
Data engineering skills:
* Data import and structuring (SQL, NoSQL, Semantic Web, record linkage).
* Data mining and machine learning (scikit-learn, NLTK, pandas).
* Statistical learning (feature selection, predictive modeling, regularization and Bayesian methods).
* Data visualization.
* Deep learning for image classification.
Research interests / Application fields:
* E-commerce.
* Catalogues / bibliographic data.
* Medical data.
* Open data.
My skills:
Project management
Semantic Web
Python