Post-Doctoral Research Visit F/M Postdoctoral position Reinforcement Learning for Collaborative Annotation

INRIA

Villeneuve d'Ascq, FR


Context and advantages of the position

This postdoctoral position is part of the national PEPR (Programme et Equipement Prioritaire de Recherche) PlantAgroEco project, coordinated by Alexis Joly.

The PEPR involves several teams from various institutes (Inria ZENITH, CIRAD AMAP, CIRAD PHIM, CIRAD PBVMT, INRAE ePhytia, INRAE IGEPP, INRAE LISAH, IRD EGCE, IRD IEES, Univ. Paris Saclay, TelaBotanica). The position is funded for 18 months and will be conducted at Inria Lille - Nord Europe under the supervision of Odalric-Ambrym Maillard.

This is a postdoctoral position in Machine Learning, more specifically in Reinforcement Learning.

The starting date is flexible; the position could start earlier than Feb. 1st, 2024.

Odalric-Ambrym Maillard is a researcher at Inria. He has worked for over a decade on advancing the theoretical foundations of reinforcement learning, using a combination of tools from statistics, optimization and control, in order to build more efficient algorithms able to better estimate uncertainty, exploit structures, or adapt to some non-stationary context.

He was the PI of the ANR-JCJC project BADASS (BAnDits Against non-Stationarity and Structure) until Oct. 2021. He is also leading the Inria Action Exploratoire SR4SG (Sequential Recommendation for Sustainable Gardening) and the Inria-Japan associate team RELIANT (Reliable multi-armed bandits), and is involved in a series of other projects, ranging from more applied to more theoretical ones, all related to the grand challenge of reinforcement learning, namely making it applicable in real-life situations.


Scool (Sequential COntinual and Online Learning) is an Inria team-project. It was created on November 1st, 2020 as the follow-up of the team SequeL.

In a nutshell, the research topic of Scool is the study of the sequential decision making problem under uncertainty. Most of our activities are related to either bandit problems, or reinforcement learning problems.

Through collaborations, we are working on their application in various fields, mainly health, agriculture and ecology, and sustainable development.

See the team's web page for more information.

Topic. Making reinforcement learning techniques applicable to real-life applications (such as the recommendation of agroecological practices in agriculture) requires overcoming several scientific bottlenecks.

Within the scope of the PEPR PlantAgroEco project, this 18-month postdoc will focus on providing novel reinforcement learning strategies in order to improve the collaborative annotation process of the PlantNet data acquisition platform, from both a theoretical and an applied perspective.

This project raises appealing challenges around contextual multi-armed bandits, relevant to collaborative decision making and recommendation at large, with a unique opportunity to interact with a real data platform used by millions.

Solving the different challenges in a sound and effective way requires special attention from both mathematical and computational standpoints.

Assigned mission

The project is organized around three high-level tasks and research questions:

1. The first task is about the user annotation-expertise profile (which may vary with features and plants). Here the goal is to estimate it, track its evolution, and improve it.

Regarding methods, estimation could be done actively, by adapting contextual bandit strategies with a form of information-driven intrinsic reward, while change-point detection and expert methods are natural tools for tracking.

Finally, active improvement could be done via minimal interaction, active hypothesis testing, and personalized content / task recommendation.

2. The second task is to assist the users in performing rapid annotation, using sequential hypothesis testing personalized to their (estimated) expertise.

Here one challenge is to get rapid annotation in a possibly non-parametric context, by adapting sample-efficient hypothesis testing, best-arm identification, and finite-time analysis techniques.

The small number of interactions available also suggests considering a satisficing objective instead of an optimal-regret one. Another challenge is to personalize assistance to each user's expertise, which involves contextual bandits but also contextual hypothesis testing (charting) techniques.

3. A last task is to adapt strategies for querying complementary experts, for the collective labeling of existing and unknown items.

One of the challenges is to handle the uncertainty of experts, building adaptive confidence sets as well as sequential tests, both parametric and non-parametric, in order to perform adaptive stopping (deciding when enough labeling information has been collected) in a reliable way.

Further, experts can be complementary or disagree, which yields the challenges of enforcing diversity in the pool of experts and ensuring sound collective labeling by adapting majority voting systems.

Last, one may consider fairness constraints on the pool of experts to avoid a large load imbalance between experts.

These tasks can be explored in various ways and lead to other challenges but should be considered the backbone of the project.

The research, though focused on the PlantNet example, should be considered from a broader perspective, and be beneficial to recommender systems at large.

Main activities

The postdoctoral position requires a solid capacity to code and conduct relevant numerical experiments, strong analytical skills, a solid background in statistics, probability, Markov chains, concentration of measure and confidence regions, and a good knowledge of multi-armed bandits (especially contextual bandits), active sampling, and recommender systems methods. The candidate should also be at ease with the theoretical guarantees of the considered strategies.

The successful candidate will interact with the Scool team at Inria Lille (specialized in bandits), with the Zenith team at Inria Montpellier (hosting the PlantNet application), and more generally with the members of the PEPR project, to produce both novel publications and modules for PlantNet (with the help of the engineers from PlantNet).

A good balance between theory and application is expected throughout the project.

Skills

A PhD in machine learning or statistics, possibly related to multi-armed bandits or recommender systems.

Language: fluency in English.

Relational skills: ability to work within a group of people, listen to others, present one's work, discuss it and be able to learn from others.

While performing the assigned tasks, a certain amount of autonomy is welcome, if not necessary.

Benefits

Subsidized meals
Partial reimbursement of public transport costs
Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
Possibility of teleworking and flexible organization of working hours
Professional equipment available (videoconferencing, loan of computer equipment, etc.)
Social, cultural and sports events and activities
Access to vocational training
Social security coverage

Remuneration

Gross monthly salary (before taxes): 2,788€
