Prati
Olivier Pietquin
Olivier Pietquin
Cohere | ex Google DeepMind (On leave - Professor at University of Lille)
Potvrđena adresa e-pošte na univ-lille.fr - Početna stranica
Naslov
Citirano
Citirano
Godina
Deep q-learning from demonstrations
T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ...
Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018
11502018
Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards
M Vecerik, T Hester, J Scholz, F Wang, O Pietquin, B Piot, N Heess, ...
arXiv preprint arXiv:1707.08817, 2017
7332017
Modulating early visual processing by language
H De Vries, F Strub, J Mary, H Larochelle, O Pietquin, AC Courville
Advances in neural information processing systems 30, 2017
5262017
Guesswhat?! visual object discovery through multi-modal dialogue
H De Vries, F Strub, S Chandar, O Pietquin, H Larochelle, A Courville
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017
4402017
Listen and translate: A proof of concept for end-to-end speech-to-text translation
A Bérard, O Pietquin, C Servan, L Besacier
arXiv preprint arXiv:1612.01744, 2016
3092016
A theory of regularized markov decision processes
M Geist, B Scherrer, O Pietquin
International Conference on Machine Learning, 2160-2169, 2019
2852019
Audiolm: a language modeling approach to audio generation
Z Borsos, R Marinier, D Vincent, E Kharitonov, O Pietquin, M Sharifi, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
2622023
Acme: A research framework for distributed reinforcement learning
MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...
arXiv preprint arXiv:2006.00979, 2020
2272020
What matters for on-policy deep actor-critic methods? A large-scale empirical study
M Andrychowicz, A Raichuk, P Stańczyk, M Orsini, S Girgin, R Marinier, ...
arXiv preprint arXiv:2006.05990, 2020
211*2020
End-to-end automatic speech translation of audiobooks
A Bérard, L Besacier, AC Kocabiyikoglu, O Pietquin
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
2112018
A probabilistic framework for dialog simulation and optimal strategy learning
O Pietquin, T Dutoit
IEEE Transactions on Audio, Speech, and Language Processing 14 (2), 589-599, 2006
2042006
Learning from demonstrations for real world reinforcement learning
T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ...
arXiv preprint arXiv:1704.03732, 2017, 2018
1842018
Machine learning for spoken dialogue systems
O Lemon, O Pietquin
European Conference on Speech Communication and Technologies (Interspeech'07 …, 2007
1502007
A framework for unsupervised learning of dialogue strategies
O Pietquin
Presses univ. de Louvain, 2005
1452005
What matters for on-policy deep actor-critic methods? a large-scale study
M Andrychowicz, A Raichuk, P Stańczyk, M Orsini, S Girgin, R Marinier, ...
International conference on learning representations, 2020
1442020
A survey on metrics for the evaluation of user simulations
O Pietquin, H Hastie
The knowledge engineering review 28 (1), 59-73, 2013
1372013
Observe and look further: Achieving consistent performance on atari
T Pohlen, B Piot, T Hester, MG Azar, D Horgan, D Budden, G Barth-Maron, ...
arXiv preprint arXiv:1805.11593, 2018
1262018
Kalman temporal differences
M Geist, O Pietquin
Journal of artificial intelligence research 39, 483-532, 2010
1212010
Algorithmic Survey of Parametric Value Function Approximation
M Geist, O Pietquin
Transactions on Neural Networks and Learning Systems 24 (6), 845-867, 2013
120*2013
Inverse reinforcement learning through structured classification
E Klein, M Geist, B Piot, O Pietquin
Advances in neural information processing systems 25, 2012
1192012
Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.
Članci 1–20