ZENITH: Gestion de données scientifiques

Zenith s’attaque aux défis posés par la gestion (stockage, partage, traitement, recherche analyse) des données massives (big data, données scientifiques). Ces défis (correspondant aux trois big V : Volume, Velocity, Variety) peuvent se résumer ainsi:

1. très grande échelle (big data, big analytics) ;

2. données en continu (produits par des capteurs, des appareils mobiles, …) ;

3. hétérogénéité et complexité des données (différences sémantiques, données incertaines ou multi-échelles, …).

Notre objectif est d’apporter des solutions innovantes, en démontrant des avantages en termes de passage à l’échelle, fonctionnalité, facilité d’usage et performance, dans des environnements distribués et parallèles (P2P, grid, cloud).

Nous cherchons à produire des résultats fondamentaux et algorithmiques, que nous pouvons implémenter dans des environnements spécifiques, par ex. Grid5K. Pour valider nos solutions, nous collaborons avec des partenaires scientifiques (INRA, CIRAD, IRD, etc.) et industriels (Data Publica, Bull, EDF, Orange, Microsoft, MonetDB, Sparsity, etc.).

Membres

Permanents

Non permanents

Thématiques de recherche

Le projet Zenith est organisé en trois thèmes complémentaires :

1. Gestion de données et métadonnées : gestion et intégration de données et métadonnées (schémas, ontologies) à grande échelle, en particulier, stockage de big data, résolution d’entités incertaines et traitement de requêtes probabilistes.

2. Partage de données et processus : gestion des données et processus scientifiques dans des environnements distribués et parallèles, avec partage de données en P2P, recommandation dans les communautés en ligne et support des workflows scientifiques.

3. Analyse de données : fouille de données et recherche de données par contenu en exploitant le parallélisme du cloud et les nouvelles technologies NoSQL et MapReduce.

Ces trois thèmes reflètent le continuum qui va de la capture des données, en passant par leur intégration, gestion et partage, jusqu’à leur analyse, afin de produire informations et connaissances.

Publications depuis 2013 - Evaluation 2019

Articles de revues internationales

2018

  1. Data reduction in scientific workflows using provenance monitoring and user steering
    Renan Souza, Vitor Silva, Alvaro L.G.A. Coutinho, Patrick Valduriez, Marta Mattoso
    Future Generation Computer Systems, Elsevier, In press, pp.1-21.
  2. An Overview of Lead and Accompaniment Separation in Music
    Zafar Rafii, Antoine Liutkus, Fabian-Robert Stöter, Stylianos Ioannis Mimilakis, Derry Fitzgerald, Bryan Pardo
    IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2018. <10.1109/TASLP.2018.2825440>
  3. AutoWIG: automatic generation of python bindings for C++ libraries
    Pierre Fernique, Christophe Pradal
    PeerJ Computer Science, PeerJ, 2018, 4. <10.7717/peerj-cs.149>
  4. A Survey of Scheduling Frameworks in Big Data Systems
    Ji Liu, Esther Pacitti, Patrick Valduriez
    International Journal of Cloud Computing, Inderscience Publishers, In press, pp.1-27.

2017

  1. Scientific workflows for computational reproducibility in the life sciences: Status, challenges and opportunities
    Sarah Cohen-Boulakia, Khalid Belhajjame, Olivier Collin, Jérôme Chopard, Christine Froidevaux, Alban Gaignard, Konrad Hinsen, Pierre Larmande, Yvan Le Bras, Frédéric Lemoine, Fabien Mareuil, Hervé Ménager, Christophe Pradal, Christophe Blanchet
    Future Generation Computer Systems, Elsevier, 2017. <10.1016/j.future.2017.01.012>
  2. Data placement in massively distributed environments for fast parallel mining of frequent itemsets
    Saber Salah, Reza Akbarinia, Florent Masseglia
    Knowledge and Information Systems (KAIS), Springer, 2017, 53 (1), pp.207-237.
  3. Scientific Workflow Scheduling with Provenance Data in a Multisite Cloud
    Ji Liu, Esther Pacitti, Patrick Valduriez, Marta Mattoso
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2017, 33, pp.80-112.

2016

  1. Effective and Efficient Similarity Search in Scientific Workflow Repositories
    Johannes Starlinger, Sarah Cohen-Boulakia, Sanjeev Khanna, Susan Davidson, Ulf Leser
    Future Generation Computer Systems, Elsevier, 2016, 56, pp.584-594.
  2. Multi-Objective Scheduling of Scientific Workflows in Multisite Clouds
    Ji Liu, Esther Pacitti, Patrick Valduriez, Daniel De Oliveira, Marta Mattoso
    Future Generation Computer Systems, Elsevier, 2016, 63, pp.76-95.
  3. FP-Hadoop: Efficient Processing of Skewed MapReduce Jobs
    Miguel Liroz-Gistau, Reza Akbarinia, Divyakant Agrawal, Patrick Valduriez
    Information Systems, Elsevier, 2016, 60, pp.69-84.
  4. Guest Editorial: Environmental Multimedia Retrieval
    Stefanos Vrochidis, Kostas D. Karatzas, Ari Karppinen, Alexis Joly
    Multimedia Tools and Applications, Springer Verlag, 2016, 75 (3), pp.1557--1562.
  5. Analyzing Related Raw Data Files through Dataflows
    Vitor Silva Souza, Oliveira Daniel De, Patrick Valduriez, Marta Mattoso
    Concurrency and Computation: Practice and Experience, Wiley, 2016, 28 (8), pp.2528-2545.
  6. Plant identification: Man vs. Machine
    Pierre Bonnet, Alexis Joly, Hervé Goëau, Julien Champ, Christel Vignau, Jean-François Molino, Daniel Barthélémy, Nozha Boujemaa
    Multimedia Tools and Applications, Springer Verlag, 2016, LifeCLEF 2014 plant identification challenge, 75 (3), pp.1647-1665.
  7. Query processing in multistore systems: an overview
    Carlyna Bondiombouy, Patrick Valduriez
    International Journal of Cloud Computing, Inderscience Publishers, 2016, pp.38.
  8. Multistore Big Data Integration with CloudMdsQL
    Carlyna Bondiombouy, Boyan Kolev, Oleksandra Levchenko, Patrick Valduriez
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2016, 28, pp.48-74.

2015

  1. Data-Centric Iteration in Dynamic Workflows
    Jonas Dias, Gabriel Guerra, Fernando Rochinha, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
    Future Generation Computer Systems, Elsevier, 2015, 46, pp.114-126.
  2. A Survey of Data-Intensive Scientific Workflow Management
    Ji Liu, Esther Pacitti, Patrick Valduriez, Marta Mattoso
    Journal of Grid Computing, Springer Verlag, 2015, 13, 44 p. <10.1007/s10723-015-9329-8>
  3. FP-Hadoop: Efficient Execution of Parallel Jobs Over Skewed Data
    Miguel Liroz-Gistau, Reza Akbarinia, Patrick Valduriez
    Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2015, 8 (12), pp.1856-1867.
  4. A look inside the Pl@ntNet experience
    Alexis Joly, Pierre Bonnet, Hervé Goëau, Julien Barbe, Souheil Selmi, Julien Champ, Samuel Dufour-Kowalski, Antoine Affouard, Jennifer Carré, Jean-François Molino, Nozha Boujemaa, Daniel Barthélémy
    Multimedia Systems, Springer Verlag, 2015, pp.16.

2014

  1. Dynamic Workload-Based Partitioning Algorithms for Continuously Growing Databases
    Miguel Liroz-Gistau, Reza Akbarinia, Esther Pacitti, Fabio Porto, Patrick Valduriez
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2014, pp.105.
  2. Entity Resolution for Probabilistic Data
    Ayat Naser, Reza Akbarinia, Hamideh Afsarmanesh, Patrick Valduriez
    Information Sciences, Elsevier, 2014, 277, pp.492-511.
  3. Interactive plant identification based on social image data
    Alexis Joly, Hervé Goëau, Pierre Bonnet, Vera Bakić, Julien Barbe, Souheil Selmi, Itheri Yahiaoui, Jennifer Carré, Elise Mouysset, Jean-François Molino, Nozha Boujemaa, Daniel Barthélémy
    Ecological Informatics, Elsevier, 2014, 23, pp.22-34.

2013

  1. Entity Resolution for Distributed Probabilistic Data
    Naser Ayat, Reza Akbarinia, Hamideh Afsarmanesh, Patrick Valduriez
    Distributed and Parallel Databases, Springer, 2013, 31 (4), pp.509-542.
  2. As-Soon-As-Possible Top-k Query Processing in P2P Systems
    William Kokou Dedzoe, Philippe Lamarre, Reza Akbarinia, Patrick Valduriez
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2013, Part IX, LNCS (7980), pp.1-27.
  3. Efficient Evaluation of SUM Queries Over Probabilistic Data
    Reza Akbarinia, Patrick Valduriez, Guillaume Verger
    IEEE Transactions on Knowledge and Data Engineering, Institute of Electrical and Electronics Engineers, 2013, 25 (4), pp.764-775.
  4. Chiron: A Parallel Engine for Algebraic Scientific Workflows
    Eduardo Ogasawara, Dias Jonas, Vitor Silva, Chirigati Fernando, Oliveira Daniel De, Fabio Porto, Marta Mattoso, Patrick Valduriez
    Concurrency and Computation: Practice and Experience, Wiley, 2013, 25 (16), pp.2327-2341.

2011

  1. Replication in DHTs using Dynamic Groups
    Reza Akbarinia, Mounir Tlili, Esther Pacitti, Patrick Valduriez, Alexandre A. B. Lima
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2011, Part III - Special Issue on Data and Knowledge Management in Grid and P2P Systems, LNCS (6790), pp.1-19.
  2. Optimizing the reliability of streaming applications under throughput constraints
    Anne Benoit, Hinde Lilia Bouziane, Yves Robert
    International Journal of Parallel Programming, Springer Verlag, 2011, 39 (5), pp.584-614.
  3. Best Position Algorithms for Efficient Top-k Query Processing
    Reza Akbarinia, Esther Pacitti, Patrick Valduriez
    Information Systems, Elsevier, 2011, 36 (6), pp.973-989.
  4. Discovering Significant Evolution Patterns from Satelllite Image Time Series
    François Petitjean, Florent Masseglia, Pierre Gancarski, Germain Forestier
    International Journal of Neural Systems, World Scientific Publishing, 2011, 21 (6), pp.15.
  5. Building a Peer-to-Peer Content Distribution Network with High Performance, Scalability and Robustness
    Manal El Dick, Esther Pacitti, Reza Akbarinia, Bettina Kemme
    Information Systems, Elsevier, 2011, 36 (2), pp.222-247.
  6. Energy Efficient Data Access in Mobile P2P Networks
    Kwangjin Park, Patrick Valduriez
    IEEE Transactions on Knowledge and Data Engineering, Institute of Electrical and Electronics Engineers, 2011, 23 (11), pp.1619 - 1634.
  7. Discovering Frequent Behaviors: Time is an Essential Element of the Context
    Bashar Saleh, Florent Masseglia
    Knowledge and Information Systems (KAIS), Springer, 2011, 28 (2), pp.311-331.
  8. An Algebraic Approach for Data-Centric Scientific Workflows
    Eduardo Ogasawara, Daniel De Oliveira, Patrick Valduriez, Daniel Dias, Fabio Porto, Marta Mattoso
    Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2011, 4 (11), pp.1328-1339.

2010

  1. FORUM: A Flexible data Integration System Based on Data Semantics
    Zohra Bellahsene, Salima Benbernou, Hélène Jaudoin, Francois Pinet, Olivier Pivert, Farouk Toumani, Stephan Bernard, Pierre Colomb, Remi Coletta, Emmanuel Coquery, Fabien De Marchi, Fabien Duchateau, Mohand-Said Hacid, Allel Hadjali, Mathieu Roche
    SIGMOD record, ACM, 2010, 39 (2), pp.11-18.

2009

  1. DHTJoin: Processing Continuous Join Queries Using DHT Networks
    Wenceslao Palma, Reza Akbarinia, Esther Pacitti, Patrick Valduriez
    Distributed and Parallel Databases, Springer, 2009, pp.291-317.

2004

  1. View Adaptation in Fragment-Based Approach
    Zohra Bellahsene
    IEEE Transactions on Knowledge and Data Engineering, Institute of Electrical and Electronics Engineers, 2004, 16 (11), pp.1441-1455.

2002

  1. Schema Evolution in Data Warehouses
    Zohra Bellahsene
    Knowledge and Information Systems (KAIS), Springer, 2002, 4 (3), pp.283-304.
  2. 2018

    1. Non-parametric Bayesian annotator combination
      Maximilien Servajean, Romain Chailan, Alexis Joly
      Information Sciences, Elsevier, 2018, 436-437, pp.131-145.
    2. Species distribution modeling based on the automated identification of citizen observations
      Christophe Botella, Alexis Joly, Pierre Bonnet, Pascal Monestiez, François Munoz
      Applications in Plant Sciences, Wiley, 2018, Green Digitization: Online Botanical Collections Data Answering Real‐World Questions, 6 (2), pp.1-11.
    3. In situ visualization and data analysis for turbidity currents simulation
      José Camata, Vitor Silva, Patrick Valduriez, Marta Mattoso, Alvaro Coutinho
      Computers & Geosciences, Elsevier, 2018, 110, pp.23-31.

    2017

    1. Going deeper in the automated identification of Herbarium specimens
      Jose Carranza-Rojas, Herve Goeau, Pierre Bonnet, Erick Mata-Montero, Alexis Joly
      BMC Evolutionary Biology, BioMed Central, 2017, 17 (1), pp.181.
    2. Crowdsourcing Thousands of Specialized Labels: A Bayesian Active Training Approach
      Maximilien Servajean, Alexis Joly, Dennis Shasha, Julien Champ, Esther Pacitti
      IEEE Transactions on Multimedia, Institute of Electrical and Electronics Engineers, 2017, 19 (6), pp.1376-1391.
    3. InfraPhenoGrid: A scientific workflow infrastructure for Plant Phenomics on the Grid
      Christophe Pradal, Simon Artzet, Jerome Chopard, Dimitri Dupuis, Christian Fournier, Michael Mielewczik, Vincent Negre, Pascal Neveu, Didier Parigot, Patrick Valduriez, Sarah Cohen-Boulakia
      Future Generation Computer Systems, Elsevier, 2017, 67, pp.341-353.
    4. A Highly Scalable Parallel Algorithm for Maximally Informative k-Itemset Mining
      Saber Salah, Reza Akbarinia, Florent Masseglia
      Knowledge and Information Systems (KAIS), Springer, 2017.
    5. Raw data queries during data-intensive parallel workflow execution
      Vítor Silva, José Leite, José Camata, Daniel De Oliveira, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
      Future Generation Computer Systems, Elsevier, 2017, 75, pp.402-422.

    2016

    1. CloudMdsQL: Querying Heterogeneous Cloud Data Stores with a Common Language
      Boyan Kolev, Patrick Valduriez, Carlyna Bondiombouy, Ricardo Jimenez-Peris, Raquel Pau, José Pereira
      Distributed and Parallel Databases, Springer, 2016, 34 (4), pp.463-503.
    2. AgroLD API. Une architecture orientée services pour l'extraction de connaissances dans la base de données liées AgroLD
      Gildas Tagny Ngompe, Aravind Venkatesan, Nordine El Hassouni, Manuel Ruiz, Pierre Larmande
      Revue des Sciences et Technologies de l'Information - Série ISI : Ingénierie des Systèmes d'Information, Lavoisier, 2016, 21 (5-6), pp.133-158.
    3. Database System Support of Simulation Data
      Hermano Lustosa, Fabio Porto, Pablo Blanco, Patrick Valduriez
      Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2016, 9 (13), pp.1329-1340.
    4. Categorizing plant images at the variety level: Did you say fine-grained?
      Julien Champ, Titouan Lorieul, Pierre Bonnet, Najate Maghnaoui, Christophe Sereno, Thierry Dessup, Jean-Michel Boursiquot, Laurent Audeguin, Thierry Lacombe, Alexis Joly
      Pattern Recognition Letters, Elsevier, 2016, In press. <10.1016/j.patrec.2016.05.022>
    5. Gigwa—Genotype investigator for genome- wide analyses
      Guilhem Sempéré, Florian Philippe, Alexis Dereeper, Manuel Ruiz, Gautier Sarah, Pierre Larmande
      GigaScience, BioMed Central, 2016. <10.1186/s13742-016-0131-8>
    6. Social Networks and Information Retrieval, How Are They Converging? A Survey, a Taxonomy and an Analysis of Social Information Retrieval Approaches and Platforms
      Mohamed Reda Bouadjenek, Hakim Hacid, Mokrane Bouzeghoub
      Information Systems, Elsevier, 2016, 56, pp.1-18.

    2015

    1. Rank aggregation with ties: Experiments and Analysis
      Bryan Brancotte, Bo Yang, Guillaume Blin, Sarah Cohen-Boulakia, Alain Denise, Sylvie Hamel
      Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2015, pp.2051.
    2. Increasing Coverage in Distributed Search and Recommendation with Profile Diversity
      Maximilien Servajean, Esther Pacitti, Miguel Liroz-Gistau, Sihem Amer-Yahia, Amr El Abbadi
      Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2015, LNCS (9430), pp.115-144.
    3. Profile Diversity for Query Processing using User Recommendations
      Maximilien Servajean, Reza Akbarinia, Esther Pacitti, Sihem Amer-Yahia
      Information Systems, Elsevier, 2015, Information Systems, 48, pp.44-63.

    2014

    1. Autonomic Intrusion Detection: Adaptively Detecting Anomalies over Unlabeled Audit Data Streams in Computer Networks
      Wei Wang, Thomas Guyet, René Quiniou, Marie-Odile Cordier, Florent Masseglia, Xiangliang Zhang
      Knowledge-Based Systems, Elsevier, 2014.
    2. Special section on data-intensive cloud infrastructure
      Ashraf Aboulnaga, Beng Chin Ooi, Patrick Valduriez
      The VLDB Journal, Springer, 2014, pp.1.
    3. The anti-bouncing data stream model for web usage streams with intralinkings
      Chongsheng Zhang, Florent Masseglia, Yves Lechevallier
      Information Sciences, Elsevier, 2014, 278, pp.757-772.
    4. Similarity Search for Scientific Workflows
      Johannes Starlinger, Bryan Brancotte, Sarah Cohen-Boulakia, Ulf Leser
      Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2014, 7 (12), pp.1143-1154.
    5. Query Reformulation in PDMS Based on Social Relevance
      Angela Bonifati, Gianvito Summa, Esther Pacitti, Fady Draidi
      Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2014, Transactions on Large-Scale Data- and Knowledge-Centered Systems XIII, LNCS, pp.59-90.
    6. Object-based visual query suggestion
      Amel Hamzaoui, Pierre Letessier, Alexis Joly, Olivier Buisson, Nozha Boujemaa
      Multimedia Tools and Applications, Springer Verlag, 2014, Multimedia Tools and Applications, 68 (2), pp.429-454.

    2013

    1. Stress Testing of Transactional Database Systems
      Jorge Augusto Meira, Eduardo Cunha de Almeida, Gerson Sunyé, Yves Le Traon, Patrick Valduriez
      Journal of Information and Data Management, Brazilian Computer Society, 2013, 4 (3). <http://hdl.handle.net/10993/9919>
    2. A Hierarchical Grid Index (HGI), spatial queries in wireless data broadcasting
      Kwangjin Park, Patrick Valduriez
      Distributed and Parallel Databases, Springer, 2013, 31 (3), pp.413-446.

Communications internationales

2018

  1. Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition
    Mathieu Fontaine, Fabian-Robert Stöter, Antoine Liutkus, Umut Simsekli, Romain Serizel, Roland Badeau
    LVA ICA 2018 - 14th International Conference on Latent Variable Analysis and Signal Separation, Jul 2018, Surrey, United Kingdom. 2018.
  2. The 2018 Signal Separation Evaluation Campaign
    Fabian-Robert Stöter, Antoine Liutkus, Nobutaka Ito
    LVA ICA : 14th International Conference on Latent Variable Analysis and Signal Separation, Jul 2018, Surrey, United Kingdom. 2018. <http://cvssp.org/events/lva-ica-2018/>

2013

  1. Opening the Black Box of Ontology Matching
    Duy Hoa Ngo, Zohra Bellahsene, Konstantin Todorov
    ESWC'2013: 10th Semantics and Big Data, Montpellier, France. pp.16-30, 2013.

2011

  1. Principles of Distributed Data Management in 2020?
    Patrick Valduriez
    DEXA'11: International Conference on Databases and Expert Systems Applications, 2011, Toulouse, France. Springer, 6860, pp.1-11, 2011, Lecture Notes in Computer Science.
  2. Scaling Up Query Allocation in the Presence of Autonomous Participants
    Quiané-Ruiz Jorge, Philippe Lamarre, Sylvie Cazalens, Patrick Valduriez
    DASFAA'11: International Conference on Database Systems for Advanced Applications, 2011, Hong Kong, China. Springer, 6588, pp.210-224, 2011, Lecture notes in computer science.
  3. Modeling View Selection as a Constraint Satisfaction Problem
    Imene Mami, Remi Coletta, Zohra Bellahsene
    DEXA'2011: 22nd International Conference on Database and Expert Systems Applications, Toulouse, France. pp.396-410, 2011, LNCS.

2010

  1. Improving Many-Task Computing in Scientific Workflows Using P2P Techniques
    Jonas Dias, Eduardo Ogasawara, Daniel De Oliveira, Esther Pacitti, Marta Mattoso
    MTAGS: Many-Task Computing on Grids and Supercomputers, 2010, New Orleans, United States. 3rd IEEE Workshop on Many-Task Computing on Grids and Supercomputers, pp.31-40, 2010.

2007

  1. Learning Implied Global Constraints
    Christian Bessière, Remi Coletta, Thierry Petit
    IJCAI'07: International Joint Conference on Artificial Intelligence, 2007, Hyderabad, India. pp.50-55, 2007.
  2. Query-Driven Constraint Acquisition
    Christian Bessière, Remi Coletta, Barry O'Sullivan, Mathias Paulin
    IJCAI'07: International Joint Conference on Artificial Intelligence, 2007, Hyderabad, India. pp.44-49, 2007.
  3. 2018

    1. Interference reduction on full-length live recordings
      Diego Carlo, Antoine Liutkus, Ken Déguernel
      ICASSP 2018 - IEEE International Conference on Acoustics, Speech, and Signal Processing, Apr 2018, Calgary, Canada. pp.1-5.
    2. Blind Source Separation Using Mixtures of Alpha-Stable Distributions
      Nicolas Keriven, Antoine Deleforge, Antoine Liutkus
      ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Canada. pp.1-5.
    3. Alpha-stable low-rank plus residual decomposition for speech enhancement
      Umut Simsekli, Halil Erdogan, Simon Leglaive, Antoine Liutkus, Roland Badeau, Gaël Richard
      ICASSP 2018 - IEEE International Conference on Acoustics, Speech, and Signal Processing, Apr 2018, Calgary, Canada. 2018.
    4. Audio source separation with magnitude priors: the BEADS model
      Antoine Liutkus, Christian Rohlfing, Antoine Deleforge
      ICASSP 2018 - IEEE International Conference on Acoustics, Speech, and Signal Processing, Apr 2018, Calgary, Canada. pp.1-5.
    5. Maximally Informative k-Itemset Mining from Massively Distributed Data Streams
      Mehdi Zitouni, Reza Akbarinia, Sadok Ben Yahia, Florent Masseglia
      SAC 2018 - 33rd ACM/SIGAPP Symposium On Applied Computing, Apr 2018, Pau, France. pp.1-10.
    6. The role of hydraulics FSPMs in the context of root breeding : a case study on Pearl Millet
      Adama Ndour, Christophe Pradal, Vincent Vadez, Sixtine Passot, Yann Guédon, Laurent Laplaze, Mikael Lucas
      EGU 2018, Apr 2018, Vienne, Austria. 20.

    2017

    1. DPiSAX: Massively Distributed Partitioned iSAX
      Djamel-Edine Yagoubi, Reza Akbarinia, Florent Masseglia, Themis Palpanas
      ICDM 2017: IEEE International Conference on Data Mining, Nov 2017, New Orleans, United States. pp.1-6, 2017.
    2. Querying Key-Value Stores Under Simple Semantic Constraints : Rewriting and Parallelization
      Olivier Rodriguez, Corentin Colomier, Cecilie Rivière, Reza Akbarinia, Federico Ulliana
      BDA: Conférence sur la Gestion de Données — Principes, Technologies et Applications ", Nov 2017, Nancy, France. 2017. <https://project.inria.fr/bda2017/>
    3. Efficient Scheduling of Scientific Workflows using Hot Metadata in a Multisite Cloud
      Ji Liu, Luis Pineda-Morales, Esther Pacitti, Alexandru Costan, Patrick Valduriez, Gabriel Antoniu, Marta Mattoso
      BDA: Conférence sur la Gestion de Données — Principes, Technologies et Applications, Nov 2017, Nancy, France. pp.13, 2017.
    4. TARS: An Array Model with Rich Semantics for Multidimensional Data
      Hermano Lustosa, Noel Lemus, Fabio Porto, Patrick Valduriez
      ER FORUM 2017: Conceptual Modeling : Research In Progress, Nov 2017, Valencia, Spain. 2017.
    5. End-to-end Graph Mapper
      Benjamin Billet, Mickaël Jurret, Didier Parigot, Patrick Valduriez
      BDA: Conférence sur la Gestion de Données — Principes, Technologies et Applications, Nov 2017, Nancy, France. 2017.
    6. Tracking of Online Parameter Fine-tuning in Scientific Workflows
      Renan Souza, Vitor Silva, José Camata, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
      Workflows in Support of Large-Scale Science (WORKS), in conjunction with ACM/IEEE Supercomputing., Nov 2017, Denver, United States. 2017.
    7. Pl@ntNet -My Business
      Alexis Joly, Pierre Bonnet, Antoine Affouard, Jean-Christophe Lombardo, Hervé Goëau
      ACM Multimedia 2017, Oct 2017, Mountain View, United States. pp.1-11.
    8. RadiusSketch: Massively Distributed Indexing of Time Series
      Djamel-Edine Yagoubi, Reza Akbarinia, Florent Masseglia, Dennis Shasha
      DSAA 2017: IEEE International Conference on Data Science and Advanced Analytics, Oct 2017, Tokyo, Japan. pp.1-10, 2017.
    9. Spark Scalability Analysis in a Scientific Workflow
      Renan Souza, Vitor Silva, Pedro Miranda, Alexandre Lima, Patrick Valduriez, Marta Mattoso
      SBBD 2017: 32th Brazilian Symposium on Databases, Oct 2017, Uberlandia, Brazil. pp.1-6, 2017.
    10. Automated Herbarium Specimen Identification using Deep Learning
      Jose Carranza-Rojas, Alexis Joly, Pierre Bonnet, Hervé Goëau, Erick Mata-Montero
      TDWG 2017 - Annual Conference on Biodiversity Information Standards, Oct 2017, Ottawa, Canada. 2017. <10.3897/tdwgproceedings.1.20302>
    11. LifeCLEF Bird Identification Task 2017
      Herve Goeau, Hervé Glotin, Willem-Pier Vellinga, Robert Planqué, Alexis Joly
      CLEF 2017 - Conference and Labs of the Evaluation Forum, Sep 2017, Dublin, Ireland. pp.1-9.
    12. Plant identification based on noisy web data: the amazing performance of deep learning (LifeCLEF 2017)
      Herve Goeau, Pierre Bonnet, Alexis Joly
      CLEF 2017 - Conference and Labs of the Evaluation Forum, Sep 2017, Dublin, Ireland. pp.1-13, 2017.
    13. LifeCLEF 2017 Lab Overview: Multimedia Species Identification Challenges
      Alexis Joly, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Pierre Bonnet, Willem-Pier Vellinga, Jean-Christophe Lombardo, Robert Planque, Simone Palazzo, Henning Müller
      Gareth J.F. Jones; Séamus Lawless; Julio Gonzalo; Liadh Kelly; Lorraine Goeuriot; Thomas Mandl; Linda Cappellato; Nicola Ferro. CLEF: Cross-Language Evaluation Forum for European Languages, Sep 2017, Dublin, Ireland. Springer, 8th International Conference of the Cross-Language Evaluation Forum for European Language, LNCS (10456), pp.255-274, 2017, Experimental IR Meets Multilinguality, Multimodality, and Interaction.
    14. Pre-processing and Indexing techniques for Constellation Queries in Big Data
      Amir Khatibi, Fabio Porto, Joao Rittmeyer, Eduardo Ogasawara, Patrick Valduriez, Dennis Shasha
      DaWaK 2017: 19th International Conference on Big Data Analytics and Knowledge Discovery, Aug 2017, Lyon, France. Springer, LNCS, pp.74-87, 2017, Big Data Analytics and Knowledge Discovery.
    15. TARDIS: Optimal Execution of Scientific Workflows in Apache Spark
      Daniel Gaspar, Fabio Porto, Reza Akbarinia, Esther Pacitti
      DaWaK 2017: Data Warehousing and Knowledge Discovery, Aug 2017, Lyon, France. 19th International Conference on Big Data Analytics and Knowledge Discovery, pp.74-87, 2017, LNCS.
    16. Massively Distributed Environments and Closed Itemset Mining: The DCIM Approach
      Mehdi Zitouni, Reza Akbarinia, Sadok Ben Yahia, Florent Masseglia
      CAiSE: Advanced Information Systems Engineering, Jun 2017, Essen, Germany. 29th International Conference on Advanced Information Systems Engineering, LNCS (10253), pp.231-246, 2017.
    17. Pl@ntNet app in the era of deep learning
      Antoine Affouard, Hervé Goëau, Pierre Bonnet, Jean-Christophe Lombardo, Alexis Joly
      nnet, Jean-Christophe Lombardo, Alexis Joly. Pl@ntNet app in the era of deep learning. ICLR 2017 - Workshop Track - 5th International Conference on Learning Representations, Apr 2017, Toulon, France. pp.1-6.

    2016

    1. Managing Hot Metadata for Scientific Workflows on Multisite Clouds
      Luis Pineda-Morales, Ji Liu, Alexandru Costan, Esther Pacitti, Gabriel Antoniu, Patrick Valduriez, Marta Mattoso
      BIGDATA 2016 - 2016 IEEE International Conference on Big Data, Dec 2016, Washington, United States. 2016.
    2. Benchmarking Polystores: the CloudMdsQL Experience
      Boyan Kolev, Raquel Pau, Oleksandra Levchenko, Patrick Valduriez, Ricardo Jiménez-Peris, José Pereira
      Vijay Gadepally. International Conference on Big Data, Dec 2016, Washington, DC, United States. IEEE Computing Society, IEEE BigData 2016: Workshop on Methods to Manage Heterogeneous Big Data and Polystore Databases, 2017. <10.1109/BigData.2016.7840899>
    3. Extending CloudMdsQL with MFR for Big Data Integration
      Carlyna Bondiombouy, Boyan Kolev, Patrick Valduriez, Oleksandra Levchenko
      BDA: Bases de Données Avancées, Nov 2016, Poitiers, France. 32ème Conférence sur la Gestion de Données - Principes, Technologies et Applications, 2016. <https://bda2016.ensma.fr>
    4. Online Input Data Reduction in Scientific Workflows
      Renan Souza, Vítor Silva, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
      ACM SIGHPC; IEEE. WORKS: Workflows in Support of Large-scale Science, Nov 2016, Salt Lake City, United States. 11th Workshop on Workflows in Support of Large-scale Science, in conjunction with SC2016, 2016. <http://works.cs.cardiff.ac.uk>
    5. Crowdsourcing Biodiversity Monitoring: How Sharing your Photo Stream can Sustain our Planet
      Alexis Joly, Hervé Goëau, Julien Champ, Samuel Dufour-Kowalski, Henning Müller, Pierre Bonnet
      ACM Multimedia 2016, Oct 2016, Amsterdam, Netherlands. <http://www.acmmm.org/2016/>
    6. ThePlantGame: Actively Training Human Annotators for Domain-specific Crowdsourcing
      Maximilien Servajean, Alexis Joly, Dennis Shasha, Julien Champ, Esther Pacitti
      ACM Multimedia 2016, Oct 2016, Amsterdam, Netherlands.
    7. LifeCLEF Bird Identification Task 2016: The arrival of Deep learning
      Hervé Goëau, Hervé Glotin, Willem-Pier Vellinga, Robert Planqué, Alexis Joly
      Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, Sep 2016, Evora, Portugal. pp.440--449, 2016.
    8. Unsupervised Individual Whales Identification: Spot the Difference in the Ocean
      Alexis Joly, Jean-Christophe Lombardo, Julien Champ, Anjara Saloma
      Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, Sep 2016, Evora, Portugal. pp.469--480, 2016.
    9. Plant Identification in an Open-world (LifeCLEF 2016)
      Hervé Goëau, Pierre Bonnet, Alexis Joly
      CLEF 2016 - Conference and Labs of the Evaluation forum, Sep 2016, Évora, Portugal. Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, pp.428--439, 2016.
    10. LifeCLEF 2016: Multimedia Life Species Identification Challenges
      Alexis Joly, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Pierre Bonnet, Willem-Pier Vellinga, Julien Champ, Robert Planqué, Simone Palazzo, Henning Müller
      Norbert Fuhr; Paulo Quaresma; Teresa Gonçalves ; Birger Larsen ; Krisztian Balog ; Craig Macdonald; Linda Cappellato; Nicola Ferro. CLEF 2016 - 7th International Conference of the CLEF Association, Sep 2016, Evora, Portugal. Springer, pp.286--310, 2016, Experimental IR Meets Multilinguality, Multimodality, and Interaction.
    11. Floristic participation at LifeCLEF 2016 Plant Identification Task
      Julien Champ, Hervé Goëau, Alexis Joly
      CLEF 2016 - Conference and Labs of the Evaluation forum, Sep 2016, Évora, Portugal. Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, pp.450--458, 2016.
    12. Enhancing Energy Production with Exascale HPC Methods
      José Camata, José Cela, Danilo Costa, Alvaro Lga Coutinho, Daniel Fernández-Galisteo, Carmen Jimenez, Vadim Kourdioumov, Marta Mattoso, Rafael Mayo-García, Thomas Miras, José Moríñigo, Jorge Navarro, Philippe Navaux, Daniel De Oliveira, Manuel Rodríguez-Pascual, Vítor Silva, Renan Souza, Patrick Valduriez
      CARLA 2016 - Latin American High Performance Computing Conference, Aug 2016, Mexico City, Mexico. Springer, 3rd Latin American High Performance Computing Conference, CCIS (697), pp.233-246, 2017.
    13. Scientific Workflow Scheduling with Provenance Support in Multisite Cloud
      Ji Liu, Esther Pacitti, Patrick Valduriez, Marta Mattoso
      VECPAR, Jun 2016, Porto, Portugal. 12th International Meeting on High Performance Computing for Computational Science, pp.8, 2016.
    14. The CloudMdsQL Multistore System
      Boyan Kolev, Carlyna Bondiombouy, Patrick Valduriez, Ricardo Jiménez-Peris, Raquel Pau, José Pereira
      SIGMOD, Jun 2016, San Francisco, United States. ACM SIGMOD/PODS Conference, 2016. <10.1145/2882903.2899400>
    15. Development of a knowledge system for Big Data: Case study to plant phenotyping data
      Luyen Le Ngoc, Anne Tireau, Aravind Venkatesan, Pascal Neveu, Pierre Larmande
      WIMS '16 Proceedings of the 6th International Conference on Web Intelligence, Mining and Semantics, Jun 2016, Nimes, France. ACM. <10.1145/2912845.2912869>
    16. Spatially Localized Visual Dictionary Learning
      Valentin Leveau, Alexis Joly, Olivier Buisson, Patrick Valduriez
      ICMR '16 Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval , Jun 2016, New York, United States. ACM, pp.367--370, 2016.
    17. Exposing French agronomic resources as Linked Open Data
      Aravind Venkatesan, Nordine El Hassouni, Florian Phillipe, Cyril Pommier, Hadi Quesneville, Manuel Ruiz, Pierre Larmande
      Ingenierie des Connaissances IC2016 - Workshop In Ovive, Jun 2016, Montpellier, France.
    18. A New Privacy-Preserving Solution for Clustering Massively Distributed Personal Times-Series
      Tristan Allard, Georges Hébrail, Florent Masseglia, Esther Pacitti
      ICDE: International Conference on Data Engineering, May 2016, Helsinki, Finland. 32nd IEEE International Conference on Data Engineering, ICDE 2016, 2016. <http://icde2016.fi/>
    19. Design and Implementation of the CloudMdsQL Multistore System
      Boyan Kolev, Carlyna Bondiombouy, Oleksandra Levchenko, Patrick Valduriez, Ricardo Jimenez-Péris, Raquel Pau, Jose Pereira
      CLOSER: Cloud Computing and Services Science, Apr 2016, Roma, Italy. 6th International Conference on Cloud Computing and Services Science, 1, pp.352-359, 2016, DataDiversityConvergence Workshop.

    2015

    1. Exposing French agronomic resources as Linked Open Data
      Aravind Venkatesan, Nordine El Hassouni, Florian Philippe, Cyril Pommier, Hadi Quesneville, Manuel Ruiz, Pierre Larmande
      SWAT4LS: Semantic Web Applications and Tools for Life Sciences, Dec 2015, Cambridge, United Kingdom. 1546, 2015. <http://ceur-ws.org/Vol-1546/>
    2. Managing Simulation Data with Multidimensional Arrays
      Hermano Lustosa, Fabio Porto, Ramon Costa, Pablo Blanco, Patrick Valduriez
      SBBD'2015: Simpósio Brasileiro de Banco de Dados, Oct 2015, Petropolis, Brazil. pp.7, 2015.
    3. Ontology-based services and knowledge management in the Agronomic Domain
      Pierre Larmande
      RDA: Research Data Alliance, Sep 2015, Paris, France. The 6th Research Data Alliance plenary meeting, 2015. <https://rd-alliance.org/plenary-meetings/rda-sixth-plenary-meeting.html>
    4. LifeCLEF Bird Identification Task 2015
      Hervé Goëau, Hervé Glotin, Willem-Pier Vellinga, Robert Planqué, Andreas Rauber, Alexis Joly
      CEUR-WS. CLEF: Conference and Labs of the Evaluation forum, Sep 2015, toulouse, France. Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum - Toulouse, France, September 8-11, 2015., 1391, 2015, CLEF2015 working notes. <http://ceur-ws.org/Vol-1391/>
    5. Shared nearest neighbors match kernel for bird songs identification -LifeCLEF 2015 challenge
      Alexis Joly, Valentin Leveau, Julien Champ, Olivier Buisson
      ceur-ws. CLEF: Conference and Labs of the Evaluation forum, Sep 2015, Toulouse, France. Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum - Toulouse, France, September 8-11, 2015., 1391, 2015, CLEF2015 working notes. <http://ceur-ws.org/Vol-1391/>
    6. LifeCLEF Plant Identification Task 2015
      Hervé Goëau, Pierre Bonnet, Alexis Joly
      CEUR-WS. CLEF: Conference and Labs of the Evaluation forum, Sep 2015, Toulouse, France. Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum - Toulouse, France, September 8-11, 2015., 1391, 2015, CLEF2015 Working notes. <http://ceur-ws.org/Vol-1391/>
    7. LifeCLEF 2015: Multimedia Life Species Identification Challenges
      Alexis Joly, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Pierre Bonnet, Willem-Pier Vellinga, Robert Planqué, Andreas Rauber, Simone Palazzo, Bob Fisher, Henning Müller
      CLEF: Conference and Labs of the Evaluation forum, Sep 2015, Toulouse, France. Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum - Toulouse, France, September 8-11, 2015., 2015.
    8. A comparative study of fine-grained classification methods in the context of the LifeCLEF plant identification challenge 2015
      Julien Champ, Titouan Lorieul, Maximilien Servajean, Alexis Joly
      CEUR-WS. CLEF: Conference and Labs of the Evaluation forum, Sep 2015, Toulouse, France. Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum - Toulouse, France, September 8-11, 2015., 1391, 2015, CLEF2015 working notes. <http://ceur-ws.org/Vol-1391/>
    9. An Efficient Solution for Processing Skewed MapReduce Jobs
      Reza Akbarinia, Miguel Liroz-Gistau, Divyakant Agrawal, Patrick Valduriez
      Globe'2015: 8th International Conference on Data Management in Cloud, Grid and P2P Systems, Sep 2015, Valencia, Spain.
    10. Integrating Big Data and Relational Data with a Functional SQL-like Query Language
      Carlyna Bondiombouy, Boyan Kolev, Oleksandra Levchenko, Patrick Valduriez
      Qiming Chen; Abdelkader Hameurlain; Farouk Toumani; Roland Wagner; Hendrik Decker. DEXA’2015: 26th International Conference on Database and Expert Systems Applications, Sep 2015, Valencia, Spain. Lecture Notes in Computer Science 9261, Springer 2015, ISBN 978-3-319-22848-8, 2015.
    11. A Prime Number Based Approach for Closed Frequent Itemset Mining in Big Data
      Mehdi Zitouni, Reza Akbarinia, Sadok Ben Yahia, Florent Masseglia
      DEXA: Database and Expert Systems Applications, Sep 2015, Valencia, Spain. 26th International Conference on Database and Expert Systems Applications, LNCS (9261), pp.509-516, 2015.
    12. Data Partitioning for Fast Mining of Frequent Itemsets in Massively Distributed Environments
      Saber Salah, Reza Akbarinia, Florent Masseglia
      DEXA: Database and Expert Systems Applications, Sep 2015, Valencia, Spain. 26th International Conference on Database and Expert Systems Applications, 2015. <http://www.dexa.org>
    13. Fast Parallel Mining of Maximally Informative k-Itemsets in Big Data
      Saber Salah, Reza Akbarinia, Florent Masseglia
      IEEE International Conference on Data Mining, Aug 2015, Atlantic city, United States. 2015.
    14. When sharing computer science with everyone also helps avoiding digital prejudices.
      Marie Duflot, Martin Quinson, Florent Masseglia, Didier Roy, Julien Vaubourg, Thierry Viéville
      Escape computer dirty magic: learn Scratch !. Scratch2015AMS, Aug 2015, Amsterdam, Netherlands. 2015.
    15. On Term Selection Techniques for Patent Prior Art Search
      Mona Golestan Far, Scott Sanner, Mohamed Reda Bouadjenek, Gabriela Ferraro, David Hawking
      SIGIR: Research and Development in Information Retrieval, Aug 2015, Santiago, Chile. ACM, 2015, SIGIR '15: 38th International SIGIR Conference on Research and Development in Information Retrieval. <10.1145/2766462.2767801>
    16. Optimizing the Data-Process Relationship for Fast Mining of Frequent Itemsets in MapReduce
      Saber Salah, Reza Akbarinia, Florent Masseglia
      MLDM'2015: International Conference on Machine Learning and Data Mining, Jul 2015, Hamburg, Germany. Machine Learning and Data Mining in Pattern Recognition, 9166, pp.217-231, 2015, LNCS.
    17. Aggregation-Aware Compression of Probabilistic Streaming Time Series
      Reza Akbarinia, Florent Masseglia
      MLDM'2015: International Conference on Machine Learning and Data Mining, Jul 2015, Hamburg, Germany.
    18. Towards efficient data integration and knowledge management in the Agronomic domain
      Aravind Venkatesan, Nordine El Hassouni, Florian Phillipe, Cyril Pommier, Hadi Quesneville, Manuel Ruiz, Pierre Larmande
      APIA: Applications Pratiques de l'Intelligence Artificielle , Jul 2015, Rennes, France. 1ère conférence sur les Application Pratiques de l'Intelligence Artificielle (APIA), 2015. <http://pfia2015.inria.fr/actes/index.php?procpage=apia>
    19. OpenAlea: Scientific Workflows Combining Data Analysis and Simulation
      Christophe Pradal, Christian Fournier, Patrick Valduriez, Sarah Cohen-Boulakia
      SSDBM 2015: 27th International Conference on Scientific and Statistical Database Management, Jun 2015, San Diego, United States. <10.1145/2791347.2791365>
    20. DigInPix: Visual Named-Entities Identification in Images and Videos
      Pierre Letessier, Nicolas Hervé, Alexis Joly, Hakim Nabi, Mathieu Derval, Olivier Buisson
      ICRM: International Conference on Multimedia Retrieval, Jun 2015, Shanghai, China. ACM, Proceedings of the 5th ACM on International Conference on Multimedia Retrieval - ICMR '15, pp.661-664, 2015.
    21. Kernelizing Spatially Consistent Visual Matches for Fine-Grained Classification
      Valentin Leveau, Alexis Joly, Olivier Buisson, Patrick Valduriez
      International Conference on Multimedia Retrieval 2015, Jun 2015, Shangai, China.
    22. A Study of Query Reformulation for Patent Prior Art Search with Partial Patent Applications
      Mohamed Reda Bouadjenek, Scott Sanner, Gabriela Ferraro
      ICAIL: International Conference on Artificial Intelligence and Law, Jun 2015, San Diego, United States. 2015, ICAIL'2015: 15th International Conference on Artificial Intelligence and Law.
    23. Chiaroscuro: Transparency and Privacy for Massive Personal Time-Series Clustering
      Tristan Allard, Georges Hébrail, Florent Masseglia, Esther Pacitti
      ACM SIGMOD. SIGMOD: Conference on Management of Data, May 2015, Melbourne, Australia. SIGMOD '15- Proceedings of the 2015 ACM SIGMOD 34th International Conference on Management of Data, 2015. <10.1145/2723372.2749453>
    24. Data-intensive HPC: opportunities and challenges
      Patrick Valduriez
      BDEC'2015: Big Data and Extreme-scale Computing, Jan 2015, Barcelone, Spain. 2015.

    2014

    1. Fine-grained Visual Faceted Search
      Julien Champ, Alexis Joly, Bonnet Pierre
      ACM Multimedia, Nov 2014, Orlando, FL, United States. The 22nd ACM International Conference on Multimedia - November 3-7, 2014 | Orlando, FL, USA. <10.1145/2647868.2654875>
    2. Recognizing Thousands of Legal Entities through Instance-based Visual Classification
      Valentin Leveau, Alexis Joly, Olivier Buisson, Pierre Letessier, Patrick Valduriez
      ACM Multimedia, Nov 2014, Orlando, FL, United States. The 22nd ACM International Conference on Multimedia - November 3-7, 2014 | Orlando, FL, USA, 2014. <10.1145/2647868.2655038>
    3. NACluster: A Non-Supervised Clustering Algorithm for Matching Multi Catalogues
      Vinicius P. Freire, José A. F. De Macêdo, Fábio Porto, Reza Akbarinia
      IEEE e-Science Workshop, Oct 2014, Guarujá, SP, Brazil. 2014. <http://escience.ime.usp.br/preliminary-program/accepted-papers/accepted-papers-workshops>
    4. Layer Decomposition: An Effective Structure-based Approach for Scientific Workflow Similarity
      Johannes Starlinger, Sarah Cohen-Boulakia, Sanjeev Khanna, Susan Davidson, Ulf Leser
      IEEE e-Science conference, Oct 2014, Guarujá, Brazil. 2014.
    5. PlantRT : a Distributed Recommendation Tool for Citizen Science
      Maximilien Servajean, Esther Pacitti, Miguel Liroz-Gistau, Alexis Joly, Julien Champ
      BDA: Bases de Données Avancées, Oct 2014, Autrans, France. BDA 2014 : Gestion de données - principes, technologies et applications, pp.48-50, 2014.
    6. Exploiting Diversification in Distributed Recommendation
      Maximilien Servajean, Esther Pacitti, Miguel Liroz-Gistau, Sihem Amer-Yahia, Amr El Abbadi
      BDA: Bases de Données Avancées, Oct 2014, Grenoble-Autrans, France. INRIA-SILICONVALLEY, 2014, Gestion de Données – Principes, Technologies et Applications. <http://bda2014.imag.fr>
    7. Instance-based bird species identication with undiscriminant features pruning - LifeCLEF 2014
      Alexis Joly, Julien Champ, Olivier Buisson
      CLEF: Conference and Labs of the Evaluation Forum, Sep 2014, Sheffield, United Kingdom. 2014, Information Access Evaluation meets Multilinguality, Multimodality, and Interaction. <http://clef2014.clef-initiative.eu>
    8. Lifeclef 2014: multimedia life species identification challenges
      Alexis Joly, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Pierre Bonnet, Willem-Pier Vellinga, Robert Planque, Andreas Rauber, Bob Fisher, Henning Müller
      CLEF: Conference and Labs of the Evaluation forum, Sep 2014, Sheffield, United Kingdom. 5th International Conference of the CLEF Initiative, CLEF 2014, Sheffield, UK, September 15-18, 2014. Proceedings, LNCS (8685), pp.229-249, 2014, Information Access Evaluation. Multilinguality, Multimodality, and Interaction.
    9. LifeCLEF Bird Identification Task 2014
      Hervé Goëau, Hervé Glotin, Willem-Pier Vellinga, Robert Planqué, Andreas Rauber, Alexis Joly
      CLEF: Conference and Labs of the Evaluation Forum, Sep 2014, Sheffield, United Kingdom. 2014, Information Access Evaluation meets Multilinguality, Multimodality, and Interaction. <http://clef2014.clef-initiative.eu>
    10. Exploiting Diversification in Gossip-Based Recommendation
      Maximilien Servajean, Esther Pacitti, Miguel Liroz-Gistau, Sihem Amer-Yahia, Amr El Abbadi
      Globe'2014: 7th International Conference, Sep 2014, Munich, Germany. INRIA-SILICONVALLEY, LNCS (8648), pp.25-36, 2014, Data Management in Cloud, Grid and P2P Systems.
    11. Scientific Workflow Partitioning in Multi-site Clouds
      Ji Liu, Esther Pacitti, Patrick Valduriez, Vitor Silva Souza, Marta Mattoso
      L. Lopes. BigDataCloud'2014: 3rd Workshop on Big Data Management in Clouds in conjunction with Euro-Par 2014, Aug 2014, Porto, Portugal. Springer, Lecture Notes in Computer Science, 8805, pp.105-116, 2014, LNCS.
    12. Towards Efficient Power Management in MapReduce: Investigation of CPU-Frequencies Scaling on Power Efficiency in Hadoop
      Shadi Ibrahim, Diana Moise, Houssem-Eddine Chihoub, Alexandra Carpen-Amarie, Luc Bougé, Gabriel Antoniu
      Workshop on Adaptive Resource Management and Scheduling for Cloud Computing, Held in conjunction with PODC 2014, Jul 2014, Paris, France.
    13. Pl@ntNet Mobile 2014: Android port and new features
      Hervé Goëau, Bonnet Pierre, Alexis Joly, Antoine Affouard, Vera Bakić, Julien Barbe, Samuel Dufour-Kowalski, Souheil Selmi, Yahiaoui Itheri, Christel Vignau, Daniel Barthelemy, Nozha Boujemaa
      ICMR 2014 International Conference on Multimedia Retrieval, Apr 2014, Glasgow, France. <10.1145/2578726.2582618>
    14. LifeCLEF: Multimedia Life Species Identification
      Alexis Joly, Robert Planque, Concetto Spampinato, Henning Müller, Hervé Goëau, Andreas Rauber, Bonnet Pierre, Willem-Pier Vellinga, Robert B. Fisher, Hervé Glotin
      EMR 2014, 1st International Workshop on Environnmental Multimedia Retrieval co-located with ACM International Conference on Multimedia Retrieval (ICMR 2014), Apr 2014, Glasgow, United Kingdom. <http://ceur-ws.org/Vol-1222/>

    2013

    1. OTmedia: The French Transmedia News Observatory
      Nicolas Hervé, Marie-Luce Viaud, Jérôme Thievre, Agnès Saulnier, Pierre Letessier, Julien Champ, Olivier Buisson, Alexis Joly
      MM '13: 21st ACM international conference on Multimedia, Oct 2013, Barcelone, Spain. pp.441-442, 2013.
    2. Small objects query suggestion in a large web-image collection
      Pierre Letessier, Nicolas Hervé, Champ Julien, Alexis Joly, Olivier Buisson, Amel Hamzaoui
      MM'13: ACM Multimedia, Oct 2013, Barcelone, Spain. ACM, 2013. <10.1145/2502081.2502248>
    3. The Imageclef Plant Identification Task 2013
      Alexis Joly, Hervé Goëau, Pierre Bonnet, Vera Bakić, Jean-François Molino, Daniel Barthélémy, Nozha Boujemaa
      International workshop on Multimedia analysis for ecological data, Oct 2013, Barcelone, Spain. 2013.
    4. Pl@ntNet Mobile App
      Hervé Goëau, Pierre Bonnet, Alexis Joly, Vera Bakić, Julien Barbe, Souheil Selmi, Jennifer Carré, Daniel Barthélémy, Nozha Boujemaa, Jean-François Molino, Grégoire Duché, Aurélien Perronet
      ACM Multimedia, Oct 2013, Barcelone, Spain. ACM, pp.423-424, 2013.
    5. Algebraic Dataflows for Big Data Analysis
      Dias Jonas, Eduardo Ogasawara, Oliveira Daniel De, Fabio Porto, Patrick Valduriez, Marta Mattoso
      BigData'2013: International Conference on Big Data, Oct 2013, Santa Clara, United States. IEEE, pp.6, 2013.
    6. A Density-Based Backward Approach to Isolate Rare Events in Large-Scale Applications
      Enikö Székely, Pascal Poncelet, Florent Masseglia, Maguelonne Teisseire, Renaud Cezar
      Johannes Fürnkranz and Eyke Hüllermeier and Tomoyuki Higuchi. DS: Discovery Science, Oct 2013, Singapore, Singapore. Springer, pp.249-264, 2013, Lecture Notes in Computer Science.
    7. Inria's participation at ImageCLEF 2013 Plant Identification Task
      Vera Bakić, Sofiène Mouine, Saloua Ouertani-Litayem, Anne Verroust-Blondet, Itheri Yahiaoui, Hervé Goëau, Alexis Joly
      CLEF (Online Working Notes/Labs/Workshop) 2013, Sep 2013, Valencia, Spain. 2013.
    8. Fast and Exact Mining of Probabilistic Data Streams
      Reza Akbarinia, Florent Masseglia
      PKDD'2013: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, Sep 2013, Prague, Czech Republic. Springer, pp.493-508, 2013, Lecture Notes in Computer Science.
    9. Imageclef 2013: the vision, the data and the open challenges
      Caputo Barbara, Muller Henning, Thomee Bart, Villegas Mauricio, Roberto Paredes, David Zellhofer, Hervé Goëau, Alexis Joly, Bonnet Pierre, Jesus Martinez Gomez, Ismael Garcia Varea, Miguel Cazorla
      CLEF 2013 - 4th Conference and Labs of the Evaluation Forum : Information Access Evaluation meets Multilinguality, Multimodality, and Visualization, Sep 2013, Valencia, Spain. Springer, LNCS, 8138, pp.250-268, 2013, CLEF 2013: Information Access Evaluation. Multilinguality, Multimodality, and Visualization.
    10. The ImageCLEF 2013 Plant Identification Task
      Hervé Goëau, Pierre Bonnet, Alexis Joly, Vera Bakić, Daniel Barthélémy, Nozha Boujemaa, Jean-François Molino
      CLEF, Sep 2013, Valencia, Spain. 2013.
    11. Data Partitioning for Minimizing Transferred Data in MapReduce
      Miguel Liroz-Gistau, Reza Akbarinia, Divyakant Agrawal, Esther Pacitti, Patrick Valduriez
      Hameurlain, Abdelkader and Rahayu, Wenny and Taniar, David. Globe'2013: 6th International Conference on Data Management in Cloud, Grid and P2P Systems, Aug 2013, Prague, Czech Republic. Springer, pp.1-12, 2013, LNCS.
    12. The Price is Right: Models and Algorithms for Pricing Data
      Tang Ruiming, Wu Huayu, Bao Zhifeng, Bressan Stephane, Patrick Valduriez
      Hendrik Decker and Lenka Lhotska and Sebastian Link. DEXA'2013: 24th International Conference on Database and Expert Systems Applications, Aug 2013, Czech Republic. Springer, pp.380-394, 2013.
    13. What you Pay for is What you Get
      Tang Ruiming, Shao Dongxu, Stephane Bressan, Patrick Valduriez
      Hendrik Decker and Lenka Lhotska and Sebastian Link. DEXA'2013: 24th International Conference on Database and Expert Systems Applications, Aug 2013, Prague, Czech Republic. Springer, pp.395-409, 2013.
    14. WebSmatch: a tool for Open Data
      Emmanuel Castanier, Remi Coletta, Patrick Valduriez, Christian Frisch
      WOD: Workshop on Open Data, Jun 2013, Paris, France. 2nd International Workshop on Open Data, pp.#10, 2013.
    15. Profile Diversity in Search and Recommendation
      Maximilien Servajean, Esther Pacitti, Sihem Amer-Yahia, Pascal Neveu
      Ido Guy; Michelle X. Zhou; Li Chen. SRS: Social Recommender Systems (in conjunction WWW 2013), May 2013, Rio de Janeiro, Brazil. IW3C2, International World Wide Web Conference Committee (IW3C2) - SRS 2013: 4th International Workshop on Social Recommender Systems (in conjunction WWW 2013 Companion, ACM 978-1-4503-2038-2/13/05., pp.973-980, 2013.
    16. Mining frequent itemsets over tuple-evolving data streams
      Chongsheng Zhang, Yuan Hao, Mirjana Mazuran, Carlo Zaniolo, Hamid Mousavi, Florent Masseglia
      SAC'13: Symposium on Applied Computing, Mar 2013, Coimbra, Portugal. pp.267-274, 2013.

Publications majeures depuis 2008

R. Akbarinia, P. Valduriez, G. Verger, Efficient Evaluation of SUM Queries Over Probabilistic Data. IEEE Transactions on Knowledge and Data Engineering, Data. Vol. 25, No. 4, 764-775, 2013.

M. El Dick, E. Pacitti, R. Akbarinia, B. Kemme, Building a Peer-to-Peer Content Distribution Network with High Performance, Scalability and Robustness, Information Systems, Vol. 36, No 2, p. 222-247, 2011.

P. Letessier, O. Buisson, A. Joly, N. Boujemaa, Scalable Mining of Small Visual Objects, ACM Multimedia Conf.,  2012.

E. Ogasawara, D. De Oliveira, P. Valduriez, J. Dias, F. Porto, M. Mattoso, An Algebraic Approach for Data-Centric Scientific Workflows, Proceedings of VLDB, Vol. 4, No 11, p. 1328-1339, 2011. 

F. Petitjean, F. Masseglia, P. Gançarski, G. Forestier, Discovering Significant Evolution Patterns from Satelllite Image Time Series, International Journal of Neural Systems, Vol. 21, No 6, 475-489, 2011.

Mots-clés

Big data, Données scientifiques, Gestion de données distribuées et parallèles, Analyse et fouille de données, Recommandation et recherche de contenus, Communautés en ligne, Workflows scientifiques, Intégration, Confidentialité, Recherche d’information par contenu, P2P, Grid, Cloud

Dernière mise à jour le 29/03/2018

Département : Informatique

Responsable : Esther PACITTI

Adjoint : Florent MASSEGLIA

Site de l'équipe : http://team.inria.fr/zenith

Fiche-équipe ZENITH

Télécharger la fiche-équipe ZENITH du rapport d'activité 2008-2013 :