ZENITH: Gestion de données scientifiques

Zenith s’attaque aux défis posés par la gestion (stockage, partage, traitement, recherche analyse) des données massives (big data, données scientifiques). Ces défis (correspondant aux trois big V : Volume, Velocity, Variety) peuvent se résumer ainsi:

1. très grande échelle (big data, big analytics) ;

2. données en continu (produits par des capteurs, des appareils mobiles, …) ;

3. hétérogénéité et complexité des données (différences sémantiques, données incertaines ou multi-échelles, …).

Notre objectif est d’apporter des solutions innovantes, en démontrant des avantages en termes de passage à l’échelle, fonctionnalité, facilité d’usage et performance, dans des environnements distribués et parallèles (P2P, grid, cloud).

Nous cherchons à produire des résultats fondamentaux et algorithmiques, que nous pouvons implémenter dans des environnements spécifiques, par ex. Grid5K. Pour valider nos solutions, nous collaborons avec des partenaires scientifiques (INRA, CIRAD, IRD, etc.) et industriels (Data Publica, Bull, EDF, Orange, Microsoft, MonetDB, Sparsity, etc.).

Membres

Permanents

Non permanents

Thématiques de recherche

Le projet Zenith est organisé en trois thèmes complémentaires :

1. Gestion de données et métadonnées : gestion et intégration de données et métadonnées (schémas, ontologies) à grande échelle, en particulier, stockage de big data, résolution d’entités incertaines et traitement de requêtes probabilistes.

2. Partage de données et processus : gestion des données et processus scientifiques dans des environnements distribués et parallèles, avec partage de données en P2P, recommandation dans les communautés en ligne et support des workflows scientifiques.

3. Analyse de données : fouille de données et recherche de données par contenu en exploitant le parallélisme du cloud et les nouvelles technologies NoSQL et MapReduce.

Ces trois thèmes reflètent le continuum qui va de la capture des données, en passant par leur intégration, gestion et partage, jusqu’à leur analyse, afin de produire informations et connaissances.

Publications depuis 2013 - Evaluation 2019

Articles de revues internationales

2018

  1. Non-parametric Bayesian annotator combination
    Maximilien Servajean, Romain Chailan, Alexis Joly
    Information Sciences, Elsevier, 2018, 436-437, pp.131-145.
  2. Species distribution modeling based on the automated identification of citizen observations
    Christophe Botella, Alexis Joly, Pierre Bonnet, Pascal Monestiez, François Munoz
    Applications in Plant Sciences, Wiley, 2018, Green Digitization: Online Botanical Collections Data Answering Real‐World Questions, 6 (2), pp.1-11.
  3. Data reduction in scientific workflows using provenance monitoring and user steering
    Renan Souza, Vitor Silva, Alvaro L.G.A. Coutinho, Patrick Valduriez, Marta Mattoso
    Future Generation Computer Systems, Elsevier, In press, pp.1-21.
  4. In situ visualization and data analysis for turbidity currents simulation
    José Camata, Vitor Silva, Patrick Valduriez, Marta Mattoso, Alvaro Coutinho
    Computers & Geosciences, Elsevier, 2018, 110, pp.23-31.
  5. AutoWIG: automatic generation of python bindings for C++ libraries
    Pierre Fernique, Christophe Pradal
    PeerJ Computer Science, PeerJ, 2018, 4. <10.7717/peerj-cs.149>
  6. An Overview of Lead and Accompaniment Separation in Music
    Zafar Rafii, Antoine Liutkus, Fabian-Robert Stöter, Stylianos Ioannis Mimilakis, Derry Fitzgerald, Bryan Pardo
    IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2018. <10.1109/TASLP.2018.2825440>
  7. A Survey of Scheduling Frameworks in Big Data Systems
    Ji Liu, Esther Pacitti, Patrick Valduriez
    International Journal of Cloud Computing, Inderscience Publishers, In press, pp.1-27.

2017

  1. Going deeper in the automated identification of Herbarium specimens
    Jose Carranza-Rojas, Herve Goeau, Pierre Bonnet, Erick Mata-Montero, Alexis Joly
    BMC Evolutionary Biology, BioMed Central, 2017, 17 (1), pp.181.
  2. Crowdsourcing Thousands of Specialized Labels: A Bayesian Active Training Approach
    Maximilien Servajean, Alexis Joly, Dennis Shasha, Julien Champ, Esther Pacitti
    IEEE Transactions on Multimedia, Institute of Electrical and Electronics Engineers, 2017, 19 (6), pp.1376-1391.
  3. InfraPhenoGrid: A scientific workflow infrastructure for Plant Phenomics on the Grid
    Christophe Pradal, Simon Artzet, Jerome Chopard, Dimitri Dupuis, Christian Fournier, Michael Mielewczik, Vincent Negre, Pascal Neveu, Didier Parigot, Patrick Valduriez, Sarah Cohen-Boulakia
    Future Generation Computer Systems, Elsevier, 2017, 67, pp.341-353.
  4. A Highly Scalable Parallel Algorithm for Maximally Informative k-Itemset Mining
    Saber Salah, Reza Akbarinia, Florent Masseglia
    Knowledge and Information Systems (KAIS), Springer, 2017.
  5. Raw data queries during data-intensive parallel workflow execution
    Vítor Silva, José Leite, José Camata, Daniel De Oliveira, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
    Future Generation Computer Systems, Elsevier, 2017, 75, pp.402-422.
  6. Scientific Workflow Scheduling with Provenance Data in a Multisite Cloud
    Ji Liu, Esther Pacitti, Patrick Valduriez, Marta Mattoso
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2017, 33, pp.80-112.
  7. Data placement in massively distributed environments for fast parallel mining of frequent itemsets
    Saber Salah, Reza Akbarinia, Florent Masseglia
    Knowledge and Information Systems (KAIS), Springer, 2017, 53 (1), pp.207-237.
  8. Scientific workflows for computational reproducibility in the life sciences: Status, challenges and opportunities
    Sarah Cohen-Boulakia, Khalid Belhajjame, Olivier Collin, Jérôme Chopard, Christine Froidevaux, Alban Gaignard, Konrad Hinsen, Pierre Larmande, Yvan Le Bras, Frédéric Lemoine, Fabien Mareuil, Hervé Ménager, Christophe Pradal, Christophe Blanchet
    Future Generation Computer Systems, Elsevier, 2017. <10.1016/j.future.2017.01.012>

2016

  1. CloudMdsQL: Querying Heterogeneous Cloud Data Stores with a Common Language
    Boyan Kolev, Patrick Valduriez, Carlyna Bondiombouy, Ricardo Jimenez-Peris, Raquel Pau, José Pereira
    Distributed and Parallel Databases, Springer, 2016, 34 (4), pp.463-503.
  2. AgroLD API. Une architecture orientée services pour l'extraction de connaissances dans la base de données liées AgroLD
    Gildas Tagny Ngompe, Aravind Venkatesan, Nordine El Hassouni, Manuel Ruiz, Pierre Larmande
    Revue des Sciences et Technologies de l'Information - Série ISI : Ingénierie des Systèmes d'Information, Lavoisier, 2016, 21 (5-6), pp.133-158.
  3. Database System Support of Simulation Data
    Hermano Lustosa, Fabio Porto, Pablo Blanco, Patrick Valduriez
    Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2016, 9 (13), pp.1329-1340.
  4. Categorizing plant images at the variety level: Did you say fine-grained?
    Julien Champ, Titouan Lorieul, Pierre Bonnet, Najate Maghnaoui, Christophe Sereno, Thierry Dessup, Jean-Michel Boursiquot, Laurent Audeguin, Thierry Lacombe, Alexis Joly
    Pattern Recognition Letters, Elsevier, 2016, In press. <10.1016/j.patrec.2016.05.022>
  5. Gigwa—Genotype investigator for genome- wide analyses
    Guilhem Sempéré, Florian Philippe, Alexis Dereeper, Manuel Ruiz, Gautier Sarah, Pierre Larmande
    GigaScience, BioMed Central, 2016. <10.1186/s13742-016-0131-8>
  6. Social Networks and Information Retrieval, How Are They Converging? A Survey, a Taxonomy and an Analysis of Social Information Retrieval Approaches and Platforms
    Mohamed Reda Bouadjenek, Hakim Hacid, Mokrane Bouzeghoub
    Information Systems, Elsevier, 2016, 56, pp.1-18.
  7. Effective and Efficient Similarity Search in Scientific Workflow Repositories
    Johannes Starlinger, Sarah Cohen-Boulakia, Sanjeev Khanna, Susan Davidson, Ulf Leser
    Future Generation Computer Systems, Elsevier, 2016, 56, pp.584-594.
  8. Multi-Objective Scheduling of Scientific Workflows in Multisite Clouds
    Ji Liu, Esther Pacitti, Patrick Valduriez, Daniel De Oliveira, Marta Mattoso
    Future Generation Computer Systems, Elsevier, 2016, 63, pp.76-95.
  9. Query processing in multistore systems: an overview
    Carlyna Bondiombouy, Patrick Valduriez
    International Journal of Cloud Computing, Inderscience Publishers, 2016, pp.38.
  10. FP-Hadoop: Efficient Processing of Skewed MapReduce Jobs
    Miguel Liroz-Gistau, Reza Akbarinia, Divyakant Agrawal, Patrick Valduriez
    Information Systems, Elsevier, 2016, 60, pp.69-84.
  11. Analyzing Related Raw Data Files through Dataflows
    Vitor Silva Souza, Oliveira Daniel De, Patrick Valduriez, Marta Mattoso
    Concurrency and Computation: Practice and Experience, Wiley, 2016, 28 (8), pp.2528-2545.
  12. Guest Editorial: Environmental Multimedia Retrieval
    Stefanos Vrochidis, Kostas D. Karatzas, Ari Karppinen, Alexis Joly
    Multimedia Tools and Applications, Springer Verlag, 2016, 75 (3), pp.1557--1562.
  13. Plant identification: Man vs. Machine
    Pierre Bonnet, Alexis Joly, Hervé Goëau, Julien Champ, Christel Vignau, Jean-François Molino, Daniel Barthélémy, Nozha Boujemaa
    Multimedia Tools and Applications, Springer Verlag, 2016, LifeCLEF 2014 plant identification challenge, 75 (3), pp.1647-1665.
  14. Multistore Big Data Integration with CloudMdsQL
    Carlyna Bondiombouy, Boyan Kolev, Oleksandra Levchenko, Patrick Valduriez
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2016, 28, pp.48-74.

2015

  1. Rank aggregation with ties: Experiments and Analysis
    Bryan Brancotte, Bo Yang, Guillaume Blin, Sarah Cohen-Boulakia, Alain Denise, Sylvie Hamel
    Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2015, pp.2051.
  2. Increasing Coverage in Distributed Search and Recommendation with Profile Diversity
    Maximilien Servajean, Esther Pacitti, Miguel Liroz-Gistau, Sihem Amer-Yahia, Amr El Abbadi
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2015, LNCS (9430), pp.115-144.
  3. Profile Diversity for Query Processing using User Recommendations
    Maximilien Servajean, Reza Akbarinia, Esther Pacitti, Sihem Amer-Yahia
    Information Systems, Elsevier, 2015, Information Systems, 48, pp.44-63.
  4. A Survey of Data-Intensive Scientific Workflow Management
    Ji Liu, Esther Pacitti, Patrick Valduriez, Marta Mattoso
    Journal of Grid Computing, Springer Verlag, 2015, 13, 44 p. <10.1007/s10723-015-9329-8>
  5. Data-Centric Iteration in Dynamic Workflows
    Jonas Dias, Gabriel Guerra, Fernando Rochinha, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
    Future Generation Computer Systems, Elsevier, 2015, 46, pp.114-126.
  6. A look inside the Pl@ntNet experience
    Alexis Joly, Pierre Bonnet, Hervé Goëau, Julien Barbe, Souheil Selmi, Julien Champ, Samuel Dufour-Kowalski, Antoine Affouard, Jennifer Carré, Jean-François Molino, Nozha Boujemaa, Daniel Barthélémy
    Multimedia Systems, Springer Verlag, 2015, pp.16.
  7. FP-Hadoop: Efficient Execution of Parallel Jobs Over Skewed Data
    Miguel Liroz-Gistau, Reza Akbarinia, Patrick Valduriez
    Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2015, 8 (12), pp.1856-1867.

2014

  1. Autonomic Intrusion Detection: Adaptively Detecting Anomalies over Unlabeled Audit Data Streams in Computer Networks
    Wei Wang, Thomas Guyet, René Quiniou, Marie-Odile Cordier, Florent Masseglia, Xiangliang Zhang
    Knowledge-Based Systems, Elsevier, 2014.
  2. Special section on data-intensive cloud infrastructure
    Ashraf Aboulnaga, Beng Chin Ooi, Patrick Valduriez
    The VLDB Journal, Springer, 2014, pp.1.
  3. The anti-bouncing data stream model for web usage streams with intralinkings
    Chongsheng Zhang, Florent Masseglia, Yves Lechevallier
    Information Sciences, Elsevier, 2014, 278, pp.757-772.
  4. Similarity Search for Scientific Workflows
    Johannes Starlinger, Bryan Brancotte, Sarah Cohen-Boulakia, Ulf Leser
    Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2014, 7 (12), pp.1143-1154.
  5. Query Reformulation in PDMS Based on Social Relevance
    Angela Bonifati, Gianvito Summa, Esther Pacitti, Fady Draidi
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2014, Transactions on Large-Scale Data- and Knowledge-Centered Systems XIII, LNCS, pp.59-90.
  6. Entity Resolution for Probabilistic Data
    Ayat Naser, Reza Akbarinia, Hamideh Afsarmanesh, Patrick Valduriez
    Information Sciences, Elsevier, 2014, 277, pp.492-511.
  7. Interactive plant identification based on social image data
    Alexis Joly, Hervé Goëau, Pierre Bonnet, Vera Bakić, Julien Barbe, Souheil Selmi, Itheri Yahiaoui, Jennifer Carré, Elise Mouysset, Jean-François Molino, Nozha Boujemaa, Daniel Barthélémy
    Ecological Informatics, Elsevier, 2014, 23, pp.22-34.
  8. Object-based visual query suggestion
    Amel Hamzaoui, Pierre Letessier, Alexis Joly, Olivier Buisson, Nozha Boujemaa
    Multimedia Tools and Applications, Springer Verlag, 2014, Multimedia Tools and Applications, 68 (2), pp.429-454.
  9. Dynamic Workload-Based Partitioning Algorithms for Continuously Growing Databases
    Miguel Liroz-Gistau, Reza Akbarinia, Esther Pacitti, Fabio Porto, Patrick Valduriez
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2014, pp.105.

2013

  1. Stress Testing of Transactional Database Systems
    Jorge Augusto Meira, Eduardo Cunha de Almeida, Gerson Sunyé, Yves Le Traon, Patrick Valduriez
    Journal of Information and Data Management, Brazilian Computer Society, 2013, 4 (3).
  2. A Hierarchical Grid Index (HGI), spatial queries in wireless data broadcasting
    Kwangjin Park, Patrick Valduriez
    Distributed and Parallel Databases, Springer, 2013, 31 (3), pp.413-446.
  3. Chiron: A Parallel Engine for Algebraic Scientific Workflows
    Eduardo Ogasawara, Dias Jonas, Vitor Silva, Chirigati Fernando, Oliveira Daniel De, Fabio Porto, Marta Mattoso, Patrick Valduriez
    Concurrency and Computation: Practice and Experience, Wiley, 2013, 25 (16), pp.2327-2341.
  4. Entity Resolution for Distributed Probabilistic Data
    Naser Ayat, Reza Akbarinia, Hamideh Afsarmanesh, Patrick Valduriez
    Distributed and Parallel Databases, Springer, 2013, 31 (4), pp.509-542.
  5. As-Soon-As-Possible Top-k Query Processing in P2P Systems
    William Kokou Dedzoe, Philippe Lamarre, Reza Akbarinia, Patrick Valduriez
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2013, Part IX, LNCS (7980), pp.1-27.
  6. Efficient Evaluation of SUM Queries Over Probabilistic Data
    Reza Akbarinia, Patrick Valduriez, Guillaume Verger
    IEEE Transactions on Knowledge and Data Engineering, Institute of Electrical and Electronics Engineers, 2013, 25 (4), pp.764-775.

Communications internationales

2018

  1. The 2018 Signal Separation Evaluation Campaign
    Fabian-Robert Stöter, Antoine Liutkus, Nobutaka Ito
    LVA ICA : 14th International Conference on Latent Variable Analysis and Signal Separation, Jul 2018, Surrey, United Kingdom. 2018.
  2. Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition
    Mathieu Fontaine, Fabian-Robert Stöter, Antoine Liutkus, Umut Simsekli, Romain Serizel, Roland Badeau
    LVA ICA 2018 - 14th International Conference on Latent Variable Analysis and Signal Separation, Jul 2018, Surrey, United Kingdom. 2018.
  3. Alpha-stable low-rank plus residual decomposition for speech enhancement
    Umut Simsekli, Halil Erdogan, Simon Leglaive, Antoine Liutkus, Roland Badeau, Gaël Richard
    ICASSP 2018 - IEEE International Conference on Acoustics, Speech, and Signal Processing, Apr 2018, Calgary, Canada. 2018.
  4. Blind Source Separation Using Mixtures of Alpha-Stable Distributions
    Nicolas Keriven, Antoine Deleforge, Antoine Liutkus
    ICASSP 2018 - International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Canada. pp.1-5.
  5. Audio source separation with magnitude priors: the BEADS model
    Antoine Liutkus, Christian Rohlfing, Antoine Deleforge
    ICASSP 2018 - IEEE International Conference on Acoustics, Speech, and Signal Processing, Apr 2018, Calgary, Canada. pp.1-5.
  6. Interference reduction on full-length live recordings
    Diego Carlo, Antoine Liutkus, Ken Déguernel
    ICASSP 2018 - IEEE International Conference on Acoustics, Speech, and Signal Processing, Apr 2018, Calgary, Canada. pp.1-5.
  7. Maximally Informative k-Itemset Mining from Massively Distributed Data Streams
    Mehdi Zitouni, Reza Akbarinia, Sadok Ben Yahia, Florent Masseglia
    SAC 2018 - 33rd ACM/SIGAPP Symposium On Applied Computing, Apr 2018, Pau, France. pp.1-10.

2017

  1. DPiSAX: Massively Distributed Partitioned iSAX
    Djamel-Edine Yagoubi, Reza Akbarinia, Florent Masseglia, Themis Palpanas
    ICDM 2017: IEEE International Conference on Data Mining, Nov 2017, New Orleans, United States. pp.1-6, 2017.
  2. Querying Key-Value Stores Under Simple Semantic Constraints : Rewriting and Parallelization
    Olivier Rodriguez, Corentin Colomier, Cecilie Rivière, Reza Akbarinia, Federico Ulliana
    BDA: Conférence sur la Gestion de Données — Principes, Technologies et Applications ", Nov 2017, Nancy, France. 2017.
  3. Efficient Scheduling of Scientific Workflows using Hot Metadata in a Multisite Cloud
    Ji Liu, Luis Pineda-Morales, Esther Pacitti, Alexandru Costan, Patrick Valduriez, Gabriel Antoniu, Marta Mattoso
    BDA: Conférence sur la Gestion de Données — Principes, Technologies et Applications, Nov 2017, Nancy, France. pp.13, 2017.
  4. TARS: An Array Model with Rich Semantics for Multidimensional Data
    Hermano Lustosa, Noel Lemus, Fabio Porto, Patrick Valduriez
    ER FORUM 2017: Conceptual Modeling : Research In Progress, Nov 2017, Valencia, Spain. 2017.
  5. End-to-end Graph Mapper
    Benjamin Billet, Mickaël Jurret, Didier Parigot, Patrick Valduriez
    BDA: Conférence sur la Gestion de Données — Principes, Technologies et Applications, Nov 2017, Nancy, France. 2017.
  6. Tracking of Online Parameter Fine-tuning in Scientific Workflows
    Renan Souza, Vitor Silva, José Camata, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
    Workflows in Support of Large-Scale Science (WORKS), in conjunction with ACM/IEEE Supercomputing., Nov 2017, Denver, United States. 2017.
  7. Pl@ntNet -My Business
    Alexis Joly, Pierre Bonnet, Antoine Affouard, Jean-Christophe Lombardo, Hervé Goëau
    ACM Multimedia 2017, Oct 2017, Mountain View, United States. pp.1-11.
  8. RadiusSketch: Massively Distributed Indexing of Time Series
    Djamel-Edine Yagoubi, Reza Akbarinia, Florent Masseglia, Dennis Shasha
    DSAA 2017: IEEE International Conference on Data Science and Advanced Analytics, Oct 2017, Tokyo, Japan. pp.1-10, 2017.
  9. Spark Scalability Analysis in a Scientific Workflow
    Renan Souza, Vitor Silva, Pedro Miranda, Alexandre Lima, Patrick Valduriez, Marta Mattoso
    SBBD 2017: 32th Brazilian Symposium on Databases, Oct 2017, Uberlandia, Brazil. pp.1-6, 2017.
  10. Automated Herbarium Specimen Identification using Deep Learning
    Jose Carranza-Rojas, Alexis Joly, Pierre Bonnet, Hervé Goëau, Erick Mata-Montero
    TDWG 2017 - Annual Conference on Biodiversity Information Standards, Oct 2017, Ottawa, Canada. 2017. <10.3897/tdwgproceedings.1.20302>
  11. LifeCLEF 2017 Lab Overview: Multimedia Species Identification Challenges
    Alexis Joly, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Pierre Bonnet, Willem-Pier Vellinga, Jean-Christophe Lombardo, Robert Planque, Simone Palazzo, Henning Müller
    Gareth J.F. Jones; Séamus Lawless; Julio Gonzalo; Liadh Kelly; Lorraine Goeuriot; Thomas Mandl; Linda Cappellato; Nicola Ferro. CLEF: Cross-Language Evaluation Forum for European Languages, Sep 2017, Dublin, Ireland. Springer, 8th International Conference of the Cross-Language Evaluation Forum for European Language, LNCS (10456), pp.255-274, 2017, Experimental IR Meets Multilinguality, Multimodality, and Interaction.
  12. Plant identification based on noisy web data: the amazing performance of deep learning (LifeCLEF 2017)
    Herve Goeau, Pierre Bonnet, Alexis Joly
    CLEF 2017 - Conference and Labs of the Evaluation Forum, Sep 2017, Dublin, Ireland. pp.1-13, 2017.
  13. LifeCLEF Bird Identification Task 2017
    Herve Goeau, Hervé Glotin, Willem-Pier Vellinga, Robert Planqué, Alexis Joly
    CLEF 2017 - Conference and Labs of the Evaluation Forum, Sep 2017, Dublin, Ireland. pp.1-9.
  14. TARDIS: Optimal Execution of Scientific Workflows in Apache Spark
    Daniel Gaspar, Fabio Porto, Reza Akbarinia, Esther Pacitti
    DaWaK 2017: Data Warehousing and Knowledge Discovery, Aug 2017, Lyon, France. 19th International Conference on Big Data Analytics and Knowledge Discovery, pp.74-87, 2017, LNCS.
  15. Pre-processing and Indexing techniques for Constellation Queries in Big Data
    Amir Khatibi, Fabio Porto, Joao Rittmeyer, Eduardo Ogasawara, Patrick Valduriez, Dennis Shasha
    DaWaK 2017: 19th International Conference on Big Data Analytics and Knowledge Discovery, Aug 2017, Lyon, France. Springer, LNCS, pp.74-87, 2017, Big Data Analytics and Knowledge Discovery.
  16. Massively Distributed Environments and Closed Itemset Mining: The DCIM Approach
    Mehdi Zitouni, Reza Akbarinia, Sadok Ben Yahia, Florent Masseglia
    CAiSE: Advanced Information Systems Engineering, Jun 2017, Essen, Germany. 29th International Conference on Advanced Information Systems Engineering, LNCS (10253), pp.231-246, 2017.
  17. Pl@ntNet app in the era of deep learning
    Antoine Affouard, Hervé Goëau, Pierre Bonnet, Jean-Christophe Lombardo, Alexis Joly
    nnet, Jean-Christophe Lombardo, Alexis Joly. Pl@ntNet app in the era of deep learning. ICLR 2017 - Workshop Track - 5th International Conference on Learning Representations, Apr 2017, Toulon, France. pp.1-6.

2016

  1. Benchmarking Polystores: the CloudMdsQL Experience
    Boyan Kolev, Raquel Pau, Oleksandra Levchenko, Patrick Valduriez, Ricardo Jiménez-Peris, José Pereira
    Vijay Gadepally. International Conference on Big Data, Dec 2016, Washington, DC, United States. IEEE Computing Society, IEEE BigData 2016: Workshop on Methods to Manage Heterogeneous Big Data and Polystore Databases, 2017. <10.1109/BigData.2016.7840899>
  2. Managing Hot Metadata for Scientific Workflows on Multisite Clouds
    Luis Pineda-Morales, Ji Liu, Alexandru Costan, Esther Pacitti, Gabriel Antoniu, Patrick Valduriez, Marta Mattoso
    BIGDATA 2016 - 2016 IEEE International Conference on Big Data, Dec 2016, Washington, United States. 2016.
  3. Extending CloudMdsQL with MFR for Big Data Integration
    Carlyna Bondiombouy, Boyan Kolev, Patrick Valduriez, Oleksandra Levchenko
    BDA: Bases de Données Avancées, Nov 2016, Poitiers, France. 32ème Conférence sur la Gestion de Données - Principes, Technologies et Applications, 2016.
  4. Online Input Data Reduction in Scientific Workflows
    Renan Souza, Vítor Silva, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
    ACM SIGHPC; IEEE. WORKS: Workflows in Support of Large-scale Science, Nov 2016, Salt Lake City, United States. 11th Workshop on Workflows in Support of Large-scale Science, in conjunction with SC2016, 2016.
  5. Crowdsourcing Biodiversity Monitoring: How Sharing your Photo Stream can Sustain our Planet
    Alexis Joly, Hervé Goëau, Julien Champ, Samuel Dufour-Kowalski, Henning Müller, Pierre Bonnet
    ACM Multimedia 2016, Oct 2016, Amsterdam, Netherlands.
  6. ThePlantGame: Actively Training Human Annotators for Domain-specific Crowdsourcing
    Maximilien Servajean, Alexis Joly, Dennis Shasha, Julien Champ, Esther Pacitti
    ACM Multimedia 2016, Oct 2016, Amsterdam, Netherlands.
  7. Plant Identification in an Open-world (LifeCLEF 2016)
    Hervé Goëau, Pierre Bonnet, Alexis Joly
    CLEF 2016 - Conference and Labs of the Evaluation forum, Sep 2016, Évora, Portugal. Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, pp.428--439, 2016.
  8. LifeCLEF Bird Identification Task 2016: The arrival of Deep learning
    Hervé Goëau, Hervé Glotin, Willem-Pier Vellinga, Robert Planqué, Alexis Joly
    Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, Sep 2016, Evora, Portugal. pp.440--449, 2016.
  9. Unsupervised Individual Whales Identification: Spot the Difference in the Ocean
    Alexis Joly, Jean-Christophe Lombardo, Julien Champ, Anjara Saloma
    Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, Sep 2016, Evora, Portugal. pp.469--480, 2016.
  10. LifeCLEF 2016: Multimedia Life Species Identification Challenges
    Alexis Joly, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Pierre Bonnet, Willem-Pier Vellinga, Julien Champ, Robert Planqué, Simone Palazzo, Henning Müller
    Norbert Fuhr; Paulo Quaresma; Teresa Gonçalves ; Birger Larsen ; Krisztian Balog ; Craig Macdonald; Linda Cappellato; Nicola Ferro. CLEF 2016 - 7th International Conference of the CLEF Association, Sep 2016, Evora, Portugal. Springer, pp.286--310, 2016, Experimental IR Meets Multilinguality, Multimodality, and Interaction.
  11. Floristic participation at LifeCLEF 2016 Plant Identification Task
    Julien Champ, Hervé Goëau, Alexis Joly
    CLEF 2016 - Conference and Labs of the Evaluation forum, Sep 2016, Évora, Portugal. Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, pp.450--458, 2016.
  12. Enhancing Energy Production with Exascale HPC Methods
    José Camata, José Cela, Danilo Costa, Alvaro Lga Coutinho, Daniel Fernández-Galisteo, Carmen Jimenez, Vadim Kourdioumov, Marta Mattoso, Rafael Mayo-García, Thomas Miras, José Moríñigo, Jorge Navarro, Philippe Navaux, Daniel De Oliveira, Manuel Rodríguez-Pascual, Vítor Silva, Renan Souza, Patrick Valduriez
    CARLA 2016 - Latin American High Performance Computing Conference, Aug 2016, Mexico City, Mexico. Springer, 3rd Latin American High Performance Computing Conference, CCIS (697), pp.233-246, 2017.
  13. Scientific Workflow Scheduling with Provenance Support in Multisite Cloud
    Ji Liu, Esther Pacitti, Patrick Valduriez, Marta Mattoso
    VECPAR, Jun 2016, Porto, Portugal. 12th International Meeting on High Performance Computing for Computational Science, pp.8, 2016.
  14. The CloudMdsQL Multistore System
    Boyan Kolev, Carlyna Bondiombouy, Patrick Valduriez, Ricardo Jiménez-Peris, Raquel Pau, José Pereira
    SIGMOD, Jun 2016, San Francisco, United States. ACM SIGMOD/PODS Conference, 2016. <10.1145/2882903.2899400>
  15. Development of a knowledge system for Big Data: Case study to plant phenotyping data
    Luyen Le Ngoc, Anne Tireau, Aravind Venkatesan, Pascal Neveu, Pierre Larmande
    WIMS '16 Proceedings of the 6th International Conference on Web Intelligence, Mining and Semantics, Jun 2016, Nimes, France. ACM. <10.1145/2912845.2912869>
  16. Exposing French agronomic resources as Linked Open Data
    Aravind Venkatesan, Nordine El Hassouni, Florian Phillipe, Cyril Pommier, Hadi Quesneville, Manuel Ruiz, Pierre Larmande
    Ingenierie des Connaissances IC2016 - Workshop In Ovive, Jun 2016, Montpellier, France.
  17. Spatially Localized Visual Dictionary Learning
    Valentin Leveau, Alexis Joly, Olivier Buisson, Patrick Valduriez
    ICMR '16 Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval , Jun 2016, New York, United States. ACM, pp.367--370, 2016.
  18. A New Privacy-Preserving Solution for Clustering Massively Distributed Personal Times-Series
    Tristan Allard, Georges Hébrail, Florent Masseglia, Esther Pacitti
    ICDE: International Conference on Data Engineering, May 2016, Helsinki, Finland. 32nd IEEE International Conference on Data Engineering, ICDE 2016, 2016.
  19. Design and Implementation of the CloudMdsQL Multistore System
    Boyan Kolev, Carlyna Bondiombouy, Oleksandra Levchenko, Patrick Valduriez, Ricardo Jimenez-Péris, Raquel Pau, Jose Pereira
    CLOSER: Cloud Computing and Services Science, Apr 2016, Roma, Italy. 6th International Conference on Cloud Computing and Services Science, 1, pp.352-359, 2016, DataDiversityConvergence Workshop.

2015

  1. Exposing French agronomic resources as Linked Open Data
    Aravind Venkatesan, Nordine El Hassouni, Florian Philippe, Cyril Pommier, Hadi Quesneville, Manuel Ruiz, Pierre Larmande
    SWAT4LS: Semantic Web Applications and Tools for Life Sciences, Dec 2015, Cambridge, United Kingdom. 1546, 2015.
  2. Managing Simulation Data with Multidimensional Arrays
    Hermano Lustosa, Fabio Porto, Ramon Costa, Pablo Blanco, Patrick Valduriez
    SBBD'2015: Simpósio Brasileiro de Banco de Dados, Oct 2015, Petropolis, Brazil. pp.7, 2015.
  3. Ontology-based services and knowledge management in the Agronomic Domain
    Pierre Larmande
    RDA: Research Data Alliance, Sep 2015, Paris, France. The 6th Research Data Alliance plenary meeting, 2015.
  4. LifeCLEF 2015: Multimedia Life Species Identification Challenges
    Alexis Joly, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Pierre Bonnet, Willem-Pier Vellinga, Robert Planqué, Andreas Rauber, Simone Palazzo, Bob Fisher, Henning Müller
    CLEF: Conference and Labs of the Evaluation forum, Sep 2015, Toulouse, France. Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum - Toulouse, France, September 8-11, 2015., 2015.
  5. LifeCLEF Plant Identification Task 2015
    Hervé Goëau, Pierre Bonnet, Alexis Joly
    CEUR-WS. CLEF: Conference and Labs of the Evaluation forum, Sep 2015, Toulouse, France. Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum - Toulouse, France, September 8-11, 2015., 1391, 2015, CLEF2015 Working notes.
  6. A comparative study of fine-grained classification methods in the context of the LifeCLEF plant identification challenge 2015
    Julien Champ, Titouan Lorieul, Maximilien Servajean, Alexis Joly
    CEUR-WS. CLEF: Conference and Labs of the Evaluation forum, Sep 2015, Toulouse, France. Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum - Toulouse, France, September 8-11, 2015., 1391, 2015, CLEF2015 working notes.
  7. LifeCLEF Bird Identification Task 2015
    Hervé Goëau, Hervé Glotin, Willem-Pier Vellinga, Robert Planqué, Andreas Rauber, Alexis Joly
    CEUR-WS. CLEF: Conference and Labs of the Evaluation forum, Sep 2015, toulouse, France. Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum - Toulouse, France, September 8-11, 2015., 1391, 2015, CLEF2015 working notes.
  8. Shared nearest neighbors match kernel for bird songs identification -LifeCLEF 2015 challenge
    Alexis Joly, Valentin Leveau, Julien Champ, Olivier Buisson
    ceur-ws. CLEF: Conference and Labs of the Evaluation forum, Sep 2015, Toulouse, France. Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum - Toulouse, France, September 8-11, 2015., 1391, 2015, CLEF2015 working notes.
  9. Integrating Big Data and Relational Data with a Functional SQL-like Query Language
    Carlyna Bondiombouy, Boyan Kolev, Oleksandra Levchenko, Patrick Valduriez
    Qiming Chen; Abdelkader Hameurlain; Farouk Toumani; Roland Wagner; Hendrik Decker. DEXA’2015: 26th International Conference on Database and Expert Systems Applications, Sep 2015, Valencia, Spain. Lecture Notes in Computer Science 9261, Springer 2015, ISBN 978-3-319-22848-8, 2015.
  10. A Prime Number Based Approach for Closed Frequent Itemset Mining in Big Data
    Mehdi Zitouni, Reza Akbarinia, Sadok Ben Yahia, Florent Masseglia
    DEXA: Database and Expert Systems Applications, Sep 2015, Valencia, Spain. 26th International Conference on Database and Expert Systems Applications, LNCS (9261), pp.509-516, 2015.
  11. Data Partitioning for Fast Mining of Frequent Itemsets in Massively Distributed Environments
    Saber Salah, Reza Akbarinia, Florent Masseglia
    DEXA: Database and Expert Systems Applications, Sep 2015, Valencia, Spain. 26th International Conference on Database and Expert Systems Applications, 2015.
  12. An Efficient Solution for Processing Skewed MapReduce Jobs
    Reza Akbarinia, Miguel Liroz-Gistau, Divyakant Agrawal, Patrick Valduriez
    Globe'2015: 8th International Conference on Data Management in Cloud, Grid and P2P Systems, Sep 2015, Valencia, Spain.
  13. Fast Parallel Mining of Maximally Informative k-Itemsets in Big Data
    Saber Salah, Reza Akbarinia, Florent Masseglia
    IEEE International Conference on Data Mining, Aug 2015, Atlantic city, United States. 2015.
  14. When sharing computer science with everyone also helps avoiding digital prejudices.
    Marie Duflot, Martin Quinson, Florent Masseglia, Didier Roy, Julien Vaubourg, Thierry Viéville
    Escape computer dirty magic: learn Scratch !. Scratch2015AMS, Aug 2015, Amsterdam, Netherlands. 2015.
  15. On Term Selection Techniques for Patent Prior Art Search
    Mona Golestan Far, Scott Sanner, Mohamed Reda Bouadjenek, Gabriela Ferraro, David Hawking
    SIGIR: Research and Development in Information Retrieval, Aug 2015, Santiago, Chile. ACM, 2015, SIGIR '15: 38th International SIGIR Conference on Research and Development in Information Retrieval. <10.1145/2766462.2767801>
  16. Aggregation-Aware Compression of Probabilistic Streaming Time Series
    Reza Akbarinia, Florent Masseglia
    MLDM'2015: International Conference on Machine Learning and Data Mining, Jul 2015, Hamburg, Germany.
  17. Optimizing the Data-Process Relationship for Fast Mining of Frequent Itemsets in MapReduce
    Saber Salah, Reza Akbarinia, Florent Masseglia
    MLDM'2015: International Conference on Machine Learning and Data Mining, Jul 2015, Hamburg, Germany. Machine Learning and Data Mining in Pattern Recognition, 9166, pp.217-231, 2015, LNCS.
  18. Towards efficient data integration and knowledge management in the Agronomic domain
    Aravind Venkatesan, Nordine El Hassouni, Florian Phillipe, Cyril Pommier, Hadi Quesneville, Manuel Ruiz, Pierre Larmande
    APIA: Applications Pratiques de l'Intelligence Artificielle , Jul 2015, Rennes, France. 1ère conférence sur les Application Pratiques de l'Intelligence Artificielle (APIA), 2015.
  19. OpenAlea: Scientific Workflows Combining Data Analysis and Simulation
    Christophe Pradal, Christian Fournier, Patrick Valduriez, Sarah Cohen-Boulakia
    SSDBM 2015: 27th International Conference on Scientific and Statistical Database Management, Jun 2015, San Diego, United States. <10.1145/2791347.2791365>
  20. Kernelizing Spatially Consistent Visual Matches for Fine-Grained Classification
    Valentin Leveau, Alexis Joly, Olivier Buisson, Patrick Valduriez
    International Conference on Multimedia Retrieval 2015, Jun 2015, Shangai, China.
  21. DigInPix: Visual Named-Entities Identification in Images and Videos
    Pierre Letessier, Nicolas Hervé, Alexis Joly, Hakim Nabi, Mathieu Derval, Olivier Buisson
    ICRM: International Conference on Multimedia Retrieval, Jun 2015, Shanghai, China. ACM, Proceedings of the 5th ACM on International Conference on Multimedia Retrieval - ICMR '15, pp.661-664, 2015.
  22. A Study of Query Reformulation for Patent Prior Art Search with Partial Patent Applications
    Mohamed Reda Bouadjenek, Scott Sanner, Gabriela Ferraro
    ICAIL: International Conference on Artificial Intelligence and Law, Jun 2015, San Diego, United States. 2015, ICAIL'2015: 15th International Conference on Artificial Intelligence and Law.
  23. Chiaroscuro: Transparency and Privacy for Massive Personal Time-Series Clustering
    Tristan Allard, Georges Hébrail, Florent Masseglia, Esther Pacitti
    ACM SIGMOD. SIGMOD: Conference on Management of Data, May 2015, Melbourne, Australia. SIGMOD '15- Proceedings of the 2015 ACM SIGMOD 34th International Conference on Management of Data, 2015. <10.1145/2723372.2749453>
  24. Data-intensive HPC: opportunities and challenges
    Patrick Valduriez
    BDEC'2015: Big Data and Extreme-scale Computing, Jan 2015, Barcelone, Spain. 2015.

2014

  1. Fine-grained Visual Faceted Search
    Julien Champ, Alexis Joly, Bonnet Pierre
    ACM Multimedia, Nov 2014, Orlando, FL, United States. The 22nd ACM International Conference on Multimedia - November 3-7, 2014 | Orlando, FL, USA. <10.1145/2647868.2654875>
  2. Recognizing Thousands of Legal Entities through Instance-based Visual Classification
    Valentin Leveau, Alexis Joly, Olivier Buisson, Pierre Letessier, Patrick Valduriez
    ACM Multimedia, Nov 2014, Orlando, FL, United States. The 22nd ACM International Conference on Multimedia - November 3-7, 2014 | Orlando, FL, USA, 2014. <10.1145/2647868.2655038>
  3. NACluster: A Non-Supervised Clustering Algorithm for Matching Multi Catalogues
    Vinicius P. Freire, José A. F. De Macêdo, Fábio Porto, Reza Akbarinia
    IEEE e-Science Workshop, Oct 2014, Guarujá, SP, Brazil. 2014.
  4. Layer Decomposition: An Effective Structure-based Approach for Scientific Workflow Similarity
    Johannes Starlinger, Sarah Cohen-Boulakia, Sanjeev Khanna, Susan Davidson, Ulf Leser
    IEEE e-Science conference, Oct 2014, Guarujá, Brazil. 2014.
  5. Exploiting Diversification in Distributed Recommendation
    Maximilien Servajean, Esther Pacitti, Miguel Liroz-Gistau, Sihem Amer-Yahia, Amr El Abbadi
    BDA: Bases de Données Avancées, Oct 2014, Grenoble-Autrans, France. INRIA-SILICONVALLEY, 2014, Gestion de Données – Principes, Technologies et Applications.
  6. PlantRT : a Distributed Recommendation Tool for Citizen Science
    Maximilien Servajean, Esther Pacitti, Miguel Liroz-Gistau, Alexis Joly, Julien Champ
    BDA: Bases de Données Avancées, Oct 2014, Autrans, France. BDA 2014 : Gestion de données - principes, technologies et applications, pp.48-50, 2014.
  7. LifeCLEF Bird Identification Task 2014
    Hervé Goëau, Hervé Glotin, Willem-Pier Vellinga, Robert Planqué, Andreas Rauber, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2014, Sheffield, United Kingdom. 2014, Information Access Evaluation meets Multilinguality, Multimodality, and Interaction.
  8. Instance-based bird species identication with undiscriminant features pruning - LifeCLEF 2014
    Alexis Joly, Julien Champ, Olivier Buisson
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2014, Sheffield, United Kingdom. 2014, Information Access Evaluation meets Multilinguality, Multimodality, and Interaction.
  9. Lifeclef 2014: multimedia life species identification challenges
    Alexis Joly, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Pierre Bonnet, Willem-Pier Vellinga, Robert Planque, Andreas Rauber, Bob Fisher, Henning Müller
    CLEF: Conference and Labs of the Evaluation forum, Sep 2014, Sheffield, United Kingdom. 5th International Conference of the CLEF Initiative, CLEF 2014, Sheffield, UK, September 15-18, 2014. Proceedings, LNCS (8685), pp.229-249, 2014, Information Access Evaluation. Multilinguality, Multimodality, and Interaction.
  10. Exploiting Diversification in Gossip-Based Recommendation
    Maximilien Servajean, Esther Pacitti, Miguel Liroz-Gistau, Sihem Amer-Yahia, Amr El Abbadi
    Globe'2014: 7th International Conference, Sep 2014, Munich, Germany. INRIA-SILICONVALLEY, LNCS (8648), pp.25-36, 2014, Data Management in Cloud, Grid and P2P Systems.
  11. Scientific Workflow Partitioning in Multi-site Clouds
    Ji Liu, Esther Pacitti, Patrick Valduriez, Vitor Silva Souza, Marta Mattoso
    L. Lopes. BigDataCloud'2014: 3rd Workshop on Big Data Management in Clouds in conjunction with Euro-Par 2014, Aug 2014, Porto, Portugal. Springer, Lecture Notes in Computer Science, 8805, pp.105-116, 2014, LNCS.
  12. Towards Efficient Power Management in MapReduce: Investigation of CPU-Frequencies Scaling on Power Efficiency in Hadoop
    Shadi Ibrahim, Diana Moise, Houssem-Eddine Chihoub, Alexandra Carpen-Amarie, Luc Bougé, Gabriel Antoniu
    Workshop on Adaptive Resource Management and Scheduling for Cloud Computing, Held in conjunction with PODC 2014, Jul 2014, Paris, France.
  13. Pl@ntNet Mobile 2014: Android port and new features
    Hervé Goëau, Bonnet Pierre, Alexis Joly, Antoine Affouard, Vera Bakić, Julien Barbe, Samuel Dufour-Kowalski, Souheil Selmi, Yahiaoui Itheri, Christel Vignau, Daniel Barthelemy, Nozha Boujemaa
    ICMR 2014 International Conference on Multimedia Retrieval, Apr 2014, Glasgow, France. <10.1145/2578726.2582618>
  14. LifeCLEF: Multimedia Life Species Identification
    Alexis Joly, Robert Planque, Concetto Spampinato, Henning Müller, Hervé Goëau, Andreas Rauber, Bonnet Pierre, Willem-Pier Vellinga, Robert B. Fisher, Hervé Glotin
    EMR 2014, 1st International Workshop on Environnmental Multimedia Retrieval co-located with ACM International Conference on Multimedia Retrieval (ICMR 2014), Apr 2014, Glasgow, United Kingdom.

2013

  1. Small objects query suggestion in a large web-image collection
    Pierre Letessier, Nicolas Hervé, Champ Julien, Alexis Joly, Olivier Buisson, Amel Hamzaoui
    MM'13: ACM Multimedia, Oct 2013, Barcelone, Spain. ACM, 2013. <10.1145/2502081.2502248>
  2. The Imageclef Plant Identification Task 2013
    Alexis Joly, Hervé Goëau, Pierre Bonnet, Vera Bakić, Jean-François Molino, Daniel Barthélémy, Nozha Boujemaa
    International workshop on Multimedia analysis for ecological data, Oct 2013, Barcelone, Spain. 2013.
  3. Pl@ntNet Mobile App
    Hervé Goëau, Pierre Bonnet, Alexis Joly, Vera Bakić, Julien Barbe, Souheil Selmi, Jennifer Carré, Daniel Barthélémy, Nozha Boujemaa, Jean-François Molino, Grégoire Duché, Aurélien Perronet
    ACM Multimedia, Oct 2013, Barcelone, Spain. ACM, pp.423-424, 2013.
  4. OTmedia: The French Transmedia News Observatory
    Nicolas Hervé, Marie-Luce Viaud, Jérôme Thievre, Agnès Saulnier, Pierre Letessier, Julien Champ, Olivier Buisson, Alexis Joly
    MM '13: 21st ACM international conference on Multimedia, Oct 2013, Barcelone, Spain. pp.441-442, 2013.
  5. A Density-Based Backward Approach to Isolate Rare Events in Large-Scale Applications
    Enikö Székely, Pascal Poncelet, Florent Masseglia, Maguelonne Teisseire, Renaud Cezar
    Johannes Fürnkranz and Eyke Hüllermeier and Tomoyuki Higuchi. DS: Discovery Science, Oct 2013, Singapore, Singapore. Springer, pp.249-264, 2013, Lecture Notes in Computer Science.
  6. Algebraic Dataflows for Big Data Analysis
    Dias Jonas, Eduardo Ogasawara, Oliveira Daniel De, Fabio Porto, Patrick Valduriez, Marta Mattoso
    BigData'2013: International Conference on Big Data, Oct 2013, Santa Clara, United States. IEEE, pp.6, 2013.
  7. Fast and Exact Mining of Probabilistic Data Streams
    Reza Akbarinia, Florent Masseglia
    PKDD'2013: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, Sep 2013, Prague, Czech Republic. Springer, pp.493-508, 2013, Lecture Notes in Computer Science.
  8. Inria's participation at ImageCLEF 2013 Plant Identification Task
    Vera Bakić, Sofiène Mouine, Saloua Ouertani-Litayem, Anne Verroust-Blondet, Itheri Yahiaoui, Hervé Goëau, Alexis Joly
    CLEF (Online Working Notes/Labs/Workshop) 2013, Sep 2013, Valencia, Spain. 2013.
  9. The ImageCLEF 2013 Plant Identification Task
    Hervé Goëau, Pierre Bonnet, Alexis Joly, Vera Bakić, Daniel Barthélémy, Nozha Boujemaa, Jean-François Molino
    CLEF, Sep 2013, Valencia, Spain. 2013.
  10. Imageclef 2013: the vision, the data and the open challenges
    Caputo Barbara, Muller Henning, Thomee Bart, Villegas Mauricio, Roberto Paredes, David Zellhofer, Hervé Goëau, Alexis Joly, Bonnet Pierre, Jesus Martinez Gomez, Ismael Garcia Varea, Miguel Cazorla
    CLEF 2013 - 4th Conference and Labs of the Evaluation Forum : Information Access Evaluation meets Multilinguality, Multimodality, and Visualization, Sep 2013, Valencia, Spain. Springer, LNCS, 8138, pp.250-268, 2013, CLEF 2013: Information Access Evaluation. Multilinguality, Multimodality, and Visualization.
  11. Data Partitioning for Minimizing Transferred Data in MapReduce
    Miguel Liroz-Gistau, Reza Akbarinia, Divyakant Agrawal, Esther Pacitti, Patrick Valduriez
    Hameurlain, Abdelkader and Rahayu, Wenny and Taniar, David. Globe'2013: 6th International Conference on Data Management in Cloud, Grid and P2P Systems, Aug 2013, Prague, Czech Republic. Springer, pp.1-12, 2013, LNCS.
  12. The Price is Right: Models and Algorithms for Pricing Data
    Tang Ruiming, Wu Huayu, Bao Zhifeng, Bressan Stephane, Patrick Valduriez
    Hendrik Decker and Lenka Lhotska and Sebastian Link. DEXA'2013: 24th International Conference on Database and Expert Systems Applications, Aug 2013, Czech Republic. Springer, pp.380-394, 2013.
  13. What you Pay for is What you Get
    Tang Ruiming, Shao Dongxu, Stephane Bressan, Patrick Valduriez
    Hendrik Decker and Lenka Lhotska and Sebastian Link. DEXA'2013: 24th International Conference on Database and Expert Systems Applications, Aug 2013, Prague, Czech Republic. Springer, pp.395-409, 2013.
  14. WebSmatch: a tool for Open Data
    Emmanuel Castanier, Remi Coletta, Patrick Valduriez, Christian Frisch
    WOD: Workshop on Open Data, Jun 2013, Paris, France. 2nd International Workshop on Open Data, pp.#10, 2013.
  15. Profile Diversity in Search and Recommendation
    Maximilien Servajean, Esther Pacitti, Sihem Amer-Yahia, Pascal Neveu
    Ido Guy; Michelle X. Zhou; Li Chen. SRS: Social Recommender Systems (in conjunction WWW 2013), May 2013, Rio de Janeiro, Brazil. IW3C2, International World Wide Web Conference Committee (IW3C2) - SRS 2013: 4th International Workshop on Social Recommender Systems (in conjunction WWW 2013 Companion, ACM 978-1-4503-2038-2/13/05., pp.973-980, 2013.
  16. Mining frequent itemsets over tuple-evolving data streams
    Chongsheng Zhang, Yuan Hao, Mirjana Mazuran, Carlo Zaniolo, Hamid Mousavi, Florent Masseglia
    SAC'13: Symposium on Applied Computing, Mar 2013, Coimbra, Portugal. pp.267-274, 2013.
  17. Opening the Black Box of Ontology Matching
    Duy Hoa Ngo, Zohra Bellahsene, Konstantin Todorov
    ESWC'2013: 10th Semantics and Big Data, Montpellier, France. pp.16-30, 2013.

Publications majeures depuis 2008

R. Akbarinia, P. Valduriez, G. Verger, Efficient Evaluation of SUM Queries Over Probabilistic Data. IEEE Transactions on Knowledge and Data Engineering, Data. Vol. 25, No. 4, 764-775, 2013.

M. El Dick, E. Pacitti, R. Akbarinia, B. Kemme, Building a Peer-to-Peer Content Distribution Network with High Performance, Scalability and Robustness, Information Systems, Vol. 36, No 2, p. 222-247, 2011.

P. Letessier, O. Buisson, A. Joly, N. Boujemaa, Scalable Mining of Small Visual Objects, ACM Multimedia Conf.,  2012.

E. Ogasawara, D. De Oliveira, P. Valduriez, J. Dias, F. Porto, M. Mattoso, An Algebraic Approach for Data-Centric Scientific Workflows, Proceedings of VLDB, Vol. 4, No 11, p. 1328-1339, 2011. 

F. Petitjean, F. Masseglia, P. Gançarski, G. Forestier, Discovering Significant Evolution Patterns from Satelllite Image Time Series, International Journal of Neural Systems, Vol. 21, No 6, 475-489, 2011.

Mots-clés

Big data, Données scientifiques, Gestion de données distribuées et parallèles, Analyse et fouille de données, Recommandation et recherche de contenus, Communautés en ligne, Workflows scientifiques, Intégration, Confidentialité, Recherche d’information par contenu, P2P, Grid, Cloud

Dernière mise à jour le 29/03/2018