Appel: Call for Demos, The First International Workshop on Big Data Discovery and Curation
Thierry Hamon
thierry.hamon at UNIV-PARIS13.FR
Tue Jul 1 19:36:32 UTC 2014
Date: Sun, 29 Jun 2014 14:29:25 -0400
From: Call For Papers <cfp.2014conference at GMAIL.COM>
Message-ID: <CAPMCzSSDoV1WeJW_VN=KB+qv1DMf5fB5aHuMnKqmMcFcF+EOHQ at mail.gmail.com>
X-url: https://sites.google.com/site/bddc2014/
Call for Demos: The First International Workshop on Big Data Discovery
and Curation
Traditionally, data warehouses have been used to provide business users
ways to consolidate information from different sources for analysis and
reporting. For getting data ready for analysis, ETL
(extract-transfrom-load) is used which involves reading data from
different sources, cleaning the data, converting the format of the input
data so that it conforms to the target database, and writing it to the
target database.
Big data paradigm is changing this problem due to three V’s: volume,
velocity, and variety. In big data paradigm, potentially a large number
of data sources and data assets are considered for analytics. One needs
to discover, integrate, and analyze large volumes of diverse data
quickly.
Finding relevant data for analytics is an important data discovery
problem. Data diversity makes this problem difficult. The diversity of
the data can be due to data model; type of data—structured,
semi-structured, or unstructured; enterprise data vs. open public data;
integrating social media data, etc. One also needs to handle data
quality and data governance issues. In this workshop we invite
demonstrations displaying techniques for identifying relevant sets of
data, finding different kinds of relationships between structured,
semi-structured, and unstructured data, curating the data for further
analysis, integrating data using various join, union, and merge
techniques, validating the integrated data, and analyzing it, from
various industry domains.
Topics of interest include (but are not limited to):
- Cleaning big data
- Integration of big heterogeneous data
- Metadata extraction
- Automated rule generation
- Curating data
- Data discovery
- Provisioning and data lineage
We welcome good demonstrations, including of previously accepted
papers/demos, for this workshop. Authors need to send manuscript
describing the demo in up to 2 pages (2 column format) inclusive of all
references and figures. Manuscripts must be written in English, and
formatted according to IEEE proceedings templates. Please see the
workshop website https://sites.google.com/site/bddc2014/ for more
details.
Important dates:
Demo proposals due: July 5, 2014
Notification of acceptance: July 15, 2014
Workshop: August 24, 2014
-------------------------------------------------------------------------
Message diffuse par la liste Langage Naturel <LN at cines.fr>
Informations, abonnement : http://www.atala.org/article.php3?id_article=48
English version :
Archives : http://listserv.linguistlist.org/archives/ln.html
http://liste.cines.fr/info/ln
La liste LN est parrainee par l'ATALA (Association pour le Traitement
Automatique des Langues)
Information et adhesion : http://www.atala.org/
ATALA décline toute responsabilité concernant le contenu des
messages diffusés sur la liste LN
-------------------------------------------------------------------------
More information about the Ln
mailing list