[Corpora-List] Call for Papers: Mining User-GeneratedContent for Security - MINUCS 2009

Fri Aug 14 01:02:13 UTC 2009

Gents:

Many, many years ago, a three letter agency contracted with the firm I
worked for at the time to help them identify news stories involving these
kinds of issues. 
The firm I worked for performed similar content collection services for
government agencies (The UN, The US Senate, etc) private companies (AT&T,
Verizon, Canadian National Railroad, Deutschesbank, Canadian Wheat Board,
and virtually every IT firm you ever heard of. 
Call me under-outraged, but there is a fundamental difference between
auditing people's mail or email or telephone calls without cause and trying
to perform a function that has been performed by the likes of
Dialog/NewsEdge/Thompson/Factiva etc for years. 

Using knowledge discovery tools such as text mining, semantic
categorization, and weblinks

*          to identify content sets and content generators 

*          to prioritize content importance 

*          and to classify author and audience types as well as to determine
the level of credibility to assign authors 

has common been a common practice of public and private institutions
(especially news organizations) for better than 15 years. The purpose of
these activities was intelligence - competitive intelligence, market
intelligence, medical intelligence, academic intelligence and yes, national
security stuff.

I would submit that there is little new or threatening about someone using
knowledge tools to assess publicly published content. If you publish
something on the web or in the newspaper, there is always the possibility
that someone will read it and try to understand it better. 

I'm all for being outraged by outrages but I'm having some difficulty seeing
a basis for anyone's distress here. 

J V Bryar

-----Original Message-----
From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of
Mcenery, Tony
Sent: Wednesday, August 12, 2009 6:15 AM
To: corpora at uib.no
Subject: Re: [Corpora-List] Call for Papers: Mining User-GeneratedContent
for Security - MINUCS 2009

Well said Adam. I find the wilful blindness of this conference to the
significant moral and legal issues that it should consider startling.

  _____  

From: Adam Kilgarriff
Sent: Wed 8/12/2009 09:16
To: Jakub Piskorski
Cc: corpora at uib.no
Subject: Re: [Corpora-List] Call for Papers: Mining User-Generated Content
for Security - MINUCS 2009

> Submissions that focus on legal questions stemming
> from snooping, spying, privacy infringement or violation, etc., will not
be
> considered relevant to the Theme of the Workshop 

Hey we're the guys inventing the next Atom Bomb, we're much too busy to
discuss ethics

2009/8/11 Jakub Piskorski <jakub.piskorski at frontex.europa.eu>

-------------------
 SECOND  CALL  FOR  PAPERS
-------------------

Mining User-Generated Content for Security - MINUCS 2009

9 December 2009, Venice, Italy

web: http://www.usercentricmedia.org/workshops/minucs

email: minucs [ad] cs [dot] helsinki [dot] fi

This event is co-located with the First International Conference on User
Centric Media - UCMedia 2009 (http://www.usercentricmedia.org/) in Venice,
9-11 December 2009

---------------
 AIM AND SCOPE
---------------

The vast and growing amount of user-generated textual content, including
online news streams, blogs, electronic encyclopedias (e.g., the Wikipedia),
and other openly accessible and dynamically changing data readily available
on the Web has led to the emergence of new approaches to extracting
valuable, structured, and previously unknown information from such data. The
aim of this workshop is to bring together researchers from academia and
industry who develop technologies for mining open-source user-generated
textual data on the Web, as well as end-users interested in exploiting such
technologies for knowledge discovery. The emphasis is placed on large-scale
text mining systems and application-oriented approaches to processing
on-line textual content in the context of security-related applications.
Examples of such applications include:

 * global medical and epidemic surveillance,
 * conflict early warning,
 * early detection of man-made or environmental hazards,
 * risk assessment,
 * border surveillance,
 * cross-border crime detection,
 * terrorism counterintelligence,
 * other applications relevant for security, law enforcement, and public
health institutions

Due to a multitude of challenges of diverse and complex nature that are
related to automating the process of mining user-generated content on the
Web, we believe that this workshop will serve as a forum to bring together
researchers from different areas, including data mining, language
technology, computational linguistics, information sciences, information
retrieval and Web mining, for sharing ideas and discussion. In particular,
we believe that there is an important gap to be filled, since the
aforementioned research communities have had limited interaction previously
in the context of the topic of the Workshop. The second major goal is to
engage governmental and inter-state user communities, and to bring them
together with scientists and funding agencies.

--------------------
 TOPICS OF INTEREST
--------------------

 * Mining from news streams, blogs, document repositories, and other openly
accessible and dynamically changing data, including Web 2.0 content, for the
purpose of identifying threats to security or public health,

 * Emphasis on multilingual approaches, and work on languages other than
English,

 * Applications, such as information extraction, classification,
summarization, sentiment detection, event detection, event forecasting,
trend detection, information fusion, and more,

 * Contributions in the form of applications (working systems and
prototypes) as well as theoretical results are welcome,

 * Application domains include crisis-related event reporting, political and
environmental analysis, and medical intelligence, under the general umbrella
of the security intelligence domain,

 * Methods including machine learning, rule-based, and hybrid approaches.

NB: Please note, this Workshop welcomes all work on computational approaches
to the analysis of textual data for gathering information from openly
accessible sources only. Submissions that focus on legal questions stemming
from snooping, spying, privacy infringement or violation, etc., will not be
considered relevant to the Theme of the Workshop, and the Committee will not
be able to review them.

------------
 SUBMISSION
------------

We invite papers addressing primarily the language technology, natural
language processing, data mining and information retrieval communities, as
well as the relevant end-user groups.
Submissions are invited in two categories:

 * regular: research papers presenting novel approaches and solutions, and
 * short (posters): system demonstrations, descriptions, and work in
progress

Submissions are electronic and in PDF format via a web-based submission
server.
Authors are encouraged to use Springer LNICST style for LaTeX in producing
the PDF document.
More information on this style can be found at:
http://www.springer.com/computer?SGWID=0-146-6-564009-0

The page limit for regular papers is 6 pages, whereas short papers are
limited to 4 pages.
The information about the author(s) should be omitted in the submitted
papers since the review process wil be blind.

More detailed information about submission is available on:
http://www.usercentricmedia.org/workshops/minucs/authors.shtml

Each submissions will be reviewed by at least 3 members of the Program
Committee.
Authors of accepted papers will receive guidelines regarding how to produce
camera-ready versions.

-------------
 PUBLICATION
-------------

All workshop papers will be published in the official proceedings, Springer
Lecture Notes of the Institute for Computer Sciences, Social-Informatics and
Telecommunications Engineering - LNICST, of the main conference.

Selected workshop papers will be published in "ACM Springer Mobile Networks
and Applications (MONET) Journal Special Issue on Mobility and User-Centric
Media".

All workshop papers will be also published on the UCMedia2009 Website.

-----------------
 IMPORTANT DATES
-----------------

Paper submissions due:  27 September 2009
Notification of acceptance:  20 October 2009
Camera-ready versions due: 8 November 2009
Workhop Date: 9 December 2009

------------------------------------
 PROGRAM COMMITTEE (to be extended)
------------------------------------

Fabio Crestani (University of Lugano (USI) - Faculty of Informatics,
Switzerland)
Gregory Grefenstette (Exalead, France)
Marko Grobelnik (Jožef Stefan Institute,Slovenia)
Ben Hachey (Macquarie University, Australia)
David L. Hicks (Aalborg University, Denmark)
Mijail Kabadjov (Joint Research Centre of the European Commission, Italy)
Sadao Kurohashi (Kyoto University, Japan)
Nasrullah Memon (The Maersk Mc-Kinney Moller Institute, Denmark)
Maria Milosavljevic (Capital Markets CRC, Australia)
Marie-Francine Moens (Katholieke Universiteit Leuven, Belgium)
Horacio Saggion (University of Sheffield, UK)
Satoshi Sekine (New York University, USA)
Ralf Steinberger (Joint Research Centre of the European Commission, Italy)
Mark Stevenson (University of Sheffield, UK)

----------------------
 ORGANISING COMMITTEE
----------------------

Ulf Brefeld (Technische Universität Berlin, Department of Computer Science,
Germany)
Jakub Piskorski (FRONTEX, Research&Development, Warsaw, Poland)
Roman Yangarber (University of Helsinki, Department of Computer Science,
Finland)

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora

-- 
================================================
Adam Kilgarriff
http://www.kilgarriff.co.uk/              
Lexical Computing Ltd                   http://www.sketchengine.co.uk/
Lexicography MasterClass Ltd      http://www.lexmasterclass.com/
Universities of Leeds and Sussex       adam at lexmasterclass.com
================================================

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20090813/b911849c/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.gif
Type: image/gif
Size: 328 bytes
Desc: not available
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20090813/b911849c/attachment-0001.gif>
-------------- next part --------------
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora