<html><head>

<meta http-equiv="content-type" content="text/html; charset=UTF-8"></head><body

 bgcolor="#FFFFFF" text="#000000">

Apologies for multiple postings. Please distribute to colleagues. <br>

Please note that the url of the CogALex website (announced beginning 

march) has changed.<br>

The one mentioned in this message is the correct one (sorry for causing 

any inconvenience)<br>

  <br>

  <div style="text-align: center;">---------------------------------------------<br>

    <br>

  </div>

  <div style="text-align: center;">1st Call for Papers<br>

    <br>

4th Workshop on  Cognitive Aspects of the Lexicon (CogALex)<br>

together with a shared task concerning the 'lexical access-problem'<br>

    <br>

Pre-conference workshop at COLING 2014 (August 23d, Dublin, Ireland)<br>

    <br>

Submission deadline: May 25, 2014<br>

    <br>

Invited speaker : Roberto Navigli (Sapienza University of Rome)<br>

    <br>

<a class="moz-txt-link-freetext" href="http://pageperso.lif.univ-mrs.fr/~michael.zock/CogALex-IV/cogalex-webpage/index.html">http://pageperso.lif.univ-mrs.fr/~michael.zock/CogALex-IV/cogalex-webpage/index.html</a><br>

  </div>

  <br>

  <br>

GOAL<br>

  <br>

The aim of the workshop is to bring together researchers involved in the

 construction and application of electronic dictionaries to discuss 

modifications of existing resources in line with the users' needs, 

thereby fully exploiting the advantages of the digital form. Given the 

breadth of the questions, we welcome reports on work from many 

perspectives, including but not limited to: computational lexicography, 

psycholinguistics, cognitive psychology, language learning and 

ergonomics.<br>

  <br>

MOTIVATION<br>

  <br>

The way we look at dictionaries (their creation and use) has changed 

dramatically over the past 30 years. While being considered as an 

appendix to grammar in the past, by now they have moved to centre stage.

 Indeed, there is hardly any task in NLP which can be conducted without 

them. Also, rather than being static entities (data-base view), 

dictionaries are now viewed as dynamic networks, i.e. graphs, whose 

nodes and links (connection strengths) may change over time. 

Interestingly, properties concerning topology, clustering and evolution 

known from other disciplines (society, economy, human brain) also apply 

to dictionaries: everything is linked, hence accessible, and everything 

is evolving. Given these similarities, one may wonder what we can learn 

from these disciplines.<br>

 <br>

In this 4th edition of the CogALex workshop we therefore also invite 

scientists working in these fields, with the goal to broaden the 

picture, i.e. to gain a better understanding concerning the mental 

lexicon and to integrate these findings into our dictionaries in order 

to support navigation. Given recent advances in neurosciences, it 

appears timely to seek inspiration from neuroscientists studying the 

human brain. There is also a lot to be learned from other fields 

studying graphs and networks, even if their object of study is something

 else than language, for example biology, economy or society.<br>

  <br>

TOPICS OF INTEREST<br>

  <br>

This workshop is about possible enhancements of lexical resources and 

electronic dictionaries. To perform the groundwork for the next 

generation of such resources we invite researchers involved in the 

building of such tools. The idea is to discuss modifications of existing

 resources by taking the users’ needs and knowledge states into account,

 and to capitalize on the advantages of the digital media. For this 

workshop we solicit papers including but not limited to the following 

topics, each of which can be considered from various points of view: 

linguistics, neuro- or psycholinguistics (tip of the tongue problem, 

associations), network related sciences (sociology, economy, biology), 

mathematics (vector-based approaches, graph theory, small-world 

problem), etc.<br>

  <br>

1) Analysis of the conceptual input of a dictionary user<br>

  <br>

* What does a language producer start from (bag of words)?<br>

* What is in the authors' minds when they are generating a message and 

looking for a word?<br>

* What does it take to bridge the gap between this input and the desired

 output (target word)? <br>

  <br>

2) The meaning of words<br>

  <br>

* Lexical representation (holistic, decomposed)<br>

* Meaning representation (concept based, primitives)<br>

* Revelation of hidden information (distributional semantics, latent 

semantics, vector-based approaches: LSA/HAL)<br>

* Neural models, neurosemantics, neurocomputational theories of content 

representation.<br>

  <br>

3) Structure of the lexicon<br>

  <br>

* Discovering structures in the lexicon: formal and semantic point of 

view (clustering, topical structure)<br>

* Creative ways of getting access to and using word associations 

(reading between the lines, subliminal communication);<br>

* Evolution, i.e. dynamic aspects of the lexicon (changes of weights)<br>

* Neural models of the mental lexicon (distribution of information 

concerning words, organisation of words)<br>

  <br>

4) Methods for crafting dictionaries or indexes<br>

  <br>

* Manual, automatic or collaborative building of dictionaries and 

indexes (crowd-sourcing, serious games, etc.)<br>

* Impact and use of social networks (Facebook, Twitter) for building 

dictionaries, for organizing and indexing the data (clustering of 

words), and for allowing to track navigational strategies, etc.<br>

* (Semi-) automatic induction of the link type (e.g. synonym, hypernym, 

meronym, association, collocation, ...)<br>

* Use of corpora and patterns (data-mining) for getting access to words,

 their uses, combinations and associations <br>

  <br>

5) Dictionary access (navigation and search strategies, interface 

issues,...)<br>

  <br>

* Search based on sound, meaning or associations <br>

* Search (simple query vs multiple words)<br>

* Context-dependent search (modification of users’ goals during search)<br>

* Recovery<br>

* Navigation (frequent navigational patterns or search strategies used 

by people)<br>

* Interface problems, data-visualisation<br>

  <br>

6) Dictionary applications<br>

  <br>

* Methods supporting vocabulary learning (for example, creation of 

data-bases showing words in various contexts)<br>

* Tools for supporting Human translation<br>

  <br>

IMPORTANT DATES<br>

  <br>

* Deadline for paper submissions: May 25, 2014<br>

* Notification of acceptance: June 15, 2014<br>

* Camera-ready papers due : July 7, 2014<br>

* Worskhop date: August 23, 2014<br>

  <br>

SUBMISSION INFORMATION<br>

  <br>

Papers should follow the COLING main conference formatting details 

(<a class="moz-txt-link-freetext" href="http://www.coling-2014.org/call-for-papers.php">http://www.coling-2014.org/call-for-papers.php</a>) and should be submitted

 as a PDF-file via the START workshop manager at 

<a class="moz-txt-link-freetext" href="https://www.softconf.com/coling2014/WS-1/">https://www.softconf.com/coling2014/WS-1/</a> (you must register first).<br>

  <br>

Contributions can be short or long papers. Short paper submission must 

describe original and unpublished work without exceeding six (6) pages 

(references included). Characteristics of short papers include: a small,

 focused contribution; work in progress; a negative result; a piece of 

opinion; an interesting application nugget. Long paper submissions must 

describe substantial, original, completed and unpublished work without 

exceeding twelve (12) pages (references included).<br>

  <br>

Reviewing will be double blind, so the papers should not reveal the 

authors' identity. Accepted papers will be published in the workshop 

proceedings.<br>

  <br>

For further details see: <br>

<a class="moz-txt-link-freetext" href="http://pageperso.lif.univ-mrs.fr/~michael.zock/CogALex-IV/cogalex-webpage/index.html">http://pageperso.lif.univ-mrs.fr/~michael.zock/CogALex-IV/cogalex-webpage/index.html</a><br>

  <br>

  <br>

SHARED TASK ON THE LEXICAL ACCESS PROBLEM (COMPUTING ASSOCIATIONS WHEN 

GIVEN MULTIPLE STIMULI)<br>

  <br>

In the framework of the 4th Workshop on Cognitive Aspects of the Lexicon

 (CogALex) to be held at COLING 2014, we invite participation in a 

shared task devoted to the problem of lexical access in language 

production, with the aim of providing a quantitative comparison between 

different systems.<br>

  <br>

 <br>

MOTIVATION<br>

  <br>

The quality of a dictionary depends not only on coverage, but also on 

the accessibility of the information. That is a crucial point is 

dictionary access. Access strategies vary with the task (text 

understanding vs. text production) and the knowledge available at the 

very moment of consultation (words, concepts, speech sounds). Unlike 

readers who look for meanings, writers start from them, searching for 

the corresponding words. While paper dictionaries are static, permitting

 only limited strategies for accessing information, their electronic 

counterparts promise dynamic, proactive search via multiple criteria 

(meaning, sound, related words) and via diverse access routes. 

Navigation takes place in a huge conceptual lexical space, and the 

results are displayable in a multitude of forms (e.g. as trees, as 

lists, as graphs, or sorted alphabetically, by topic, by frequency).<br>

  <br>

To bring some structure into this multitude of possibilities, the shared

 task will concentrate on a crucial subtask, namely multiword 

association.  What we mean by this in the context of this workshop is 

the following. Suppose, we were looking for a word expressing the 

following ideas: 'superior dark coffee made of beans from Arabia', but 

could not remember the intended word 'mocha' due to the 

tip-of-the-tongue problem. Since people always remember something 

concerning the elusive word, it would be nice to have a system accepting

 this kind of input, to propose then a number of candidates for the 

target word. Given the above example, we might enter 'dark', 'coffee', 

'beans', and 'Arabia', and the system would be supposed to come up with 

one or several associated words such as 'mocha', 'espresso', or 

'cappuccino'.<br>

  <br>

 <br>

TASK DEFINITION<br>

  <br>

The participants will receive lists of five given words (primes) such as

 'circus', 'funny', 'nose', 'fool', and 'fun' and are supposed to 

compute the word which is most closely associated to all of them. In 

this case, the word 'clown' would be the expected response. Here are 

some more examples:<br>

  <br>

   given words:  gin, drink, scotch, bottle, soda<br>

   target word:  whisky<br>

  <br>

   given words:  wheel, driver, bus, drive, lorry<br>

   target word:  car<br>

  <br>

   given words:  neck, animal, zoo, long, tall<br>

   target word:  giraffe<br>

  <br>

   given words:  holiday, work, sun, summer, abroad<br>

   target word:  vacation<br>

  <br>

   given words:  home, garden, door, boat, chimney<br>

   target word:  house<br>

  <br>

   given words:  blue, cloud, stars, night, high<br>

   target word:  sky<br>

  <br>

We will provide a training set of 2000 sets of five input words 

(multiword stimuli), together with the expected target words 

(associative responses). The participants will have about five weeks to 

train their systems on this data. After the training phase, we will 

release a test set containing another 2000 sets of five input words, but

 without providing the expected target words. <br>

  <br>

Participants will have five days to run their systems on the test data, 

thereby predicting the target words. For each system, we will compare 

the results to the expected target words and compute an accuracy. The 

participants will be invited to submit a paper describing their approach

 and their results.<br>

  <br>

For the participating systems, we will distinguish two categories: <br>

  <br>

(1) Unrestricted systems. They can use any kind of data to compute their

 results. <br>

(2) Restricted systems: These systems are only allowed to draw on the 

freely available ukWaC corpus in order to extract information on word 

associations. The ukWaC corpus comprises about 2 billion words and is 

can be downloaded from <a class="moz-txt-link-freetext" href="http://wacky.sslmit.unibo.it/doku.php?id=corpora">http://wacky.sslmit.unibo.it/doku.php?id=corpora</a>.

 <br>

  <br>

Participants are allowed to compete in either category or in both.<br>

  <br>

  <br>

VENUE<br>

  <br>

The shared task will take place as part of the CogALex workshop which is

 co-located with COLING 2014 (Dublin). The workshop date is August 23, 

2014. Shared task participants who wish to have a paper published in the

 workshop proceedings will be required to present their work at the 

workshop.<br>

  <br>

  <br>

SHARED TASK SCHEDULE<br>

  <br>

Training data release:  March 27, 2014<br>

Test data release:  May 5, 2014<br>

Final results due:  May 9, 2014<br>

Deadline for paper submission: May 25, 2014  <br>

Reviewers' feedback:  June, 15, 2014<br>

Camera-ready version:  July 7, 2014<br>

Workshop date:  August 23, 2014<br>

  <br>

  <br>

FURTHER INFORMATION<br>

  <br>

CogALex workshop website: 

<a class="moz-txt-link-freetext" href="http://pageperso.lif.univ-mrs.fr/~michael.zock/CogALex-IV/cogalex-webpage/index.html">http://pageperso.lif.univ-mrs.fr/~michael.zock/CogALex-IV/cogalex-webpage/index.html</a><br>

Data releases: To be found on the above workshop website from the dates 

given in the schedule.<br>

Registration for the shared task: Send e-mail to Michael Zock, with 

Reinhard Rapp in copy.<br>

  <br>

  <br>

PROGRAMME COMMITTEE<br>

  <br>

* Bel Enguix, Gemma (LIF-CNRS, France)<br>

* Chang, Jason (National Tsing Hua University, Taiwan)<br>

* Cook, Paul (University of Melbourne, Australia)<br>

* Cristea, Dan (University A.I.Cuza, Iasi, Romania)<br>

* De Deyne, Simon (Experimental Psychology, Leuven, Belgium) and 

(Adelaide, Australia)<br>

* De Melo, Gerard (IIIS, Tsinghua University, Beijing, China)<br>

* Ferret, Olivier (CEA LIST, Gif sur Yvette, France)<br>

* Fontenelle, Thierry (CDT, Luxemburg)<br>

* Gala, Nuria (LIF-CNRS, Aix Marseille University, Marseille, France)<br>

* Granger, Sylviane (Université Catholique de Louvain, Belgium)<br>

* Grefenstette, Gregory (Inria, Saclay, France)<br>

* Hirst, Graeme (University of Toronto, Canada)<br>

* Hovy, Eduard (CMU, Pittsburgh, USA)<br>

* Hsieh, Shu-Kai (National Taiwan University, Taipei, Taiwan)<br>

* Huang, Chu-Ren (Hongkong Polytechnic University, China)<br>

* Joyce, Terry (Tama University, Kanagawa-ken, Japan)<br>

* Lapalme, Guy (RALI, University of Montreal, Canada)<br>

* Lenci, Alessandro (CNR, university of Pisa, Italy)<br>

* L'Homme, Marie Claude (University of Montreal, Canada)<br>

* Mihalcea, Rada (University of Texas, USA)<br>

* Navigli, Roberto (Sapienza, University of Rome, Italy)<br>

* Pirrelli, Vito (ILC, Pisa, Italy)<br>

* Polguère, Alain (ATILF-CNRS, Nancy, France)<br>

* Rapp, Reinhard (LIF-CNRS, France) and (Mainz, Germany)<br>

* Rosso, Paolo (NLEL, Universitat Politècnica de València, Spain)<br>

* Schwab, Didier (LIG-GETALP, Grenoble, France)<br>

* Serasset, Gilles (IMAG, Grenoble, France)<br>

* Sharoff, Serge (University of Leeds, UK)<br>

* Su, Jun-Ming (University of Tainan, Taiwan)<br>

* Tiberius, Carole (Institute for Dutch Lexicology, The Netherlands)<br>

* Tokunaga, Takenobu (TITECH, Tokyo, Japan)<br>

* Tufis, Dan (RACAI, Bucharest, Romania)<br>

* Valitutti, Alessandro (Helsinki Institute of Information Technology, 

Finland)<br>

* Wandmacher, Tonio (IRT SystemX, Saclay, France)<br>

* Zock, Michael (LIF-CNRS, Marseille, France), currently (University of 

Tainan, Taiwan)<br>

  <br>

 <br>

WORKSHOP ORGANIZERS and CONTACT PERSONS<br>

  <br>

* Michael Zock (LIF-CNRS, Marseille, France), michael.zock AT 

lif.univ-mrs.fr<br>

* Reinhard Rapp (University of Aix Marseille (France) and Mainz 

(Germany), reinhardrapp AT gmx.de<br>

* Chu-Ren Huang (The Hong Kong Polytechnic University, Hong Kong), 

churen.huang AT inet.polyu.edu.hk<br>

  <br>

  <br>

  <br>

  <div class="moz-signature">-- <br>

------------------------------------<br>

Michael ZOCK & Lih-Juang FANG ZOCK<br>

83, bd du Redon, Bât. E - 8<br>

F-13009 Marseille<br>

  <br>

<span style="color: rgb(0, 0, 102);">Mail</span>:                <a class="moz-txt-link-abbreviated" href="mailto:zock@free.fr">zock@free.fr</a> <br>

Tel.            +33 (0) 9.51.89.97.07<br>

portable:  +33 (0) 6.01.34.81.22<br>

------------------------------------

  </div>

</body>

</html>