<span style="font-family:'.HelveticaNeueUI';white-space:nowrap"><a href="http://jpetrie.myweb.uga.edu/poor_richard.html">http://jpetrie.myweb.uga.edu/poor_richard.html</a></span><br><br>On Tuesday, July 30, 2013,   wrote:<br>

<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Today's Topics:<br>

<br>

   1.  Two open positions in computational linguistics, IMS<br>

      Stuttgart (Sebastian Pado)<br>

   2.  DEADLINE EXTENSION: Web Data as a Challenge for Theoretical<br>

      Linguistics and Corpus Design (Marburg, March 5-7, 2014)<br>

      (Roland Schäfer)<br>

   3.  First release of an English lexicon for DBpedia (christina unger)<br>

   4.  DEADLINE EXTENSION: IEEE Computational Intelligence Magazine<br>

      (Impact Factor: 4.629) (Erik Cambria)<br>

   5.  WorldCIST'14 - World Conference on IST, 15 - 18 April 2014,<br>

      at Madeira Island (maria Lemos)<br>

   6.  Fwd: Call for Participation: ACL/BSNLP -- The Fourth<br>

      Workshop on Balto-Slavic Natural Language Processing (Roman Yangarber)<br>

<br>

<br>

----------------------------------------------------------------------<br>

<br>

Message: 1<br>

Date: Mon, 29 Jul 2013 12:20:58 +0200<br>

From: Sebastian Pado <<a>pado@ims.uni-stuttgart.de</a>><br>

Subject: [Corpora-List] Two open positions in computational<br>

        linguistics,    IMS Stuttgart<br>

To: <a>corpora@uib.no</a><br>

<br>

One PhD and one postdoctoral position in Computational Linguistics<br>

-----------------------------------------------------------------<br>

<br>

The research group of Theoretical Computational Linguistics at IMS,<br>

Stuttgart University, is looking to fill two positions: one PhD<br>

student position and one postdoctoral researcher position. Payment is<br>

according to the German TV-L pay scale (TV-L 13 50% and 100%,<br>

respectively; see [1] for details). Both positions are initially open<br>

for three years.<br>

<br>

The research group, headed by Sebastian Pado, is concerned primarily<br>

with the cognitive and linguistic aspects of word and phrase meaning,<br>

machine learning methods for meaning acquisition, applications of<br>

lexical semantics as well as cross-lingual semantic analysis [2].<br>

<br>

* The PhD position is in the area of semantics of<br>

discourse. Prerequisite is a master's degree or diploma in<br>

computational linguistics, computer science, or a related field.<br>

<br>

* The postdoctoral researcher is expected to have a clear research<br>

profile, contribute to the research agenda, and conduct independent<br>

research. Prerequisite is a PhD degree in computational linguistics,<br>

computer science, or a related field.<br>

<br>

Candidates for both positions are expected to have substantial<br>

previous knowledge of natural language processing, solid programming<br>

skills. Prior experience in the specific areas outlined above is a<br>

plus. Since both positions come with teaching obligations, candidates<br>

need to have a good command of German, or be willing to acquire it<br>

quickly.<br>

<br>

Please send applications (a single PDF file containing a motivation<br>

letter, CV, list of publications, research statement, and names of two<br>

references) by email to Sebastian Pado, <a>pado@ims.uni-stuttgart.de</a>.<br>

Applications received by August 31st will receive full consideration.<br>

The positions are open until filled.<br>

<br>

For further information, send me an email to <a>pado@ims.uni-stuttgart.de</a><br>

or meet me directly at ACL 2013.<br>

<br>

Stuttgart University is an equal opportunity employer. Applications of<br>

women are strongly encouraged. Severely challenged persons will be given<br>

preference in case of equal qualifications.<br>

<br>

[1] <a href="http://oeffentlicher-dienst.info/c/t/rechner/tv-l/west?id=tv-l-2013" target="_blank">http://oeffentlicher-dienst.info/c/t/rechner/tv-l/west?id=tv-l-2013</a><br>

[2] <a href="http://www.nlpado.de/~sebastian" target="_blank">http://www.nlpado.de/~sebastian</a><br>

<br>

<br>

<br>

------------------------------<br>

<br>

Message: 2<br>

Date: Mon, 29 Jul 2013 13:30:33 +0200<br>

From: Roland Schäfer <<a>roland.schaefer@fu-berlin.de</a>><br>

Subject: [Corpora-List] DEADLINE EXTENSION: Web Data as a Challenge<br>

        for Theoretical Linguistics and Corpus Design (Marburg, March 5-7,<br>

        2014)<br>

To: list corpora <<a>corpora@uib.no</a>><br>

<br>

*DEADLINE EXTENSION: August 12, 2013*<br>

<br>

Web Data as a Challenge for Theoretical Linguistics and Corpus Design<br>

<br>

Workshop at the 36th Annual Conference of the German Linguistic Society<br>

(March 5-7, 2014 at Marburg University, Marburg/Lahn, Germany)<br>

<br>

Website:   <a href="http://hpsg.fu-berlin.de/cow/dgfs2014/" target="_blank">http://hpsg.fu-berlin.de/cow/dgfs2014/</a><br>

EasyChair: <a href="https://www.easychair.org/conferences/?conf=webtl2014" target="_blank">https://www.easychair.org/conferences/?conf=webtl2014</a><br>

<br>

Organizers:<br>

  Felix Bildhauer (Freie Universität Berlin/SFB632)<br>

  Roland Schäfer (Freie Universität Berlin)<br>

<br>

Program Committee:<br>

  Chris Biemann<br>

  Stefan Evert<br>

  Matthias Hüning<br>

  Anke Lüdeling<br>

  Alexander Mehler<br>

  Uwe Quasthoff<br>

  Amir Zeldes<br>

  Torsten Zesch<br>

  Arne Zeschel<br>

<br>

Important Dates:<br>

  First call for papers:         Monday, June 17, 2013<br>

  Second call for papers:        Friday, July 19, 2013<br>

  EXTENDED Submission Deadline:  Monday, August 12, 2013 @ 23:59 GMT<br>

  Workshop:                      March 5-7, 2014<br>

<br>

<br>

*Aim of the Workshop and Call for Papers*<br>

<br>

The huge amounts of linguistic data on the web offer exciting new<br>

possibilities in empirically based theoretical linguistics. Web-derived<br>

linguistic resources can contain greater amounts of variation as well as<br>

non-standard grammar and writing compared to traditionally compiled<br>

corpora. Also, whole new registers and genres have been described to<br>

emerge on the web. Like spoken language - although clearly distinct from<br>

it - the language found on the web can thus challenge linguistic<br>

theories which are based mainly on standard written language as well as<br>

the categories assumed within these theories. At the same time, such<br>

non-standard features make the data harder to process for computational<br>

linguists, and additional care is required in making the decision of<br>

labeling material as "noise", because it might be considered valuable<br>

data by some linguists.<br>

<br>

This workshop aims to bring together researchers working in Theoretical<br>

Linguistics and Corpus Linguistics with those who create resources from<br>

web data. The primary question of the workshop is: Which new linguistic<br>

insights can we derive from web data? Secondarily, we ask how web data<br>

is (and how it should be) processed to produce easily accessible<br>

high-quality resources and thus facilitate this kind of innovative<br>

linguistic research.<br>

<br>

Possible subjects for talks include (but are by no means restricted to):<br>

<br>

- theoretically motivated empirical studies of linguistic phenomena in<br>

  web data,<br>

- work on problems with established linguistic categories specific to<br>

  certain types of web data (problems with traditional part-of-speech<br>

  classification, syntactic categories, register and genre<br>

  classification, etc.),<br>

- problems of working with web corpora from the user's perspective in<br>

  concrete studies (low quality of: tokenization, POS tagging, named<br>

  entity recognition, etc.; availability and lack of meta data),<br>

- assessments and improvements of the quality of available and newly<br>

  designed tools and models to process or classify web data,<br>

- approaches to normalization of web data and evaluations of the<br>

  acceptability of such normalizations from a linguistic perspective,<br>

- sampling of web data (e.g., stratified vs. randomly compiled corpora,<br>

  linguistic web characterization)<br>

<br>

We invite submissions for 30 minute talks (20 minutes plus 10 minutes of<br>

discussion) about completed or ongoing original research in which web<br>

data is used or which is about the creation and/or evaluation of web<br>

data resources. The scope of the workshop is neither restricted to<br>

resources of a specific size or nature nor to any specific language(s).<br>

Submitted abstracts will be reviewed anonymously by at least two<br>

reviewers. We hope to offer authors of accepted talks the opportunity to<br>

publish an extended version of their talk in a special issue of a<br>

peer-reviewed corpus linguistics journal.<br>

<br>

<br>

*Submission Details*<br>

<br>

- Submitted abstracts for 30 minute presentations (20 minutes plus 10<br>

  minutes discussion) should be between 800 and 1,000 words long<br>

  (excluding references and tables).<br>

- Submissions must be anonymous. Please take care in removing<br>

  information from the file which could reveal your identity.<br>

- The language of all abstracts and the workshop is English.<br>

- The only accepted file format for submission is PDF.<br>

- Submission must be made via EasyChair (WEBTL-2014):<br>

  <a href="https://www.easychair.org/conferences/?conf=webtl2014" target="_blank">https://www.easychair.org/conferences/?conf=webtl2014</a><br>

- Authors of accepted papers will be asked to provide a shorter 200 word<br>

  abstract to be printed in the conference program as an MS Word or<br>

  OpenDocument file.<br>

<br>

<br>

<br>

------------------------------<br>

<br>

Message: 3<br>

Date: Mon, 29 Jul 2013 17:39:37 +0200<br>

From: christina unger <<a>cunger@cit-ec.uni-bielefeld.de</a>><br>

Subject: [Corpora-List] First release of an English lexicon for<br>

        DBpedia<br>

To: <a>corpora@uib.no</a><br>

<br>

<br>

Dear all,<br>

<br>

The Semantic Computing group at Bielefeld University [1] is happy to<br>

announce the first release of an English lexicon for DBpedia in lemon<br>

[2] format:<br>

<br>

      <a href="http://lemon-model.net/lexica/dbpedia_en" target="_blank">http://lemon-model.net/lexica/dbpedia_en</a><br>

<br>

It comprises 1,217 verbalizations covering 353 classes as well as 300<br>

properties (all those that have more than 10,000 occurrences in the<br>

DBpedia dataset). It can prove useful for a wide range of ontology-based<br>

NLP applications, such as question answering and natural language<br>

generation from RDF or SPARQL.<br>

<br>

We invite you to browse and use the lexicon, and would love to get<br>

feedback.<br>

<br>

Also, everyone is welcome to improve the lexicon, extend it, and port it<br>

to other languages (we are currently working on an initial version of a<br>

Spanish and a German lexicon). The source is available on GitHub:<br>

<br>

     <a href="https://github.com/cunger/lemon.dbpedia" target="_blank">https://github.com/cunger/lemon.dbpedia</a><br>

<br>

If you are interested in contributing, we'd be happy to hear from you!<br>

<br>

With best regards,<br>

Christina<br>

<br>

<br>

[1] <a href="http://www.sc.cit-ec.uni-bielefeld.de" target="_blank">http://www.sc.cit-ec.uni-bielefeld.de</a><br>

[2] <a href="http://lemon-model.net" target="_blank">http://lemon-model.net</a><br>

<br>

--<br>

Christina Unger, PhD<br>

Semantic Computing Group<br>

CITEC, Bielefeld University<br>

<br>

Office: H1-124<br>

Phone: 0521 106 12224<br>

Homepage: <a href="http://www.sc.cit-ec.uni-bielefeld.de/people/cunger" target="_blank">http://www.sc.cit-ec.uni-bielefeld.de/people/cunger</a><br>

<br>

<br>

<br>

<br>

------------------------------<br>

<br>

Message: 4<br>

Date: Tue, 30 Jul 2013 01:21:58 +0800<br>

From: Erik Cambria <<a>cambria@nus.edu.sg</a>><br>

Subject: [Corpora-List] DEADLINE EXTENSION: IEEE Computational<br>

        Intelligence Magazine (Impact Factor: 4.629)<br>

To: Corpora-List <<a>corpora@uib.no</a>><br>

<br>

Apologies for cross-posting.<br>

<br>

Due to the many extension requests, the submission deadline of the IEEE CIM special issue on Computational Intelligence for Natural Language Processing has been postponed to the end of this week. For more/up-to-date info, please visit <a href="http://sentic.net/cinlp" target="_blank">http://sentic.net/cinlp</a><br>


<br>

RATIONALE<br>

The textual information available on the Web can be broadly grouped into two main categories: facts and opinions. Facts are objective expressions about entities or events. Opinions are usually subjective expressions that describe people's sentiments, appraisals, or feelings towards such entities and events. Much of the existing research on textual information processing has been focused on mining and retrieval of factual information, e.g., text classification, text recognition, text clustering, and many other text mining and natural language processing (NLP) tasks. Little work had been done on the processing of opinions until only recently.<br>


<br>

One of the main reasons for the lack of studies on opinions is the fact that there was little opinionated text available before the recent passage from a read-only to a read-write Web. Before that, in fact, when people needed to make a decision, they typically asked for opinions from friends and family. Similarly, when organizations wanted to find the opinions or sentiments of the general public about their products and services, they had to specifically ask people by conducting opinion polls and surveys.<br>


<br>

However, with the advent of the Social Web, the way people express their views and opinions has dramatically changed. They can now post reviews of products at merchant sites and express their views on almost anything in Internet forums, discussion groups, and blogs. Such online word-of-mouth behavior represents new and measurable sources of information with many practical applications. Nonetheless, finding opinion sources and monitoring them can be a formidable task because there are a large number of diverse sources and each source may also have a huge volume of opinionated text.<br>


<br>

In many cases, in fact, opinions are hidden in long forum posts and blogs. It is extremely time-consuming for a human reader to find relevant sources, extract related sentences with opinions, read them, summarize them, and organize them into usable forms. Thus, automated opinion discovery and summarization systems are needed. Sentiment analysis grows out of this need: it is a very challenging NLP or text mining problem. Due to its tremendous value for practical applications, there has been an explosive growth of both research in academia and applications in the industry.<br>


<br>

All the sentiment analysis tasks, however, are very challenging. Our understanding and knowledge of the problem and its solution are still limited. The main reason is that it is a NLP task, and NLP has no easy problems. Another reason may be due to our popular ways of doing research. So far, in fact, researchers have relied a lot on traditional machine learning algorithms. Some of the most effective machine learning algorithms, however, produce no human understandable results. Apart from some superficial knowledge gained in the manual feature engineering process, in fact, such algorithms may achieve improved accuracy, but little about how and why is actually known. All such approaches, moreover, rely on syntactic structure of text, which is far from the way human mind processes natural language.<br>


<br>

TOPICS<br>

Articles are thus invited in area of computational intelligence for natural language processing and understanding. The broader context of the Special Issue comprehends artificial intelligence, knowledge representation and reasoning, data mining, artificial neural networks, evolutionary computation, and fuzzy logic. Topics include, but are not limited to:<br>


- Computational intelligence for big social data analysis<br>

- Biologically inspired opinion mining<br>

- Concept-level opinion and sentiment analysis<br>

- Computational intelligence for social media retrieval and analysis<br>

- Computational intelligence for social media marketing<br>

- Social network modeling, simulation, and visualization<br>

- Semantic multi-dimensional scaling for sentiment analysis<br>

- Computational intelligence for patient opinion mining<br>

- Sentic computing<br>

- Multilingual and multimodal sentiment analysis<br>

- Multimodal fusion for continuous interpretation of semantics<br>

- Computational intelligence for time-evolving sentiment tracking<br>

- Computational intelligence for cognitive agent-based computing<br>

- Human-agent, -computer, and -robot interaction<br>

- Domain adaptation for sentiment classification<br>

- Affective common-sense reasoning<br>

- Computational intelligence for user profiling and personalization<br>

- Computational intelligence for knowledge acquisition<br>

<br>

TIMEFRAME<br>

August 4th, 2013: Paper submission deadline<br>

September 1st, 2013: Notification of acceptance<br>

October 1st, 2013: Final manuscript due<br>

February, 2014: Publication<br>

<br>

SUBMISSION<br>

The maximum length for the manuscript is typically 25 pages in single column with double-spacing, including figures and references. Authors of papers should specify in the first page of their manuscripts corresponding author?s contact and up to 5 keywords. Submission should be made via email to one of the guest editors below.<br>


<br>

GUEST EDITORS<br>

- Erik Cambria, National University of Singapore (Singapore)<br>

- Bebo White, Stanford University (USA)<br>

- Tariq S. Durrani, Royal Society of Edinburgh (UK)<br>

- Newton Howard, MIT Media Laboratory (USA)<br>

<br>

<br>

------------------------------<br>

<br>

Message: 5<br>

Date: Mon, 29 Jul 2013 23:09:50 +0100<br>

From: "maria Lemos" <<a>marialemos72@gmail.com</a>><br>

Subject: [Corpora-List] WorldCIST'14 - World Conference on IST, 15 -<br>

        18 April 2014, at Madeira Island<br>

To: <a>corpora@uib.no</a><br>

<br>

Apologies if you are receiving this mail more than once...<br>

<br>

Please disseminate by colleagues, researchers, students, etc. Thanks a lot!<br>

<br>

<br>

**********************************************************************************<br>

                             WorldCIST'14<br>

The 2014 World Conference on Information Systems and Technologies<br>

                April 15 - 18, Madeira Island, Portugal<br>

            </blockquote>