<?xml 
version="1.0" encoding="utf-8"?><?xml-stylesheet title="XSL formatting" type="text/xsl" href="https://clef2018.clef-initiative.eu/mc2/spip.php?page=backend.xslt" ?>
<rss version="2.0" 
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:atom="http://www.w3.org/2005/Atom"
>

<channel xml:lang="fr">
	<title>MC2 2018 Lab</title>
	<link>https://clef2018.clef-initiative.eu/mc2/</link>
	<description>MC2 CLEF Lab is centered on mining the social media sphere surrounding cultural events such as festivals and movies, It provides access for registered participants to the microbolg collection of the GAFES project funded by the French National Research Agency and lead by the University of Avignon.</description>
	<language>fr</language>
	<generator>SPIP - www.spip.net</generator>
	<atom:link href="https://clef2018.clef-initiative.eu/mc2/spip.php?id_auteur=1&amp;page=backend" rel="self" type="application/rss+xml" />




<item xml:lang="en">
		<title>Milestones and timetable 2018</title>
		<link>https://clef2018.clef-initiative.eu/mc2/spip.php?page=article&amp;id_article=5</link>
		<guid isPermaLink="true">https://clef2018.clef-initiative.eu/mc2/spip.php?page=article&amp;id_article=5</guid>
		<dc:date>2018-02-21T13:03:43Z</dc:date>
		<dc:format>text/html</dc:format>
		<dc:language>en</dc:language>
		<dc:creator>sanjuan</dc:creator>


		<dc:subject>Mile Stones</dc:subject>

		<description>
&lt;p&gt;Registration opens: 8 february 2018 (Task2) Registration closes: 30 April 2018 End Evaluation Cycle: 19 May 2018 Submission of Participant Papers [CEUR-WS]: 31 May 2018 Submission of Lab Overviews [LNCS]: 8 June 2018 Notification of Acceptance Participant Papers [CEUR-WS]: 15 June 2018 Notification of Acceptance Lab Overviews [LNCS]: 15 June 2018 Camera Ready Copy of Lab Overviews [LNCS]: 22 June 2018 Camera Ready Copy of Participant Papers and Extended Lab Overviews [CEUR-WS]: 29 June 2018 (&#8230;)&lt;/p&gt;


-
&lt;a href="https://clef2018.clef-initiative.eu/mc2/spip.php?page=rubrique&amp;id_rubrique=4" rel="directory"&gt;Organization&lt;/a&gt;

/ 
&lt;a href="https://clef2018.clef-initiative.eu/mc2/spip.php?page=mot&amp;id_mot=1" rel="tag"&gt;Mile Stones&lt;/a&gt;

		</description>


 <content:encoded>&lt;div class='rss_texte'&gt;&lt;ul class=&#034;spip&#034; role=&#034;list&#034;&gt;&lt;li&gt; Registration opens: 8 february 2018 (Task2)&lt;/li&gt;&lt;li&gt; Registration closes: 30 April 2018&lt;/li&gt;&lt;li&gt; End Evaluation Cycle: 19 May 2018&lt;/li&gt;&lt;li&gt; Submission of Participant Papers [CEUR-WS]: 31 May 2018&lt;/li&gt;&lt;li&gt; Submission of Lab Overviews [LNCS]: 8 June 2018&lt;/li&gt;&lt;li&gt; Notification of Acceptance Participant Papers [CEUR-WS]: 15 June 2018&lt;/li&gt;&lt;li&gt; Notification of Acceptance Lab Overviews [LNCS]: 15 June 2018&lt;/li&gt;&lt;li&gt; Camera Ready Copy of Lab Overviews [LNCS]: 22 June 2018&lt;/li&gt;&lt;li&gt; Camera Ready Copy of Participant Papers and Extended Lab Overviews [CEUR-WS]: 29 June 2018&lt;/li&gt;&lt;li&gt; CEUR-WS Working Notes Preview for Checking by Authors and Lab Organizers: 18-24 July 2018&lt;/li&gt;&lt;li&gt; September 10-14 2018 CLEF 2018 Conference&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;
		
		</content:encoded>


		

	</item>
<item xml:lang="en">
		<title>Task objectives and Evaluation process</title>
		<link>https://clef2018.clef-initiative.eu/mc2/spip.php?page=article&amp;id_article=17</link>
		<guid isPermaLink="true">https://clef2018.clef-initiative.eu/mc2/spip.php?page=article&amp;id_article=17</guid>
		<dc:date>2018-02-21T12:50:00Z</dc:date>
		<dc:format>text/html</dc:format>
		<dc:language>en</dc:language>
		<dc:creator>Jean-val&#232;re, olivier, sanjuan</dc:creator>



		<description>
&lt;p&gt;Objective &lt;br class='autobr' /&gt;
Vodkaster ( http://www.vodkaster.com/ ) is a French social network about movies where participants can share comments about movies under the form of microcritics not longer than a tweet. The main differences are the restricted cultural domain and the form. The objective of the task is for a given movie and microcitic and each language among French, English, Spanish, Portuguese and Arabic to provide a summary of the related microblogs. Microblogs included is a summary should (&#8230;)&lt;/p&gt;


-
&lt;a href="https://clef2018.clef-initiative.eu/mc2/spip.php?page=rubrique&amp;id_rubrique=11" rel="directory"&gt;1 - Cross Language cultural microblog search&lt;/a&gt;


		</description>


 <content:encoded>&lt;div class='rss_texte'&gt;&lt;h2 class=&#034;spip&#034;&gt;Objective&lt;/h2&gt;
&lt;p&gt;Vodkaster ( &lt;a href=&#034;http://www.vodkaster.com/&#034; class=&#034;spip_url spip_out auto&#034; rel=&#034;nofollow external&#034;&gt;http://www.vodkaster.com/&lt;/a&gt; ) is a French social network about movies where participants can share comments about movies under the form of microcritics not longer than a tweet. The main differences are the restricted cultural domain and the form. &lt;br class='autobr' /&gt;
The objective of the task is for a given movie and microcitic and each language among French, English, Spanish, Portuguese and Arabic to provide a summary of the related microblogs.&lt;br class='autobr' /&gt;
Microblogs included is a summary should provide relevant information about at least one of the following aspects:&lt;/p&gt;
&lt;ul class=&#034;spip&#034; role=&#034;list&#034;&gt;&lt;li&gt; the film mentioned in the microcritic including subject, genre, presence in festivals,&lt;br class='autobr' /&gt;
reception, audience, critics and opinions as well as actors and producers careers.&lt;/li&gt;&lt;li&gt; events like festivals mentioned in the microcritic if any, including opinions and narratives.&lt;/li&gt;&lt;li&gt; comments and critics in twitter similar to those in the microcritic if any.&lt;br class='autobr' /&gt;
Extended summaries can include microblogs about closely related films and events.&lt;br class='autobr' /&gt;
Promotional, automatic tweets or retweets are not considered as relevant. However, retweets by movie aficionados or movie makers are relevant.&lt;/li&gt;&lt;/ul&gt;&lt;h2 class=&#034;spip&#034;&gt;Task description&lt;/h2&gt;
&lt;p&gt;Browsing the VodKaster website allows french readers to get personal short comments (microcritics)&lt;br class='autobr' /&gt;
about movies. You can get similar and/or complementary opinions on twitter but they are less specific to movies and harder to find. The use case is to display to the reader a concise summary of microblogs related to the microcritics he/she is reading, considering bilingual and trilingual users that would read microblogs in other languages than French.&lt;br class='autobr' /&gt;
Summaries are exclusively made of extracts from microblog contents and can include author names if considered as informative. They should be readable and codes like external URLs and references to multimedia objects should be removed. Three different summary lengths in words are considered: 50, 150 and 250.&lt;br class='autobr' /&gt;
Summaries are intended to provide an idea of all relevant information included in the corpus.&lt;br class='autobr' /&gt;
Diversity among top ranked microblogs is important. If the summary does not provide any microblog directly related to the topic it suggests that there is none in the corpus.&lt;/p&gt;
&lt;h2 class=&#034;spip&#034;&gt;Evaluation process&lt;/h2&gt;
&lt;p&gt;Runs will be primarily evaluated on informativeness following INEX Tweet&lt;br class='autobr' /&gt;
Contextualization methodology [1] and based on the FRESA 2 [2] software extended to Arabic, French, Portuguese and Spanish. All FRESA metrics will be computed between runs top ranked microblog extracts and a textual reference to be provided by organizers. Following [1], this reference will based on both manual runs and pools from participant submissions. &lt;br class='autobr' /&gt;
Graded standard q-rels for microblogs will be automatically generated based on FRESA [2] scores to be used with standard TREC eval tools. However, due to the impact of microblog high redundancy and reposts over q-rels exhaustivity [3], these measures won't be considered as official.&lt;br class='autobr' /&gt;
Alternative Nugget-based Information Retrieval Evaluation references and scores [4] will be also tentatively provided and discussed at the Lab.&lt;br class='autobr' /&gt;
Readability of results provided by systems will be also manually checked, the user case requiring these results to be displayed to the user.&lt;/p&gt;
&lt;p&gt;&lt;i&gt;[1] INEX Tweet Contextualization task : Evaluation, results and lesson learned&lt;br class='autobr' /&gt;
Patrice Bellot , V&#233;ronique Moriceau , Josiane Mothe, Eric SanJuan , Xavier Tannier :- Inf. Process.&lt;br class='autobr' /&gt;
Manage. 52(5) : 801-819 (2016)&lt;br class='autobr' /&gt;
[2] &lt;a href=&#034;http://fresa.talne.eu&#034; class=&#034;spip_url spip_out auto&#034; rel=&#034;nofollow external&#034;&gt;http://fresa.talne.eu&lt;/a&gt;&lt;br class='autobr' /&gt;
[3] Philippe Mulhem, Lorraine Goeuriot, Nayanika Dogra, Nawal Ould Amer: TimeLine Illustration&lt;br class='autobr' /&gt;
Based on Microblogs: When Diversification Meets Metadata Re-ranking. CLEF 2017 Proceedings.&lt;br class='autobr' /&gt;
Lecture Notes in Computer Science 10456, Springer 2017 : 224-235&lt;br class='autobr' /&gt;
[4] &lt;a href=&#034;http://www.ccs.neu.edu/home/jaa/IIS-1256172/&#034; class=&#034;spip_url spip_out auto&#034; rel=&#034;nofollow external&#034;&gt;http://www.ccs.neu.edu/home/jaa/IIS-1256172/&lt;/a&gt;&lt;/i&gt;&lt;/p&gt;
&lt;h2 class=&#034;spip&#034;&gt;Submission&lt;/h2&gt;
&lt;p&gt;Submitted summaries should be in TREC like format, a tabulated file with five fields:&lt;/p&gt;
&lt;ol class=&#034;spip&#034; role=&#034;list&#034;&gt;&lt;li&gt; a run ID&lt;/li&gt;&lt;li&gt; an integer indicating its position in the summary&lt;/li&gt;&lt;li&gt; a float number as an estimation of its relevance&lt;/li&gt;&lt;li&gt; the main language of the microblog content (fr, en, es, pt or ar)&lt;/li&gt;&lt;li&gt; an extract of the microblog content with the author name if considered as relevant&lt;/li&gt;&lt;/ol&gt;
&lt;p&gt;Runs will be truncated at 50, 150 and 300 words, content will be concatenated and displayed to evaluators that will highlight relevant passages. Therefore, the concatenation of content in the last column should be readable by a human (i.e. this column needs to be readable on its own).&lt;/p&gt;
&lt;p&gt;Each team can submit up to three runs in each language (Arabic, English, French, Portuguese and Spanish). Teams will be invited to share there queries in different languages. Organizers will facilitate running submitted sets of queries on the following baseline systems.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Baseline system&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A baseline system powered by Indri is provided to participants to run complex focus nested queries.&lt;br class='autobr' /&gt;
In this index, microblogs have been merge into XML documents per autor to allow expansions. Indri XML index permits to retrieve XML elements based on nested content.&lt;/p&gt;
&lt;div class=&#034;precode&#034;&gt;&lt;pre class='spip_code spip_code_block' dir='ltr' style='text-align:left;'&gt;&lt;code&gt;&lt;!ELEMENT xml (f, m)+&gt;
&lt;!ELEMENT f (#user_id)&gt;
&lt;!ELEMENT m (i, u, l, c d, t)&gt;
&lt;!ELEMENT i (#microblog_id)&gt;
&lt;!ELEMENT u (#user)&gt;
&lt;!ELEMENT l (#ISO_language_code)&gt;
&lt;!ELEMENT c (#client&gt;
&lt;!ELEMENT d (#date)&gt;
&lt;!ELEMENT t (#PCDATA)&gt;
Example:
&lt;xml&gt;&lt;f&gt;20666489&lt;/f&gt;
&lt;m&gt;&lt;i&gt;727389569688178688&lt;/i&gt;
&lt;u&gt;soulsurvivornl&lt;/u&gt;
&lt;l&gt;en&lt;/l&gt;
&lt;c&gt;Twitter for iPhone&lt;/c&gt;
&lt;d&gt;2016-05-03&lt;/d&gt;
&lt;t&gt;RT @ndnl: Dit weekend begon het Soul Surivor Festival.&lt;/t&gt;
&lt;/m&gt;
&lt;m&gt;&lt;i&gt;727944506507669504&lt;/i&gt;
&lt;u&gt;soulsurvivornl&lt;/u&gt;
&lt;l&gt;en&lt;/l&gt;
&lt;c&gt;Facebook&lt;/c&gt;
&lt;d&gt;2016-05-04&lt;/d&gt;
&lt;t&gt;Last van een festival-hangover?&lt;/t&gt;
&lt;/m&gt;
&lt;/xml&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;/div&gt;
		
		</content:encoded>


		

	</item>
<item xml:lang="en">
		<title>Wikipedia XML corpus for summary generation</title>
		<link>https://clef2018.clef-initiative.eu/mc2/spip.php?page=article&amp;id_article=13</link>
		<guid isPermaLink="true">https://clef2018.clef-initiative.eu/mc2/spip.php?page=article&amp;id_article=13</guid>
		<dc:date>2016-10-18T16:44:45Z</dc:date>
		<dc:format>text/html</dc:format>
		<dc:language>en</dc:language>
		<dc:creator>sanjuan</dc:creator>


		<dc:subject>data</dc:subject>
		<dc:subject>CLEF 2016</dc:subject>

		<description>
&lt;p&gt;Wikipedia is under Creative Commons license, and its contents can be used to contextualize tweets or to build complex queries referring to Wikipedia entities. &lt;br class='autobr' /&gt;
We have extracted an average of 10 million XML documents from Wikipedia per year since 2012 in the four main twitter languages:- en, es, fr and pt. &lt;br class='autobr' /&gt;
These documents reproduce in an easy-to-use XML structure the contents of the main Wikipedia pages: title, abstract, section and subsections as well as Wikipedia internal links. Other (&#8230;)&lt;/p&gt;


-
&lt;a href="https://clef2018.clef-initiative.eu/mc2/spip.php?page=rubrique&amp;id_rubrique=5" rel="directory"&gt;1 - Content Analysis&lt;/a&gt;

/ 
&lt;a href="https://clef2018.clef-initiative.eu/mc2/spip.php?page=mot&amp;id_mot=2" rel="tag"&gt;data&lt;/a&gt;, 
&lt;a href="https://clef2018.clef-initiative.eu/mc2/spip.php?page=mot&amp;id_mot=3" rel="tag"&gt;CLEF 2016&lt;/a&gt;

		</description>


 <content:encoded>&lt;div class='rss_texte'&gt;&lt;p&gt;Wikipedia is under Creative Commons license, and its contents can be used to contextualize tweets or to build complex queries referring to Wikipedia entities.&lt;/p&gt;
&lt;p&gt;We have extracted an average of 10 million XML documents from Wikipedia per year since 2012 in the four main twitter languages:- en, es, fr and pt.&lt;/p&gt;
&lt;p&gt;These documents reproduce in an easy-to-use XML structure the contents of the main Wikipedia pages: title, abstract, section and subsections as well as Wikipedia internal links. Other contents such as images, footnotes and external links are stripped out in order to obtain a corpus easier to process using standard NLP tools.&lt;/p&gt;
&lt;p&gt;By comparing contents over the years, it is possible to detect long term trends&lt;/p&gt;&lt;/div&gt;
		&lt;div class="hyperlien"&gt;View online : &lt;a href="http://tc.talne.eu/" class="spip_out"&gt;Micro Blog Contextualization CLEF &amp; Inex tracks data and tools&lt;/a&gt;&lt;/div&gt;
		
		</content:encoded>


		

	</item>
<item xml:lang="en">
		<title>The festival galleries dataset</title>
		<link>https://clef2018.clef-initiative.eu/mc2/spip.php?page=article&amp;id_article=12</link>
		<guid isPermaLink="true">https://clef2018.clef-initiative.eu/mc2/spip.php?page=article&amp;id_article=12</guid>
		<dc:date>2016-10-18T16:31:57Z</dc:date>
		<dc:format>text/html</dc:format>
		<dc:language>en</dc:language>
		<dc:creator>sanjuan</dc:creator>


		<dc:subject>data</dc:subject>

		<description>
&lt;p&gt;This data set allows to experiment microblog search and stream summarization. &lt;br class='autobr' /&gt;
Microblog collection &lt;br class='autobr' /&gt;
The document collection is provided to registered participants by ANR GAFES project. It consists in a pool of more than 50M unique micro-blogs from different sources with their meta-information as well as ground truth for the evaluation. &lt;br class='autobr' /&gt;
The microblog collection contains a very large pool of public posts on Twitter using the keyword festival since June 2015. These micro-blogs are (&#8230;)&lt;/p&gt;


-
&lt;a href="https://clef2018.clef-initiative.eu/mc2/spip.php?page=rubrique&amp;id_rubrique=9" rel="directory"&gt;Data&lt;/a&gt;

/ 
&lt;a href="https://clef2018.clef-initiative.eu/mc2/spip.php?page=mot&amp;id_mot=2" rel="tag"&gt;data&lt;/a&gt;

		</description>


 <content:encoded>&lt;div class='rss_texte'&gt;&lt;p&gt;This data set allows to experiment microblog search and stream summarization.&lt;/p&gt;
&lt;h2 class=&#034;spip&#034;&gt;Microblog collection&lt;/h2&gt;
&lt;p&gt;The document collection is provided to registered participants by ANR GAFES project. It consists in a pool of more than 50M unique micro-blogs from different sources with their meta-information as well as ground truth for the evaluation.&lt;/p&gt;
&lt;p&gt;The microblog collection contains a very large pool of public posts on Twitter using the keyword festival since June 2015. These micro-blogs are collected using private archive services based on streaming API. The average of unique microblog posts (i.e. without re-twitts) between June and September is 2, 616, 008 per month. The total number of collected micro-blog posts after one year (from May 2015 to May 2016) is 50, 490, 815 (24, 684, 975 without re-posts). These micro-blog posts are available online on a relational database with associated fields.&lt;/p&gt;
&lt;p&gt;Because of privacy issues, they cannot be publicly released but can be analyzed inside the organization that purchased these archives and among collaborators under privacy agreement. The CM2 lab provides this opportunity to share this data among academic participants. These archives can be indexed, analyzed and general results acquired from them can be published without restriction.&lt;/p&gt;
&lt;h2 class=&#034;spip&#034;&gt;Linked web pages &lt;/h2&gt;
&lt;p&gt;66% of the collected micro-blog posts contain Twittert.co compressed URLs. Sometimes these URLs refer to other online services like adf.ly, cur.lv, dlvr.it, ow.ly that hide the real URL. We used the spider mode of the GNU wget tool to get the real URL, this process required multiple DNS requests.&lt;/p&gt;
&lt;p&gt;The number of unique uncompressed urls collected in one year is 11,580,788 from 641,042 distinct domains.&lt;/p&gt;
&lt;h2 class=&#034;spip&#034;&gt;Getting access to the data set for scholars&lt;/h2&gt;&lt;ol class=&#034;spip&#034; role=&#034;list&#034;&gt;&lt;li&gt; register your institution to CLEF&lt;/li&gt;&lt;li&gt; send a request by email to admin@talne.eu from the same domain as your institution with full contact information.&lt;/li&gt;&lt;li&gt; if accepted, you will receive a confidential agreement to be approved by your institution.&lt;/li&gt;&lt;li&gt; once we get back the agreement you will receive personal information to access lab data servers.&lt;/li&gt;&lt;/ol&gt;&lt;/div&gt;
		&lt;div class="hyperlien"&gt;View online : &lt;a href="http://ceur-ws.org/Vol-1609/16091197.pdf" class="spip_out"&gt;Cultural micro-blog Contextualization 2016 Workshop Overview: data and pilot tasks &lt;/a&gt;&lt;/div&gt;
		
		</content:encoded>


		

	</item>
<item xml:lang="en">
		<title>Microblog Cultural Contextualization 2017 lab introduction</title>
		<link>https://clef2018.clef-initiative.eu/mc2/spip.php?page=article&amp;id_article=11</link>
		<guid isPermaLink="true">https://clef2018.clef-initiative.eu/mc2/spip.php?page=article&amp;id_article=11</guid>
		<dc:date>2016-10-18T12:38:54Z</dc:date>
		<dc:format>text/html</dc:format>
		<dc:language>en</dc:language>
		<dc:creator>sanjuan</dc:creator>



		<description>
&lt;p&gt;These are the slides used to presented at CLEF 2016 in Evora to introduce the CM2 lab. Overall Procedure Take a microblog about an event with an url. Identify its language. Identify a related cultural event or filter it out. Reveal When, Where, Who ... Relate it to Wikipedia entities2017 Organization Task 1: language, filtering and localization lead by Toulouse, Montr&#233;al and Paris starts &#8230; now! Task 2: entity extraction, summarization and linking starts in November 2016 lead by Avignon, (&#8230;)&lt;/p&gt;


-
&lt;a href="https://clef2018.clef-initiative.eu/mc2/spip.php?page=rubrique&amp;id_rubrique=3" rel="directory"&gt;Tasks 2017&lt;/a&gt;


		</description>


 <content:encoded>&lt;div class='rss_texte'&gt;&lt;p&gt;These are the slides used to presented at CLEF 2016 in Evora to introduce the CM2 lab.&lt;/p&gt;
&lt;h2 class=&#034;spip&#034;&gt;Overall Procedure&lt;/h2&gt;&lt;ol class=&#034;spip&#034; role=&#034;list&#034;&gt;&lt;li&gt; Take a microblog about an event with an url.&lt;/li&gt;&lt;li&gt; Identify its language.&lt;/li&gt;&lt;li&gt; Identify a related cultural event or filter it out.&lt;/li&gt;&lt;li&gt; Reveal When, Where, Who ...&lt;/li&gt;&lt;li&gt; Relate it to Wikipedia entities&lt;/li&gt;&lt;/ol&gt;&lt;h2 class=&#034;spip&#034;&gt;2017 Organization&lt;/h2&gt;&lt;ul class=&#034;spip&#034; role=&#034;list&#034;&gt;&lt;li&gt; Task 1: language, filtering and localization lead by Toulouse, Montr&#233;al and Paris starts &#8230; now!&lt;/li&gt;&lt;li&gt; Task 2: entity extraction, summarization and linking starts in November 2016 lead by Avignon, London University and Syllabs.&lt;/li&gt;&lt;li&gt; Task 3: time-line illustration starts in January 2017 lead by Grenoble.&lt;/li&gt;&lt;/ul&gt;
&lt;p&gt;More inside the slides ...&lt;/p&gt;&lt;/div&gt;
		&lt;div class="hyperlien"&gt;View online : &lt;a href="https://docs.google.com/presentation/d/1d09TE5Za5AizOAOQE71WaCyTPkb81rgTUis8mlTPbQg/edit?usp=sharing" class="spip_out"&gt;Evora's CM2 lab presentation slides&lt;/a&gt;&lt;/div&gt;
		
		</content:encoded>


		

	</item>
<item xml:lang="en">
		<title>Microlog Data Set</title>
		<link>https://clef2018.clef-initiative.eu/mc2/spip.php?page=article&amp;id_article=4</link>
		<guid isPermaLink="true">https://clef2018.clef-initiative.eu/mc2/spip.php?page=article&amp;id_article=4</guid>
		<dc:date>2015-11-02T08:08:38Z</dc:date>
		<dc:format>text/html</dc:format>
		<dc:language>en</dc:language>
		<dc:creator>sanjuan</dc:creator>


		<dc:subject>data</dc:subject>

		<description>
&lt;p&gt;The document collection provided by GAFES project consists a pool of more than 70M unique microblogs from different sources with their meta-information and expanded URLs on a MySQL server. Due to legal terms the access to this database is restricted to registered participants under privacy agreement. &lt;br class='autobr' /&gt;
Along with the microblog corpus, a clean simplified xml dump of wikipedia easy to index and to process with state of the art NLP tools is made available to participants. Ground truth (&#8230;)&lt;/p&gt;


-
&lt;a href="https://clef2018.clef-initiative.eu/mc2/spip.php?page=rubrique&amp;id_rubrique=6" rel="directory"&gt;2 - MicroBlog Search&lt;/a&gt;

/ 
&lt;a href="https://clef2018.clef-initiative.eu/mc2/spip.php?page=mot&amp;id_mot=2" rel="tag"&gt;data&lt;/a&gt;

		</description>


 <content:encoded>&lt;div class='rss_texte'&gt;&lt;p&gt;The document collection provided by GAFES project consists a pool of more than 70M unique microblogs from different sources with their meta-information and expanded URLs on a MySQL server. Due to legal terms the access to this database is restricted to registered participants under privacy agreement.&lt;/p&gt;
&lt;p&gt;Along with the microblog corpus, a clean simplified xml dump of wikipedia easy to index and to process with state of the art NLP tools is made available to participants. Ground truth material is the following:&lt;/p&gt;&lt;/div&gt;
		
		</content:encoded>


		

	</item>
<item xml:lang="en">
		<title>Evaluation Methodology</title>
		<link>https://clef2018.clef-initiative.eu/mc2/spip.php?page=article&amp;id_article=3</link>
		<guid isPermaLink="true">https://clef2018.clef-initiative.eu/mc2/spip.php?page=article&amp;id_article=3</guid>
		<dc:date>2015-11-02T07:40:36Z</dc:date>
		<dc:format>text/html</dc:format>
		<dc:language>en</dc:language>
		<dc:creator>sanjuan</dc:creator>



		<description>
&lt;p&gt;Systems will be evaluated mainly on informativeness and relevance, but readability and ergonomy will be also checked. Informativeness evaluation will rely on textual references established by experts in project GAFES, following the strict methodology &lt;br class='autobr' /&gt;
at CLEF-INEX tweet contextualization track (http://inex.mmci.uni-saarland.de/tracks/qa/). Readability and ergonomy would be carried out on the output for specific festivals based on questionnaires to be filled out by lab participants. Best (&#8230;)&lt;/p&gt;


-
&lt;a href="https://clef2018.clef-initiative.eu/mc2/spip.php?page=rubrique&amp;id_rubrique=3" rel="directory"&gt;Tasks 2017&lt;/a&gt;


		</description>


 <content:encoded>&lt;div class='rss_texte'&gt;&lt;p&gt;Systems will be evaluated mainly on informativeness and relevance, but readability and ergonomy will be also checked. Informativeness evaluation will rely on textual references established by experts in project GAFES, following the strict methodology &lt;br class='autobr' /&gt;
at CLEF-INEX tweet contextualization track (&lt;a href=&#034;http://inex.mmci.uni-saarland.de/tracks/qa/&#034; class=&#034;spip_url spip_out auto&#034; rel=&#034;nofollow external&#034;&gt;http://inex.mmci.uni-saarland.de/tracks/qa/&lt;/a&gt;). Readability and ergonomy would be carried out on the output for specific festivals based on questionnaires to be filled out by lab participants. Best systems will have the opportunity to be experimented in july 2016 for real with the support of the label French Tech Culture (&lt;a href=&#034;http://frenchculture.org/digital-cultures&#034; class=&#034;spip_url spip_out auto&#034; rel=&#034;nofollow external&#034;&gt;http://frenchculture.org/digital-cultures&lt;/a&gt;).&lt;/p&gt;
&lt;p&gt;Therefore, informativeness and relevance evaluation will be automatic and reproducible while readability and ergonomy would only be available for lab participants. All systems will be required to run on a dedicated LINUX server (allowing virtual machines) provided by organizers to will have to run in real time (maximum 5s per query). Access to full micro blog data will only be authorized for applications running on this server.&lt;/p&gt;&lt;/div&gt;
		
		</content:encoded>


		
		<enclosure url="https://clef2018.clef-initiative.eu/mc2/IMG/pdf/mc2_pres-2.pdf" length="164649" type="application/pdf" />
		

	</item>



</channel>

</rss>
