Sir,<br><br>I am T. Sai Deepak, an M.Tech student at IIT Roorkee. I am presently working on "Paraphrase Detection". <br><br>For my work I need to access Wikipedia. I found your API very useful, but I am not able to download the Wikipedia data, since it is served over an FTP connection that requires authentication.<br>
<br>Is there any other way to download this data?<br><br>As mentioned in the JWPL documentation, I have downloaded the Wikipedia data from <a href="http://download.wikimedia.org/backup-index.html">http://download.wikimedia.org/backup-index.html</a><br>
<br>The three archives which I have downloaded are:<br><br> * [LANGCODE]wiki-[DATE]-pages-articles.xml.bz2<br> * [LANGCODE]wiki-[DATE]-pagelinks.sql.gz<br> * [LANGCODE]wiki-[DATE]-categorylinks.sql.gz<br><br><br>But for most of the pages I am getting the error "Page not available", even though the page is available on Wikipedia. Can you please suggest a solution for this problem?<br>
<br>Thanks<br><br>Regards<br>T. Sai Deepak<br>M.Tech CSE<br>IIT Roorkee.<br><br>