Hi Sankarshan,<div><br></div><div>Thank you for prompt inisitiative after talking at Kolkata bookfair. Bengali wikipedia community ( wiki source, <a href="http://bn.wikisource.org">bn.wikisource.org</a>), are ready to do a nothing except coding to crack this OCR issues. As all you know that, this will not only help for us, it will be the most awaited wishes from longtime.</div>
<div><br></div><div>Regards,<span></span></div><div>Jayanta</div><div><br><br>On Monday, February 3, 2014, Sankarshan Mukhopadhyay <<a href="mailto:sankarshan.mukhopadhyay@gmail.com">sankarshan.mukhopadhyay@gmail.com</a>> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi Rabindra,<br>
<br>
Thank you for writing in.<br>
<br>
I am replying as a top-post because I have copied in the mailing list<br>
we use to discuss project ideas (subscription interface should be<br>
available from <<a href="http://lists.ankur.org.in/listinfo.cgi/project-ideas-ankur.org.in" target="_blank">http://lists.ankur.org.in/listinfo.cgi/project-ideas-ankur.org.in</a>><br>
<br>
I have also added Jayanta Nath in the list. I met Jayanta yesterday<br>
(after a suitably long period of interactions over email) and, we<br>
ended up chatting about the usual - "how to crack this OCR issue in a<br>
manner that helps the Bengali Wikipedia community and, especially<br>
Wikisource"<br>
<br>
I am glad to note that you have taken a look at Abhishek's existing<br>
work. Have you been able to reach out to him and discuss in some level<br>
of detail the current state of the work? The voting piece is somewhat<br>
based on the concept that a larger number of users of the system can<br>
help train the system for higher degree of accuracy.<br>
<br>
<a href="http://ankur.org.in" target="_blank">ankur.org.in</a> will be putting in an application as a mentoring<br>
organization. However, the acceptance in GSoC2014 is always subject to<br>
- [1] good set of project ideas; [2] reasonable success from previous<br>
year etc. So, there is a period of waiting before one gets to know<br>
about being selected as a mentoring organization and, thereafter<br>
begins the process of selecting strong applications from students.<br>
<br>
I would recommend that you spend this time catching up with Abhishek<br>
and also Jayanta in order to be able to understand a real-life<br>
utilization of your project (should <a href="http://ankur.org.in" target="_blank">ankur.org.in</a> be selected and, you<br>
are accepted as a student)<br>
<br>
/sankarshan<br>
<br>
On Mon, Feb 3, 2014 at 12:56 PM, Rabindra Rakshit <<a href="javascript:;" onclick="_e(event, 'cvml', 'rovir2r@gmail.com')">rovir2r@gmail.com</a>> wrote:<br>
> I (Rabindra Rakshit), am interested in applying for GSOC 2014, and would<br>
> like to know if Ankur India is applying as a mentoring organization this<br>
> year also.<br>
><br>
> I am currently pursuing my B.tech in Computer Science(CSE) from College of<br>
> Engineering and Management, Kolaghat, and being born a Bengali, would love<br>
> to see my language flourish in the open source community.<br>
><br>
> I am particularly interested in the project about Improving information<br>
> retrieval methods for OCR data sets consisting of Indic scripts(Info<br>
> Rescue). I had a look on the work plan of Abhishek Gupta, the final voting<br>
> system in a general(abstract) manner is yet to be implemented.<br>
><br>
> I don't have any exact experience about OCR, but I do have experience of<br>
> working with Information Retrieval Systems, in fact, right now I am working<br>
> on Consensus Sequence Segmentation, an Unsupervised Text Segmentation<br>
> algorithm that relies entirely on statistical relationships among alphabets<br>
> in the input sequence to detect location of word boundaries. I have attached<br>
> a document of our work which is still in progress.<br>
><br>
> Link: <a href="http://arxiv.org/abs/1308.3839" target="_blank">http://arxiv.org/abs/1308.3839</a><br>
<br>
--<br>
sankarshan mukhopadhyay<br>
<<a href="https://twitter.com/#!/sankarshan" target="_blank">https://twitter.com/#!/sankarshan</a>><br>
</blockquote></div>