[Project-ideas] follow up discussions - improve accuracy of bengali OCR

Debajyoti Nag dave0908 at gmail.com
Tue Apr 23 02:52:22 PDT 2013


Hi,
I have applied through the Melange system now, and as the system itself is
not very user-friendly, my detailed proposal is available at the
"Additional Info" link.

/regards,
Debajyoti


On Tue, Apr 23, 2013 at 9:37 AM, Sankarshan Mukhopadhyay <
sankarshan.mukhopadhyay at gmail.com> wrote:

> On Tue, Apr 23, 2013 at 7:48 AM, Debajyoti Nag <dave0908 at gmail.com> wrote:
>
> > After further thought, I believe that it's best to rely on Tesseract's
> > pre-processor for now.
> > Noise due to those factors is not language-specific, and hence the minor
> > noise corrections could be included only as an optional objective of the
> > project, to be pursued if time permits.
> > Things could always be improved.
>
> > Tesseract 3.02 has good support for some connected scripts, but Bengali
> > is not among them. However, the methodology should be useful. It's
> > mentioned in more detail in my proposal.
> >
> > Please find attached the first draft of my proposal for the project. I
> > tried to build it based on the points mentioned on the Project Ideas page.
>
> Since the proposal window has opened up, I think it would be prudent to
> use the Melange system to submit your write-up. For the benefit of
> others, the email with the attachment was stuck in the moderation queue
> and I think it is best that we have proposal discussions on the
> Melange system.
>
>
> --
> sankarshan mukhopadhyay
> <https://twitter.com/#!/sankarshan>
>



-- 
-Regards,
Debajyoti Nag
http://en.gravatar.com/dj496
http://twitter.com/aramis7d
http://dj496.wordpress.com/