<div dir="ltr">Hi Sankarshan,<br><br>Having done some more reading [1], I am now positive that domain adaptability (coping with poor scans or tattered documents), which I was concerned about in my last email, is out of scope for now. However, it may be included later when trying to make the system more robust.<div>
<br clear="all"><div style>I can see that most of the work has been done with Tesseract 2.x, but I would like to look into Tesseract 3.x, which is reported to have better support for connected-script languages. I am currently trying to find out more details about how support for Hindi is implemented [2].</div>
<div><br></div><div style>At this point, I would also like to read about the proposal/work approach from last year on the same project. Could you provide me with a copy of the same?</div><div><br></div><div>[1] <a href="http://www.cvc.uab.es/icdar2009/papers/3725a671.pdf">http://www.cvc.uab.es/icdar2009/papers/3725a671.pdf<br>
</a>[2] <a href="http://research.ijcaonline.org/volume39/number6/pxc3877076.pdf">http://research.ijcaonline.org/volume39/number6/pxc3877076.pdf</a></div><div><br></div><div><br></div><div>-- </div>-Regards,<div>Debajyoti Nag</div>
<div><a href="http://twitter.com/aramis7d" target="_blank">http://twitter.com/aramis7d</a><br></div><div><br></div>
</div></div>