<div dir="ltr"><div style><span style="font-family:arial,sans-serif;font-size:13px">Hi,</span></div><div style><span style="font-family:arial,sans-serif;font-size:13px"><br></span></div><div style><span style="font-family:arial,sans-serif;font-size:13px">>> Also, M.A.Hasnat, the developer of BanglaOCR pointed to me that the accuracy</span><br style="font-family:arial,sans-serif;font-size:13px">
<span style="font-family:arial,sans-serif;font-size:13px">>> may not be same for all domains, eg., newspaper, book, typewriting docs,</span><br style="font-family:arial,sans-serif;font-size:13px"><span style="font-family:arial,sans-serif;font-size:13px">>>etc, so, domain adaptability should be considered.</span><br style="font-family:arial,sans-serif;font-size:13px">
</div><span style="font-family:arial,sans-serif;font-size:13px"><div><span style="font-family:arial,sans-serif;font-size:13px"><br></span></div>>have you wondered why it would be so ?</span><br style="font-family:arial,sans-serif;font-size:13px">
<div style="font-family:arial,sans-serif;font-size:13px"><br></div><div style="font-family:arial,sans-serif;font-size:13px">I have not given much thought to it, but depending on the initial pre-processing, those factors (like quality of page, print, scan etc) could affect the actual input being supplied to the OCR. Assuming the input data is uniform in aspects of quality (resolution should also effect, but it should not be a difficult task to alter resolution of digital data), the OCR should have the same performance.</div>
<div style="font-family:arial,sans-serif;font-size:13px"><br>But to begin with, I would like to focus more on the post-processing, and selectively on some of the pre-processing steps, but not as a whole. I shall describe my plan in more detail in my proposal. (Still working on it, taking longer than I expected)</div>
<div style="font-family:arial,sans-serif;font-size:13px"><br></div><div style="font-family:arial,sans-serif;font-size:13px">Maybe this particular problem is more relevant to the other OCR project. ( !? )</div><div><br></div>
<div style>p.s. - apologies for starting a new thread, I only get daily digest mails, and could not figure out how to reply to the same thread.<br><br><br></div>-- <br>-Regards,<div>Debajyoti Nag</div><div><a href="http://twitter.com/aramis7d" target="_blank">http://twitter.com/aramis7d</a></div>
</div>