I need some information in speed of OLE with MFC. I have created some
functions to parse a MS-Word document. This functions extract all the
text in the document body, including tables, headers and footers.
Additionally, style, font size, page and line number is extracted.
What I am doing is to get all the paragraphs of the document and handle
every one of them (getting the text, getting style and font
information...). I have to do it this way because I need to find some
specific parts of the text characterized by style, font or parts of the
The challenge is that I need to parse large documents with my
functions. The documents are about 1.5MB/80 pages. At the moment, it
takes around 20-30 minutes to check these documents. This is definitely
to much. Is there any hint how the speed of OLE may be increased.
I used a profiler to see where the time is lost.
Most of the time is used for calls to
These three functions take around 90% of the overall time.
Is there a way to for speed improvement?
Or do you have any experience how long it should take to parse such
Any help is appreciated.