Slow XML import

Former Member
Hia,

We have a conversion mechanism running that converts legacy PostScript output to generic PDF with formatted XML (1 pdf = 1 pdf, same naming).
On a daily basis, about 5000 files are generated and imported with an external file source job, using the XML and XPaths for metadata.
It's all working nicely, however, the import into M-Files is so slo...o..w.....

Does anyone have experience speeding up this process, e.g. by increasing the number of files per import (for example 500 instead of 100)?
The (virtual) hardware shouldn't be the issue, nor the SQL vault database.

We have a backlog of let's say a few million files, so any speed increase would be great.


PS: best wishes all!
  • Former Member
    Hia,

    OCR is not used in this case and can be ignored (as a test, importing flat TIFF images + XML files shows the same performance).
    Importing 100 documents takes between 100 and 150 seconds, so basically 1 to 1.5 seconds per file, or about 100,000 per day when running full time.

    That is a lot of course, but when you speak of millions, it means we can import about 3 million files per month, meaning we need 4-5 months of full-time importing.
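
As a rough check on these numbers, here is a minimal Python sketch of the arithmetic. The batch size and timings are the ones measured above. The 3,000,000-file backlog is only an assumed stand-in for "a few million", and the function names are purely illustrative.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds of non-stop importing


def files_per_day(batch_size: int, batch_seconds: float) -> float:
    """Files imported per day if batches run back to back, around the clock."""
    return batch_size * SECONDS_PER_DAY / batch_seconds


def days_for_backlog(backlog_files: int, batch_size: int, batch_seconds: float) -> float:
    """Days of full-time importing needed to clear a backlog of the given size."""
    return backlog_files / files_per_day(batch_size, batch_seconds)


if __name__ == "__main__":
    ASSUMED_BACKLOG = 3_000_000  # assumption: placeholder for "a few million" files

    # Measured range: 100 documents per 100 s (best case) to 150 s (worst case).
    for label, seconds in (("best case", 100), ("worst case", 150)):
        rate = files_per_day(100, seconds)
        days = days_for_backlog(ASSUMED_BACKLOG, 100, seconds)
        print(f"{label}: {rate:,.0f} files/day, {days:.1f} days for the assumed backlog")
```

With these inputs the rate works out to between 57,600 and 86,400 files per day, so the 100,000 per day mentioned above is a rounded-up best case. Every additional million files in the backlog adds roughly 12 to 17 days of full-time importing.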