Information Extractor - HELP!

So, I'm trying to setup the infromation extractor to capture two values, Invoice Number and Invoice Value.

Currently this is just a proof of concept so I'm using a simple spreadsheet with the following data in it:

Invoice Number: 1234

Total: 20000

I have created two custom suggesiton rules, one for Invoice Number and one for Invoice Value. Each is very simple:

{
"targetName": "Invoice Number",
"documentContentPattern": "\\b(Invoice Number)(:|\\s?)(\\s)?(?<value>\\w+)\\b",
"enabled": true,
"confidence": 0.95
}

,
{
"targetName": "Invoice Value",
"documentContentPattern": "\\b(Total)(:|\\s?)(\\s)?(?<value>\\w+)\\b",
"enabled": true,
"confidence": 0.95
}

I have also created two Suggestions, one for Invoice Number, one for Invoice Value and set them to be specific property and selected the relevant property.

The only difference is Invoice Number is a Text property, Invoice Value is a Number (Real) property.

I have tried everything I can think of and I can not get M-Files to suggest a value to me for Invoice Value. I have tried replacing:

"\\b(Total)(:|\\s?)(\\s)?(?<value>\\w+)\\b"

with 

"\\b(Total)(:|\\s?)(\\s)?(?<value>\\d+)\\b"

So I'm specifically only looking for numbers and it does nothing.

If I change my "targetName": "Invoice Number" setting to look for numbers after the word Total rather than Invoice Number, it picks up my correct Invoice Value (20000) but is obvioulsy suggesting it as my invoice number, not invoice value. It picks it up if I use \\w+ or \\d+

But whatever I try to capture to my Invoice Value suggestion - nothing. It will nto suggest anything.

Now, the only difference as I said is the property type. Can we not capture metadata suggestions for Number properties?

I couldn't see this listed as an exclusion in the instruction document?

If anyone can help I would be most grateful, I am tearing my hair out here :(

Parents
  • OK, so I thought I would add an extra property to my Invoice Class called 'Invoice Value Text' and try to capture the value (20000) into here.

    This has worked, which makes me suspect that the Information Extractor cannot suggest metadata values for Numerical Properties.

    However, for the proof of concept I'm creating, I need the Invoice Value to be in a Number property because in a workflow, I want to be able to evaluate if the value is above or below a certain amount.

    Any ideas welcomed Slight smile

  • As you found out, Information extractor cannot suggest values for numeric properties. There is an improvement request in our system with ID 153060 about this. 

Reply Children