Needing to removing a leading zero from a PDF index

dstrong · May 02, 2016, 03:41:41 PM

Hi all,

I'm using a PDF indexer to grab a store code from a report, which should be fairly straight-forward except the code is 5 characters in the report with a leading zero, and I only want to index the last 4 characters. The indexer won't let me choose just the 4, it only recognizes the whole chunk. I've tried using a mask of ####, but that grabs the first 4 characters. Using the character removal option under the Load Information tab won't work either, as I need the index to show as 0065 for example, not 65. What are my other options here? Thank you for any help.

Greg Ira · May 03, 2016, 12:24:49 PM

Is there some trailing character after the 5 character store code you could use in the mask? For example if you had 01234: You could use MASK='####:' then use character removal to remove the colon.

dstrong · May 03, 2016, 01:57:14 PM

Unfortunately the store number is the last thing on the line. I went ahead and tried it with '#### ' and removing trailing spaces, but it failed.

paulaj@us.ibm.com · June 07, 2016, 03:57:04 PM

If you are using V9.5 on MP, you can use a regular expression on the PDF indexer FIELD parameter. See the documentation at http://www-01.ibm.com/support/docview.wss?rs=86&uid=swg27041745#docs under PDF Indexer, Using Regular Expressions.

Needing to removing a leading zero from a PDF index

dstrong

Greg Ira

dstrong

paulaj@us.ibm.com