Needing to removing a leading zero from a PDF index

Previous topic - Next topic

dstrong

Hi all,

I'm using a PDF indexer to grab a store code from a report, which should be fairly straight-forward except the code is 5 characters in the report with a leading zero, and I only want to index the last 4 characters. The indexer won't let me choose just the 4, it only recognizes the whole chunk. I've tried using a mask of ####, but that grabs the first 4 characters. Using the character removal option under the Load Information tab won't work either, as I need the index to show as 0065 for example, not 65. What are my other options here? Thank you for any help.

Greg Ira

Is there some trailing character after the 5 character store code you could use in the mask?  For example if you had 01234:  You could use MASK='####:' then use character removal to remove the colon.

dstrong

Unfortunately the store number is the last thing on the line. I went ahead and tried it with '#### ' and removing trailing spaces, but it failed.

paulaj@us.ibm.com

If you are using V9.5 on MP, you can use a regular expression on the PDF indexer FIELD parameter.  See the documentation at http://www-01.ibm.com/support/docview.wss?rs=86&uid=swg27041745#docs under PDF Indexer, Using Regular Expressions.