ACIF lineData Arabic


MHassan

Hello,

I am trying to index a .txt file containing line data with Arabic content, but when I set up my triggers and fields with the Report Wizard, the Arabic content appears garbled, as in the screenshot below. Does anyone know how to index .txt files with Arabic content so that it displays normally, similar to the option in the PDF indexer (output hexadecimal strings)?

Justin Derrick

Which codepage are you using for processing / displaying this?  I'd suspect you'd need the file produced in Unicode, and then use the corresponding Unicode codepage to match it when working with ACIF.  Otherwise, I suspect you're mixing codepages, which I don't think would work at all.

-JD.
Call:  +1-866-533-7742  or  eMail:  jd@justinderrick.com
IBM CMOD Wiki:  https://CMOD.wiki/
FREE IBM CMOD Webinars:  https://CMOD.Training/
IBM CMOD Professional Services: https://CMOD.cloud

Interests: #AIX #Linux #Multiplatforms #DB2 #TSM #SP #Performance #Security #Audits #Customizing #Availability #HA #DR

MHassan

Quote from: Justin Derrick on July 13, 2020, 04:27:40 PM
Which codepage are you using for processing / displaying this?  I'd suspect you'd need the file produced in Unicode, and then use the corresponding Unicode codepage to match it when working with ACIF.  Otherwise, I suspect you're mixing codepages, which I don't think would work at all.

-JD.

Hello JD,
Thanks for the reply. I am using code page 850 for processing. The file is produced in UTF-8, but it contains mixed English and Arabic content, so I couldn't find a code page that renders the file as it is. The only one I found was Codepage=(1208), but it shows one line fine and on the next line the characters are separated from each other, as in the screenshot below.
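
As a rough illustration of the mismatch described here (UTF-8 data read with code page 850), the following Python sketch decodes the same bytes two ways. The sample string and the `decode_as` helper are made up for this example; 1208 is the CCSID for UTF-8, which Python calls `utf-8`.

```python
# Hypothetical sample: mixed English and Arabic text, written with escapes
# so the source stays ASCII-only. The Arabic word is six letters long.
SAMPLE = "Invoice \u0641\u0627\u062a\u0648\u0631\u0629 123"

# The bytes as they would sit in a UTF-8 file.
RAW = SAMPLE.encode("utf-8")


def decode_as(data: bytes, codec: str) -> str:
    """Decode bytes with the given codec, replacing anything undecodable."""
    return data.decode(codec, errors="replace")


if __name__ == "__main__":
    # cp850 is single-byte, so each byte of a two-byte Arabic letter
    # turns into its own unrelated Western European character.
    print("cp850:", decode_as(RAW, "cp850"))
    print("utf-8:", decode_as(RAW, "utf-8"))
```

Only the UTF-8 decode returns the original text. The cp850 decode produces one character per byte, which is why the Arabic portion turns into unrelated symbols while the English portion survives.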

Justin Derrick

Looks like you need to work with the team that produced the file to ensure it's properly formatted for Unicode. 
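
One possible way to check a file before going back to the producing team is to test each line for valid UTF-8. This is only a sketch: the function names and the sample data are hypothetical, and a line that passes this check can still display badly for other reasons.

```python
import codecs


def find_invalid_utf8_lines(data: bytes) -> list[int]:
    """Return 1-based numbers of lines that are not valid UTF-8."""
    bad = []
    # Splitting on b"\n" is safe: byte 0x0A never occurs inside a
    # multi-byte UTF-8 sequence.
    for number, line in enumerate(data.split(b"\n"), start=1):
        try:
            line.decode("utf-8")
        except UnicodeDecodeError:
            bad.append(number)
    return bad


def has_utf8_bom(data: bytes) -> bool:
    """Report whether the data starts with a UTF-8 byte order mark."""
    return data.startswith(codecs.BOM_UTF8)


if __name__ == "__main__":
    arabic = "\u0641\u0627\u062a\u0648\u0631\u0629"
    # Hypothetical file: line 2 was written in cp1256 instead of UTF-8.
    sample = b"\n".join([
        ("Line one " + arabic).encode("utf-8"),
        ("Line two " + arabic).encode("cp1256"),
        b"Line three, plain ASCII",
    ])
    print("Invalid UTF-8 lines:", find_invalid_utf8_lines(sample))
    print("Has BOM:", has_utf8_bom(sample))
```

To run it against a real file, read the file in binary mode (`open(path, "rb").read()`) and pass the bytes to `find_invalid_utf8_lines`.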

-JD.

MHassan

Hello JD,

It seems to work fine in Navigator with code page 1208, as in the screenshot below, but in the CMOD client it still shows the issue from the screenshot I provided before.
Thank you, JD, for your effort.

Justin Derrick

Make sure you've got the right codepage set in the Application definition so that the client knows how to display it.

At this point, it sounds like it may be a CMOD Client bug with displaying Arabic characters.

-JD.

Darrell Bryant

Code page 850 is for Western European languages. It looks like code page 864 or 1256 would be correct for ASCII Arabic data.
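
A small Python sketch of the distinction, assuming a made-up six-letter Arabic sample: code page 1256 stores each Arabic letter in one byte, while UTF-8 uses two, so a single-byte Arabic code page only fits data that was actually written in it. Code page 864 is left out because Python's `cp864` codec maps many letters to presentation forms rather than the base letters used here.

```python
# Hypothetical sample word, written with escapes to keep the source ASCII.
TEXT = "\u0641\u0627\u062a\u0648\u0631\u0629"

AS_1256 = TEXT.encode("cp1256")  # one byte per Arabic letter
AS_UTF8 = TEXT.encode("utf-8")   # two bytes per Arabic letter


def is_valid_utf8(data: bytes) -> bool:
    """Return True if the bytes decode as UTF-8 without error."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


if __name__ == "__main__":
    print("cp1256 bytes:", len(AS_1256), "utf-8 bytes:", len(AS_UTF8))
    # Matching codec: the text round-trips.
    print(AS_1256.decode("cp1256") == TEXT)
    # UTF-8 bytes read as cp1256: decodes, but to the wrong characters.
    print(AS_UTF8.decode("cp1256", errors="replace") == TEXT)
    # cp1256 bytes are not valid UTF-8 at all.
    print(is_valid_utf8(AS_1256))
```

If the input file is really UTF-8, neither 864 nor 1256 would be expected to match it.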
#IBMi #iSeries #PDF #XML #400 Indexer #ASM

MHassan

Quote from: Darrell Bryant on July 15, 2020, 12:46:05 PM
Code page 850 is for Western European languages. It looks like code page 864 or 1256 would be correct for ASCII Arabic data.

Neither code page worked for me. The only code page that worked is 1208, which displays fine in Navigator, but in the CMOD client it appears as in the screenshots from my previous replies: some lines are fine and some are garbled.