Hello,
I am trying to index a .txt file of line data that contains Arabic content. When I set up my triggers and fields with the Report Wizard, the Arabic content appears garbled, as in the screenshot below. Does anyone know how to index .txt files with Arabic content so that it displays normally, similar to the option in the PDF indexer (output hexadecimal strings)?
Which codepage are you using for processing / displaying this? I'd suspect you'd need the file produced in Unicode, and then use the corresponding Unicode codepage to match it when working with ACIF. Otherwise, I suspect you're mixing codepages, which I don't think would work at all.
-JD.
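To illustrate what mixing codepages does, here is a minimal Python sketch. It assumes the data is UTF-8 and is then read with a single-byte codepage (850 is used purely as an example). The sample string and the `decode_as` helper are hypothetical and have nothing to do with ACIF itself.

```python
# Sketch: the same bytes read under two different codepages.
# SAMPLE and decode_as are illustrative names, not part of any CMOD/ACIF API.

# "Invoice " followed by an Arabic word, written with escapes to avoid
# bidirectional rendering issues in the source text.
SAMPLE = "Invoice \u0641\u0627\u062a\u0648\u0631\u0629 2020"


def decode_as(data: bytes, codepage: str) -> str:
    """Decode raw bytes with the given codepage, replacing undecodable bytes."""
    return data.decode(codepage, errors="replace")


utf8_bytes = SAMPLE.encode("utf-8")

as_utf8 = decode_as(utf8_bytes, "utf-8")    # matching codepage: text survives
as_cp850 = decode_as(utf8_bytes, "cp850")   # mismatched codepage: Arabic is garbled

print(as_utf8)
print(as_cp850)
```

The ASCII part survives either way, because it is encoded identically in both codepages. Only the Arabic letters, which take two bytes each in UTF-8, turn into unrelated Western European characters.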
Quote from: Justin Derrick on July 13, 2020, 04:27:40 PM
Which codepage are you using for processing / displaying this? I'd suspect you'd need the file produced in Unicode, and then use the corresponding Unicode codepage to match it when working with ACIF. Otherwise, I suspect you're mixing codepages, which I don't think would work at all.
-JD.
Hello JD,
Thanks for the reply. I am using codepage 850 for processing. The file is produced in UTF-8, but it contains mixed English and Arabic content, so I can't find a codepage that renders the file as it is. The only one I found was Codepage=(1208), but it shows one line fine and on the next line the characters are separated from each other, as in the screenshot below.
Looks like you need to work with the team that produced the file to ensure it's properly formatted for Unicode.
-JD.
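One way to check whether the file really is well-formed UTF-8 on every line, before going back to the team that produced it, is a quick script like the sketch below. The function name and the sample data are made up for illustration. It only detects byte sequences that are not valid UTF-8, so it would not catch shaping or display problems in a viewer.

```python
# Sketch: report which lines of a byte stream are not valid UTF-8.
# find_invalid_utf8_lines is a hypothetical helper, not a CMOD/ACIF tool.
from typing import List


def find_invalid_utf8_lines(data: bytes) -> List[int]:
    """Return the 1-based numbers of lines that fail to decode as UTF-8."""
    bad = []
    # Splitting on the newline byte is safe: 0x0A never occurs inside a
    # multi-byte UTF-8 sequence.
    for number, line in enumerate(data.split(b"\n"), start=1):
        try:
            line.decode("utf-8")
        except UnicodeDecodeError:
            bad.append(number)
    return bad


# Illustrative data: line 1 is UTF-8, line 2 is the same kind of text
# encoded in Windows-1256, line 3 is plain ASCII.
arabic = "\u0641\u0627\u062a\u0648\u0631\u0629"
mixed = ("ok " + arabic).encode("utf-8") + b"\n" + arabic.encode("cp1256") + b"\n" + b"plain"

print(find_invalid_utf8_lines(mixed))
```

For a real file, the bytes could be read with `open(path, "rb").read()`. If the script reports no bad lines, the data itself is probably fine and the problem is more likely on the display side.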
Hello JD,
It seems to work fine in the Navigator with codepage 1208, as in the screenshot below, but the CMOD client still shows the issue from the screenshot I provided before.
Thank you, JD, for your effort.
Make sure you've got the right codepage set in the Application definition so that the client knows how to display it.
At this point, it sounds like it may be a CMOD Client bug with displaying Arabic characters.
-JD.
Code page 850 is for Western European languages. It looks like code page 864 or 1256 would be correct for ASCII Arabic data.
Quote from: Darrell Bryant on July 15, 2020, 12:46:05 PM
Code page 850 is for Western European languages. It looks like code page 864 or 1256 would be correct for ASCII Arabic data.
Neither codepage worked for me. The only codepage that worked is 1208: it displays fine in the Navigator, but in the CMOD client it appears as in the screenshots in my previous replies, with some lines fine and some garbled.
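That result is consistent with the file being UTF-8, since 1208 is the IBM CCSID for UTF-8, while 1256 is a single-byte Arabic codepage with a different byte layout. A small Python sketch of the difference follows. The variable names are illustrative, and converting the file is only one possible workaround, not something confirmed in this thread.

```python
# Sketch: why UTF-8 data does not display under a single-byte Arabic codepage.
# All names here are illustrative.

word = "\u0641\u0627\u062a\u0648\u0631\u0629"  # a six-letter Arabic word

utf8_bytes = word.encode("utf-8")      # two bytes per Arabic letter
cp1256_bytes = word.encode("cp1256")   # one byte per Arabic letter

print(len(utf8_bytes), len(cp1256_bytes))

# Reading the UTF-8 bytes as if they were 1256 produces different characters.
misread = utf8_bytes.decode("cp1256", errors="replace")
print(misread == word)

# If a single-byte codepage were required, the data would have to be
# transcoded first; this round trip works only for characters that
# exist in the target codepage.
transcoded = utf8_bytes.decode("utf-8").encode("cp1256")
print(transcoded.decode("cp1256") == word)
```

Transcoding like this loses any character that the target codepage cannot represent, so keeping the data in UTF-8 and using 1208 end to end is usually the simpler route when the viewer supports it.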