A weird Unicode match error
 
 

I defined a very simple concept containing only one Unicode character:

concept: ~xu [徐]

You might need to put it into a file with UTF-8 encoding.

Then, on the command line,

:testpattern (~xu)  影

returns "Matched!", even though 影 is not a member of ~xu. This is so weird.

Not every Unicode character behaves like this. I have been working on a Chinese bot for several months, and this is the only conflict I have run into so far, but I suspect there may be many more.

Similarly, if I have the concept
concept: ~ying [影]

:testpattern (~ying) 徐
also matches.

For some reason, ChatScript thinks 徐 and 影 are the same.
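
A quick way to rule out an encoding problem in the concept file is to look at the code points and UTF-8 bytes of the two characters outside ChatScript. The Python sketch below is only a diagnostic aid, not part of ChatScript; the helper name `describe` is made up for illustration, and it requires Python 3.8 or later.

```python
def describe(ch: str) -> dict:
    """Return the code point and UTF-8 bytes of a single character."""
    if len(ch) != 1:
        raise ValueError("expected exactly one character")
    return {
        "char": ch,
        "codepoint": f"U+{ord(ch):04X}",
        # Space-separated hex of the UTF-8 encoding (bytes.hex(sep) needs Python 3.8+)
        "utf8": ch.encode("utf-8").hex(" "),
    }


if __name__ == "__main__":
    # Print one line per character: the character, its code point, its UTF-8 bytes
    for ch in ("徐", "影"):
        info = describe(ch)
        print(info["char"], info["codepoint"], info["utf8"])
```

Both characters encode to three UTF-8 bytes that share only the leading byte 0xE5 and differ in the remaining two, so they are distinct at the encoding level. The match therefore seems to come from how ChatScript handles the input rather than from the file itself, though this check alone does not show where.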

I am using ChatScript 7.61.

 

 