We think, we simply read first as character in textfile and everybody has the problem about their language support next.
UTF-16, we could read, in C, we could say, that is very difficult always.
Someone chages the specification so often.
If we have alyways all dfinitions of All literature, we could use match program.
That solution is very easy.
All words, we could have first.
However, the store of literatures always changes as supports.
1. Storage of Literatures in UTF-16( Unicode )
We cannot open the above always.
---
Correctly reading a utf-16 text file into a string without external libraries? : https://stackoverflow.com/questions/10504044/correctly-reading-a-utf-16-text-file-into-a-string-without-external-libraries C:\Users\abjmp_000\Documents\source\filerecover\filerecover_20200620 --- .\filerecover.cpp Fri Jun 26 16:06:58 2020 1 ://20200408: created: 2 :#include3 :#include 4 : 5 :#define CHAR_MAX 256 6 :#define PARSE_NUM 256 7 : 8 :int filesize( FILE *fp ) ; 9 : 10 : 11 :int main ( int argc, char** argv) { 12 : FILE *fp; 13 : char char_word[CHAR_MAX]; 14 : char char_temp_word[CHAR_MAX]; 15 : char* filename; 16 : 17 : if ( argc < 1 ) { 18 : printf("error: there is no parameter."); 19 : exit(-1); 20 : } 21 : 22 : printf("%s \r\n", argv[0] ); 23 :// exit(-1); 24 : 25 : filename = argv[1]; 26 : 27 : fp = fopen ( filename, "rb" ); 28 : int file_end = filesize ( fp ); 29 : 30 : for( int i=0; i