Re: [sleuthkit-users] File/Directories Names using alternate enco ding formats?
Brought to you by:
carrier
From: Brian C. <ca...@sl...> - 2004-09-22 22:28:06
|
If we are considering an NTFS file system, then all names are stored as UTF-16 Unicode, but TSK takes only the lower byte and turns it into ASCII. With FAT, the original 8.3 directory entry has only ASCII and those can have characters from various code pages (I don't think the actual page is defined in the file system though). FAT long file names are stored in UTF-16 and are Unicode so they can use the Unicode name. Therefore, if you have a FAT file system with Arabic then (I think) the short name will use a code page and the long name will use Unicode. In either case, TSK may not even show you the non-ASCII name because it requires the name to be valid ASCII. This is obviously too restrictive in light of code pages and such. Once TSK becomes Unicode-aware then this will also change. brian On Sep 22, 2004, at 2:00 PM, Kucenski, Matthew A. wrote: > Although I have only limited understanding on this subject (I have > spent the > last several days studying and trying to understand the character set > mess), > I don't know that support of UNICODE is enough (at least for what I'm > trying > to do). > > I know that the ASCII character set maps directly (same bit values) to > the > UTF-8 version of UNICODE. However, other languages do not necessarily > map > directly to UNICODE. For example, a Win95 drive used in an Arab > country > will be using either the MS Win 1256 codepage or the MS DOS 720 > codepage. > Under ASCII or UNICODE, filenames using the language specific > characters in > those codepages will be garbage. > > Again, my knowledge of this subject is limited. I am also still > trying to > get my FreeBSD system to recognize althernate character sets when > mounting a > foreign disk image. This whole subject is just a big steaming pile of > crap. > Unicode is great, but there is a ton of legacy stuff behind it that is > still > floating around causing problems. |