Re: [sleuthkit-users] Charset encoder error when processing mbox files
Brought to you by:
carrier
From: Joseph H. <hyl...@is...> - 2018-12-11 07:21:48
|
Okay, here's what I did: I changed the contents of /etc/default/locale to remove the hard-coded Italian references in that file, changed the default locale to en_US.UTF-8, and got the same locale output as Derrick did. I then attempted to re-run the ingest... and got the same error. I then upgraded to Autopsy 4.9.1... and got the same error. I then installed Autopsy 4.9.1 in a Mint test VM, spun it up, ran it against the data... and got the same error. I am wondering if maybe I should just punt and install all of the locales? After all, this data has God-only-knows what character encoding in it. So, it's probably not a CAINE issue. It could be an issue with teh data itself. Perhaps I could import it into Thunderbird (read-only and off-network) and see if there is any strange encoding in it. Thoughts? On Mon, 2018-12-10 at 18:57 -0700, Derrick Karpo wrote: > Hi Joseph. > > I've attached my locale output below: > > dk@anubis:~$ locale > LANG=en_CA.utf8 > LANGUAGE=en_CA:en > LC_CTYPE="en_CA.utf8" > LC_NUMERIC="en_CA.utf8" > LC_TIME="en_CA.utf8" > LC_COLLATE="en_CA.utf8" > LC_MONETARY="en_CA.utf8" > LC_MESSAGES="en_CA.utf8" > LC_PAPER="en_CA.utf8" > LC_NAME="en_CA.utf8" > LC_ADDRESS="en_CA.utf8" > LC_TELEPHONE="en_CA.utf8" > LC_MEASUREMENT="en_CA.utf8" > LC_IDENTIFICATION="en_CA.utf8" > LC_ALL= > dk@anubis:~$ locale charmap > UTF-8 > > I tested my system under Autopsy 4.9.0 and 4.9.1 and both ran fine. > While I'm not convinced we are on the right track with the locales > stuff we could try something: > > $ sudo dpkg-reconfigure locales (generate "en_US.UTF-8" and set it > as the default locale) > <log out of the Caine X session> > $ locale (make sure it's all "en_US.utf8") > <test Autopsy again> > > Derrick > > On Sun, Dec 9, 2018 at 10:18 PM <hyl...@is...> wrote: > > > > > Hi Joseph. > > > > > > This question might be better asked directly to Nanni as it > > > sounds > > > like it may be Caine specific! I just tested mbox parsing under > > > Debian testing w/Autopsy 4.9.1 and didn't have any issues with > > > keyword > > > searches. > > > > > > While I don't have a copy of Caine to test with at the moment I > > > wonder > > > if it's a manifestation of your systems locale. If you fire up a > > > terminal emulator, can you send the output from 'locale' and > > > 'locale > > > charmap'? From MboxParser.java:111 in Autopsy it looks like if > > > it > > > can't detect the character encoder that it'll throw that message > > > but I > > > could be way off base here. > > > > Hi Derrick, > > > > Thank you very much for the quick reply. Below is the output of > > 'locale': > > > > jhylkema@caine-vm:~$ locale > > LANG=en_US.UTF-8 > > LANGUAGE=en_US > > LC_CTYPE="en_US.UTF-8" > > LC_NUMERIC=it_IT.UTF-8 > > LC_TIME=it_IT.UTF-8 > > LC_COLLATE="en_US.UTF-8" > > LC_MONETARY=it_IT.UTF-8 > > LC_MESSAGES="en_US.UTF-8" > > LC_PAPER=it_IT.UTF-8 > > LC_NAME=it_IT.UTF-8 > > LC_ADDRESS=it_IT.UTF-8 > > LC_TELEPHONE=it_IT.UTF-8 > > LC_MEASUREMENT=it_IT.UTF-8 > > LC_IDENTIFICATION=it_IT.UTF-8 > > LC_ALL= > > > > And below is the output of 'locale charmap': > > > > jhylkema@caine-vm:~$ locale charmap > > UTF-8 > > > > If I were a betting man, my money would be on the fact that LC_ALL > > isn't > > set. Is that environment variable set in your Debian test distro? > > > > I will also email Nanni. > > > > Thank you. > > > > > > > > Derrick > > > On Sun, Dec 9, 2018 at 1:18 AM Joseph Hylkema < > > > hyl...@is...> > > > wrote: > > > > > > > > Hi all, > > > > > > > > First post to the list. > > > > > > > > I am trying to use Autopsy to run some keyword searches on mbox > > > > files > > > > downloaded from gmail. Unfortunately, autopsy returns an > > > > error: > > > > "Error while processing: Could not find appropriate charset > > > > encoder." > > > > I am running Autopsy on Caine 10 in a KVM VM with 8GB RAM on a > > > > Lenovo > > > > P51 with a Core I7 processor. > > > > > > > > Any help would be appreciated. > > > > > > > > -- > > > > "Far better it is to dare mighty things, to win glorious > > > > triumphs, even > > > > though checkered by failure, than to take rank with those poor > > > > spirits > > > > who neither enjoy much nor suffer much, because they live in > > > > the gray > > > > twilight that knows neither victory nor defeat." > > > > > > > > -- Theodore Roosevelt, "The Strenuous Life." > > > > > > > > > > > > > > > > _______________________________________________ > > > > sleuthkit-users mailing list > > > > https://lists.sourceforge.net/lists/listinfo/sleuthkit-users > > > > http://www.sleuthkit.org > > > > |