www.emsdn.com
Class Profile: Home »» Samba [Samba] under "Samba" »»» iocharset iso8859-1 vs cp1252

iocharset iso8859-1 vs cp1252


It is my understanding that although very similar, there are subtle
differences ( )
between iso8859-1 and Windows-1252, especially in the area of the C1
Control Code set ( and
).
I have a file on a Windows 2003 server which I believe is using the 0x0096
EN-DASH character in the filename, and I am finding this file impossible to
read from a linux box using mount -t cifs -o iocharset=iso8859-1. I don't
seem to be able to assign iocharset=cp1252 which is what I want. It appears
to me that a cp1252 doesn't even exist (
). Is there some reason for
this? Am I completely mistaken in all of this, and there is really some
other silly reason I cannot read the file?
linux-cifs-client mailing list
linux-cifs-client (AT) lists (DOT) samba.org


No. 1# | By Developer Tags User at [2008-5-5] | size: 885 bytes

Mon, 2006-10-16 at 16:56 -0500, bugzilla KevinH wrote:
It is my understanding that although very similar, there are subtle
differences
( )
between iso8859-1 and Windows-1252, especially in the area of the C1
Control Code set
( and
).

I have a file on a Windows 2003 server which I believe is using the
0x0096 EN-DASH character in the filename, and I am finding this file
impossible to read from a linux box using mount -t cifs -o
iocharset=iso8859-1. I don't seem to be able to assign
iocharset=cp1252 which is what I want. It appears to me that a cp1252
doesn't even exist ( ).
Is there some reason for this? Am I completely mistaken in all of
this, and there is really some other silly reason I cannot read the
file?

Why don't you just use utf8 ?
Windows uses utf16 so utf8 is the best match.

Simo.

No. 1# | By Developer Tags User at [2008-5-5] | size: 1727 bytes

10/16/06, simo <idra (AT) samba (DOT) orgwrote:
Mon, 2006-10-16 at 16:56 -0500, bugzilla KevinH wrote:
It is my understanding that although very similar, there are subtle
differences
( )
between iso8859-1 and Windows-1252, especially in the area of the C1
Control Code set
( and
).

I have a file on a Windows 2003 server which I believe is using the
0x0096 EN-DASH character in the filename, and I am finding this file
impossible to read from a linux box using mount -t cifs -o
iocharset=iso8859-1. I don't seem to be able to assign
iocharset=cp1252 which is what I want. It appears to me that a cp1252
doesn't even exist ( ).
Is there some reason for this? Am I completely mistaken in all of
this, and there is really some other silly reason I cannot read the
file?

Why don't you just use utf8 ?
Windows uses utf16 so utf8 is the best match.

Well, I had a reason, but it turns out it wasn't a very good one.

The reason I wasn't using utf8 was because I was reading a file
generated by a program which I have no control over, which contained a
list of files on the windows server for me to copy. That file is in
CP1252.

But you are absolutely right, rather than depending on the filesystem
drivers to handle the conversion for me, I can just as easily convert
the CP1252 strings I read out of the file to UTF8.

I still think there is a gap in coverage of iocharsetsbut my
problem is fixed, so I have no good reason to ask for a CP1252
iocharset anymore.

Thanks Simo.
-Kevin

linux-cifs-client mailing list
linux-cifs-client (AT) lists (DOT) samba.org



Samba Hot!

Samba New!


Copyright © 2008 www.emsdn.com • All rights reserved • CMS Theme by www.emsdn.com - 0.375