| View previous topic :: View next topic |
| Author |
Message |
rhimbo General User

Joined: 16 Oct 2005 Posts: 12
|
Posted: Tue Jun 30, 2009 6:36 pm Post subject: How to save a file in UTF-8 or Unicode charset format? |
|
|
Hi folks,
How can I save a file
I have inherited a Word file containing Japanese text (mixed Hiragana, Katakana and Kanji). I've opened it in OOo writer. I would like to save it in UTF-8. How do I do that?
I believe it was created on Windows and appears to be encoded using CP1252.
Many thanks in advance. |
|
| Back to top |
|
 |
Robert Tucker Moderator


Joined: 16 Aug 2004 Posts: 3367 Location: Manchester UK
|
Posted: Wed Jul 01, 2009 1:53 am Post subject: |
|
|
As far as I know OpenOffice creates and saves documents in UTF-8. If you've opened the document in OpenOffice and all the characters appear OK, I think you should be able to just go ahead and save it. If you need to change the encoding before you open it in OpenOffice that may be more difficult. _________________ LibreOffice 3.6.6 on Fedora 18, LibreOffice 4.0.2 on Ubuntu 13.04 (Double Boot) |
|
| Back to top |
|
 |
rhimbo General User

Joined: 16 Oct 2005 Posts: 12
|
Posted: Wed Jul 01, 2009 10:24 am Post subject: Can't paste Japanese text into Eclipse file |
|
|
Hello Robert,
I can read the Japanese characters just fine. I forced a re-save of the file (after adding some spaces to force a save). I also did a "save as" to produce a new version of the file.
I tried to copy and paste a few lines from each of these files into a file that I was editing using the Eclipse IDE. The paste worked and I can see all the Japanese characters. However, I cannot save the file. I get this error:
| Quote: |
"Save could not be completed.
Reason:
Some characters cannot be mapped using "Cp1252" character encoding...." |
I know Cp1252 is a Microsoft encoding. It might be the original encoding with which the file was created. So I'm not sure what the file format or encoding is after I open and resave in OOo.
I am trying to get cygwin installed today to use the good 'ol Unix "od" program to look at the file bytes and see if the first few bytes indicate the file encoding.
Any ideas on what's going on?
Thanks again. |
|
| Back to top |
|
 |
Robert Tucker Moderator


Joined: 16 Aug 2004 Posts: 3367 Location: Manchester UK
|
Posted: Wed Jul 01, 2009 10:33 am Post subject: |
|
|
As it says here:
http://www.velocityreviews.com/forums/t373006-cp1252-problem.html
Cp1252 may be the default setting for Eclipse.
File > Properties > Text file encoding >choose UTF-8 or enter it. _________________ LibreOffice 3.6.6 on Fedora 18, LibreOffice 4.0.2 on Ubuntu 13.04 (Double Boot) |
|
| Back to top |
|
 |
rhimbo General User

Joined: 16 Oct 2005 Posts: 12
|
Posted: Wed Jul 01, 2009 10:37 am Post subject: Must be an Eclipse configuration problem |
|
|
Yeah, I forgot to mention that I tried that... still can't save it.
But thanks anyway. I'm going to post to the Eclipse forum. Maybe there is some other configuration that needs setting that I overlooked.
Thanks again.... |
|
| Back to top |
|
 |
|