| Home » Products » UltraEdit/UEStudio » Getting Started | ||||||||||||||
Unicode and UTF-8 Support |
||||||||||||||
|
Article Number: 1248 | Last Updated: Thu, Oct 20, 2011 3:32 PM
|
||||||||||||||
|
UltraEdit provides support for Unicode (16-Bit wide character) files, and for UTF-8 files and allows direct editing of Unicode files and UTF-8 files as well as conversion routines between ASCII/ANSI and Unicode or UTF-8. UltraEdit attempts to detect the file type when the file is loaded. It will look for the FF FE marker for Unicode files. For UTF-8 it will look for one of the following three occurrences:
1) File marker ( BOM) EF, BB, BF. 2) String occurrences "charset=utf-8" or "encoding=utf-8" 3) It will look for the occurrence of valid UTF-8 multi-byte characters in the first 64KB of a file.
If the file is found to be Unicode, it will be treated as such and the status bar will indicate this with "U-" being displayed in front of the file type in the status bar.
If the file is found to be UTF-8, it will be treated as such and converted internally to Unicode (16-Bit) for editing. The status bar will indicate this with "U8-" being displayed in front of the file type in the status bar. When the file is saved, it will be converted back from Unicode to UTF-8 and saved in this format.
Handling internally as Unicode allows the Unicode character messages from the OS to be used for character input. Additionally, the WM_UNICHAR message used in some versions of Windows and with some applications is supported by UltraEdit allowing Unicode characters to be entered on versions of the OS that do not support Unicode natively (Windows 95/98/Me).
The following conversions to and from Unicode/UTF-8 are available:
Unicode file support within UltraEdit is provided on Windows 95, 98, Me, NT and 2000.
As not all operating systems natively support Unicode certain functions convert from Multi-Byte to Unicode or from Unicode to Multi-Byte. If the active code page (see Windows documentation/help) does not support the characters then the conversion will not convert correctly.
The functions that convert between Multi-Byte and Unicode include Find/Replace and Column Insert. | ||||||||||||||
Attachments
There are no attachments for this article.
|
||||||||||||||
Related Articles
Debugging PHP Scripts
Viewed 1806 times since Thu, Oct 13, 2011
Insert/Overstrike Modes
Viewed 1621 times since Thu, Oct 13, 2011
Code Folding
Viewed 1355 times since Thu, Oct 13, 2011
Large File Handling
Viewed 1946 times since Thu, Oct 20, 2011
Scripting commands
Viewed 8976 times since Thu, Oct 13, 2011
Auto Recovery command
Viewed 1319 times since Thu, Oct 20, 2011
Using Help
Viewed 1458 times since Thu, Oct 13, 2011
Blocking Text / Column Mode
Viewed 1452 times since Thu, Oct 13, 2011
Cursor Arrows (Cursor Movement)
Viewed 1492 times since Thu, Oct 13, 2011
Selecting Text
Viewed 1366 times since Thu, Oct 20, 2011
|
|
|

English