Minimize
Blogger List

Johannes Ahrends
Toad and Oracle

Ben Boise
Toad SC Discussions

Kevin Dalton
Benchmark Factory

Steven Feuerstein
PL/SQL Obsession

Devin Gallagher
Toad SC discussions

Stuart Hodgins
JProbe Discussions

  Henrik "Mauritz" Johnson
Toad Tips & Tricks on the "other" Toads
  Mark Kurtz
Toad SC discussions
  Michael Lumbard
Toad SC discussions
Daniel Norwood
Toad for Data Analysts,
Toad Extension for Visual Studio
Debbie Peabody
Toad for Data Analysts
Gary Piper
Toad Reports Manager
John Pocknell
Toad for Oracle, JProbe
Kuljit Sangha
Toad SC discussions
Bert Scalzo Indicates Oracle ACE status
Toad for Oracle, Data Modeling, Benchmarking
Jeff Smith
Toad product family
Richard To
SQL Optimization
Jim Wankowski
DB2 - LUW and z/OS
John Weathington
  Toad Data Modeler Opens in a new window
Data Modeling
 
  Real Automated Code Testing for Oracle
Quest Code Tester blog
 
Minimize
Blog Tags
toad for oracle (122)
oracle (62)
plsql (46)
sql optimization (37)
toad for data analysts (28)
code tester (19)
toad for ibm db2 (13)
automation (11)
batch optimizer (10)
virtualization (10)
schema browser (9)
toad for sql server (9)
data grid (8)
sql (8)
sql editor (8)
toad data modeler (8)
benchmark factory (7)
excel (7)
query builder (7)
report manager (7)
toad extension (7)
visual studio (7)
11g (6)
configuration (6)
freeware (6)
health check (6)
vmware (6)
connect (5)
dba module (5)
er diagrammer (5)
F4 (5)
linux (5)
refactoring (5)
spotlight (5)
unicode (5)
compare (4)
debugger (4)
export (4)
formatter (4)
make code (4)
rman (4)
strip code (4)
benchmark (3)
bfscript (3)
bulk collect (3)
code templates (3)
code xpert (3)
database browser (3)
db2 (3)
notebook (3)
oem (3)
RAC (3)
session browser (3)
speed (3)
sql optimizer (3)
toad for mysql (3)
tpc-c (3)
9.7 (2)
alert log (2)
app designer (2)
awr (2)
code insight (2)
code snippets (2)
collection (2)
compare and sync (2)
compliance (2)
data generator (2)
data warehouse (2)
database explorer (2)
database monitor (2)
explain (2)
forall (2)
ftp (2)
group execute (2)
handbook (2)
installation (2)
job scheduler (2)
multi-task (2)
nested table (2)
os command (2)
profiler (2)
recovery (2)
release history (2)
save as (2)
schema compare (2)
sql recall (2)
stats pack (2)
subversion (2)
team coding (2)
trace file browser (2)
while loop (2)
10g (1)
64 bit (1)
7zip (1)
action (1)
addm (1)
alter (1)
ansi join (1)
array (1)
ccleaner (1)
code coverage (1)
code road map (1)
CRON (1)
cursor for loop (1)
data browser (1)
data subset (1)
database probe (1)
dbms_flashback (1)
dbms_profiler (1)
ddl (1)
feuerstein (1)
filezilla (1)
flash drive (1)
flow control (1)
for loop (1)
group policy manager (1)
hints (1)
import (1)
index (1)
inheritance (1)
invoker rights (1)
ipad (1)
java (1)
latency (1)
log switch (1)
logical model (1)
ltrim (1)
master-detail browser (1)
monitor (1)
multi-select (1)
naming standards (1)
network (1)
object explorer (1)
OEBS (1)
package (1)
parser (1)
partitioning (1)
performance (1)
pragma (1)
project manager (1)
RAT (1)
revo (1)
REXEC (1)
schema report (1)
script manager (1)
search (1)
set operator (1)
sga (1)
slow (1)
sonarsource (1)
source control (1)
space projection (1)
sql monitor (1)
sql navigator (1)
sql script (1)
sql tracker (1)
sql*plus (1)
standards (1)
statistics (1)
stored procedure (1)
string parser (1)
sub-model (1)
sub-type (1)
synch (1)
synchback (1)
TELNET (1)
toad (1)
trace (1)
unit test (1)
unix (1)
usb (1)
utility (1)
v10 (1)
v9.5 (1)
version control (1)
waits (1)
workload replay (1)
workspace (1)
xml (1)
 
WELCOME, GUEST
 
 

Blogs
Toad and Database Commentaries

Toad World blogs are a mix of insightful how-tos from Quest experts as well as their commentary on experiences with new database technologies.  Have some views of your own to share?  Post your comments!  Note:  Comments are restricted to registered Toad World users.

Do you have a topic that you'd like discussed?  We'd love to hear from you.  Send us your idea for a blog topic.


Jun 8

Written by: John Pocknell
Monday, June 08, 2009  RssIcon

Part 1 of this 2-part blog asked “what is the problem Unicode seeks to address?” as well as “what is Unicode” and “how is it implemented by Oracle into their databases”
 
In this blog, I hope to be able to help you understand how Unicode is handled on your Windows PC/notebook and how to configure your PC/notebook in order to be able to effectively use Unicode in Toad.
 
What settings do I need to change on my PC?
 
Make sure the NLS_LANG value on your Oracle client (registry or environment variable) matches your Windows client character set.  (This has nothing to do with the database you are connecting to) 
 
Oracle searches for an NLS_LANG value in this order:
  1. User environment variable
  2. System environment variable
  3. Registry entry for the active Home (Note: Each Oracle Home has its own NLS_LANG value – see Toad Advisor below)
If no NLS_LANG value is found, the value AMERICAN_AMERICA.US7ASCII is used, not the Oracle database character set as is often believed. (Oracle states explicitly that this is a myth). The registry location is HKLM\SOFTWARE\ORACLE\ “NLS_LANG” key. The registry should not be supplanted by environment variables, but rather environment variables should be used judiciously to override the registry value only when needed.
 
To discover your active Windows character set, look in Regional Settings > Advanced > “Language for non-Unicode programs”, or in the Registry at HKLM\SYSTEM\CurrentControlSet\Control\Nls\CodePage “ACP” value.

There is a new warning in Toad Advisor telling you if NLS_LANG does not properly match your character set.  (a connection is required only so an active Home is set, which is used to pull the NLS_LANG for that Home)

NLS_LANG on your client computer is how Oracle knows what your client character set is, so it knows if it needs to transform characters going back & forth to your client (carried out using SQL*Net).  This seems to be one of the most misunderstood things out there related to getting Unicode properly working in Oracle.  It has nothing to do with NLS_CHARACTERSET, which is the database character set.  If NLS_LANG matches the database character set, the Oracle client will not do any transformation.  But that’s not our concern anyway.
 
How does Oracle manage multiple clients accessing multiple databases? 

The database has a value indicating its character set and the client has a value indicating the active character set of Windows. Before a client application sends data to the database, if these two values are different, Oracle on the client transforms the characters being sent to the database so they may be encoded properly. This process usually occurs on the client for performance:

 Here is a more detailed look at what occurs. When a client application starts and creates a session, the database character set is read and the client character set is determined through NLS_LANG. If these values differ, when data is transferred to and from the database it gets transformed into the proper bytes for the destination:

It should be obvious that NLS_LANG is concerned with the client, not the database. You cannot query the database to discover the value of NLS_LANG, you cannot set NLS_LANG on the database. NLS_LANG is all about the client character set. It is used by the Oracle client software before sending character data to the database so the database receives proper values.
 
What happens if I want to connect to different databases which have different character sets?

Here’s an example:

 

In this case you would create two Oracle Homes, one to access the Asian character set database and one to access the American (Latin) character set database. Alternatively, before running an application such as SQL*Plus, a local NLS_LANG environment variable can be assigned. An easy way to manage this would be to create two different batch files to launch two different types of SQL*Plus sessions, one for each database:
 
AsianSQLPlusSession.bat
set NLS_LANG=AMERICAN_AMERICA.ZHT16MSWIN950
sqlplusw.exe
 
AmericanSQLPlusSession.bat
set NLS_LANG=AMERICAN_AMERICA.WE8MSWIN1252
sqlplusw.exe
 
What else do I need to do? 

Toad 10 will be a fully Unicode application.  However, the various tools Toad calls – Oracle Utilities, SQL Plus and so forth – are not.  This means they require NLS_LANG to be properly set, and in most cases, requires the data they work with to be available in the client character set.  For example, if you have NLS_LANG set to a Big5/Double Byte character set with Chinese, (which means your Windows “Language for non-Unicode Programs” – XP’s silly name for your character set - is set to Chinese) you can “Select Chinese Stuff from dual” and run that in SQL Plus.  You cannot do “Select Turkish stuff from dual” because the Turkish characters won’t be available in your client character set.  SQL Plus will complain.  This *will* work in Toad, however, because the OCI has a Unicode switch flipped on via Toad, and so all data is Unicode-encoded on the client, regardless of the various encodings of the database.

For the purposes of saving data from Toad to an external file, the Data Grid “Export Dataset” (replacing the existing “Save As”) dialog has a new drop down – ‘Encoding’ – where you can chose an encoding to use.  If you use the typical choice – ANSI – no header/BOM (Byte Order Mark) (see Part 1) will be used and things operate as they always have.

In the case where another encoding such as UTF-8 is selected, these files will potentially contain Unicode data so this drop down ensures they can be properly saved if your windows character set is different than the character data in the file.  Therefore, external files will contain the UTF-8 BOM of EF BB BF (hex).  These are non-visible characters which Unicode applications require in order to know what encoding the file is in.  If you are using Toad-created files in other applications and they have a hiccup somehow, it is possible they don’t know what to do with that BOM.  I can’t think of any cases perhaps beyond Editor files where this could potentially be an issue for your other applications. 

What does it mean if I see gobbledygook displayed?

If you see squares somewhere in a window, you are not using a Unicode-friendly font.  Try a Unicode font.  In general, squares mean the data is fine, the font just can’t show it.  Question marks generally mean the data has been corrupted, for example, one byte of a double byte character was lost, so the character is no longer known.1
 
How do I try out Unicode support in Toad?
 
For those of you who are keen to see Unicode support working in Toad right now – ahead of the official Toad 10.0 release, you can download a beta version of Toad version 9.8 from www.toadsoft.com/beta.html . You need to have a commercial license of Toad for Oracle and have it installed on the desktop on which you wish to install the Toad 9.8 beta.
 
To give you an idea of what viewing Unicode data will look like in Toad, have a look at this recent blog (containing lots of screenshots) by Jeff Smith, our Toad technical expert: http://www.toadworld.com/Community/Blogs/tabid/67/EntryID/371/Default.aspx#ViewComment_381
 
Please note: Toad 9.8 is a Quest Software internal release only focusing on testing the conversion to Delphi 2009, the upgrade of all 3rd party components and Unicode testing; it will (mostly) exclude any new functionality.
 
We welcome your feedback on this beta. Please subscribe to our Toad for Oracle Beta community group, by clicking this link: http://tech.groups.yahoo.com/group/toadbeta/
 
Thank you again to Mark Lerch from the Toad Development Team for his kind permission to reproduce articles he has written on the subject.
 


1Mark Lerch – Toad development

Search Blog Entries