CINXE.COM
When to Convert to Unicode
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"> <html> <head> <title>When to Convert to Unicode</title> <meta http-equiv="Content-Type" content="text/html; charset=UTF-8"> <meta name="keywords" content="unicode, convert, keyboard, mapping, input, encode, legacy, custom"> <link rel="stylesheet" href="/cms/assets/misc/css/default.css" type="text/css"> <link rel="stylesheet" href="/cms/sites/nrsi/themes/default/_css/default.css" type="text/css"> <style type="text/css"> <!-- A.GlobalNavLink, A.GlobalNavLink:visited { color: #FFFF00; font-size: smaller; font-weight: bold; } --> </style> <!-- 2023-05-25 PKM Added for Google Analytics 4 --> <!-- Google tag (gtag.js) --> <script async src="https://www.googletagmanager.com/gtag/js?id=G-FVXRGR2Q9V"></script> <script> window.dataLayer = window.dataLayer || []; function gtag(){dataLayer.push(arguments);} gtag('js', new Date()); gtag('config', 'G-FVXRGR2Q9V'); </script> <title>When to Convert to Unicode</title> </head> <body style="padding:0; margin:0"> <style> .archive_notice { /* box-shadow: black 0pt 4pt 20px -8px inset; */ display: block; background-color: orange; font-size: 12pt; font-style: normal; font-weight: lighter; line-height: 100%; padding: 5pt; text-align: center; width: auto; } form { display: none } .webform::before { content: "Forms are disabled on this static version of the site."; display: block; width: fit-content; } </style> <div class="archive_notice"> This is an archive of the original scripts.sil.org site, preserved as a historical reference. Some of the content is outdated. Please consult our other sites for more current information: <a href="https://software.sil.org">software.sil.org</a>, <a href="https://scriptsource.org">ScriptSource</a>, <a href="https://silnrsi.github.io/FDBP/">FDBP</a>, and <a href="https://silnrsi.github.io/silfontdev/">silfontdev</a> </div> <table width="100%" height="100%" border="0" cellspacing="0" cellpadding="0"> <tr> <td style="background: #0068a6; padding-left:20; padding-top:10; white-space:nowrap;" width="110" valign="top"> <p><a href="http://www.sil.org/"> <!-- <img src="/cms/sites/nrsi/themes/default/_media/SIL_logo_left_column.gif" width="86" height="80" border="0"> --> <img src="/cms/sites/nrsi/themes/default/_media/SIL_Logo_TM_Blue_2014.png" width="85" height="95" border="0" alt=""> </a><br><br></p> <p class="Cat1"><a class="Cat1" href="/cms/scripts/page.php%3Fid%3Dhome%26site_id%3Dnrsi.html">Home</a></p> <p class="Cat1"><a class="Cat1" href="/cms/scripts/page.php%3Fid%3Dcontactus%26site_id%3Dnrsi.html">Contact Us</a></p> <p class="Cat1"><a class="Cat1" href="/cms/scripts/page.php%3Fid%3Dgeneral%26site_id%3Dnrsi.html">General</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Dbabel%26site_id%3Dnrsi.html">Initiative B@bel</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Dwsi_guidelines%26site_id%3Dnrsi.html">WSI Guidelines</a></p> <p class="Cat1"><a class="Cat1" href="/cms/scripts/page.php%3Fid%3Dencoding%26site_id%3Dnrsi.html">Encoding</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Dencodingprinciples%26site_id%3Dnrsi.html">Principles</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Dunicode%26site_id%3Dnrsi.html">Unicode</a></p> <p class="Cat3"><a class="Cat3" href="/cms/scripts/page.php%3Fid%3Dunicodetraining%26site_id%3Dnrsi.html">Training</a></p> <p class="Cat3"><a class="Cat3" href="/cms/scripts/page.php%3Fid%3Dunicodetutorials%26site_id%3Dnrsi.html">Tutorials</a></p> <p class="Cat3"><a class="Cat3" href="/cms/scripts/page.php%3Fid%3Dunicodepua%26site_id%3Dnrsi.html">PUA</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Dconversion%26site_id%3Dnrsi.html">Conversion</a></p> <p class="Cat3"><a class="Cat3" href="/cms/scripts/page.php%3Fid%3Dencconvres%26site_id%3Dnrsi.html">Resources</a></p> <p class="Cat3"><a class="Cat3" href="/cms/scripts/page.php%3Fid%3Dconversionutilities%26site_id%3Dnrsi.html">Utilities</a></p> <p class="Cat4"><a class="Cat4" href="/cms/scripts/page.php%3Fid%3Dteckit%26site_id%3Dnrsi.html">TECkit</a></p> <p class="Cat3"><a class="Cat3" href="/cms/scripts/page.php%3Fid%3Dconversionmaps%26site_id%3Dnrsi.html">Maps</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Dencodingresources%26site_id%3Dnrsi.html">Resources</a></p> <p class="Cat1"><a class="Cat1" href="/cms/scripts/page.php%3Fid%3Dinput%26site_id%3Dnrsi.html">Input</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Dinputprinciples%26site_id%3Dnrsi.html">Principles</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Dinpututilities%26site_id%3Dnrsi.html">Utilities</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Dinputtutorials%26site_id%3Dnrsi.html">Tutorials</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Dinputresources%26site_id%3Dnrsi.html">Resources</a></p> <p class="Cat1"><a class="Cat1" href="/cms/scripts/page.php%3Fid%3Dtypedesign%26site_id%3Dnrsi.html">Type Design</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Dtypedesignprinciples%26site_id%3Dnrsi.html">Principles</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Dfontdesigntools%26site_id%3Dnrsi.html">Design Tools</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Dfontformats%26site_id%3Dnrsi.html">Formats</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Dtypedesignresources%26site_id%3Dnrsi.html">Resources</a></p> <p class="Cat3"><a class="Cat3" href="/cms/scripts/page.php%3Fid%3Dfontdownloads%26site_id%3Dnrsi.html">Font Downloads</a></p> <p class="Cat3"><a class="Cat3" href="/cms/scripts/page.php%3Fid%3Dfontdownloadsgentium%26site_id%3Dnrsi.html">Gentium</a></p> <p class="Cat3"><a class="Cat3" href="/cms/scripts/page.php%3Fid%3Dfontdownloadsdoulos%26site_id%3Dnrsi.html">Doulos</a></p> <p class="Cat3"><a class="Cat3" href="/cms/scripts/page.php%3Fid%3Dfontdownloadsipa%26site_id%3Dnrsi.html">IPA</a></p> <p class="Cat1"><a class="Cat1" href="/cms/scripts/page.php%3Fid%3Drendering%26site_id%3Dnrsi.html">Rendering</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Drenderingprinciples%26site_id%3Dnrsi.html">Principles</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Drenderingtechnologies%26site_id%3Dnrsi.html">Technologies</a></p> <p class="Cat3"><a class="Cat3" href="/cms/scripts/page.php%3Fid%3Drenderingopentype%26site_id%3Dnrsi.html">OpenType</a></p> <p class="Cat3"><a class="Cat3" href="/cms/scripts/page.php%3Fid%3Drenderinggraphite%26site_id%3Dnrsi.html">Graphite</a></p> <p class="Cat2"><a class="Cat2" href="/cms/scripts/page.php%3Fid%3Drenderingresources%26site_id%3Dnrsi.html">Resources</a></p> <p class="Cat3"><a class="Cat3" href="/cms/scripts/page.php%3Fid%3Dfontfaq%26site_id%3Dnrsi.html">Font FAQ</a></p> <p class="Cat1"><a class="Cat1" href="/cms/scripts/page.php%3Fid%3Dlinks%26site_id%3Dnrsi.html">Links</a></p> <p class="Cat1"><a class="Cat1" href="/cms/scripts/page.php%3Fid%3Dglossary%26site_id%3Dnrsi.html">Glossary</a></p> <br> </td> <td valign="top" style="padding:0" xwidth="650"> <div style="background: #6699CC url(/cms/sites/nrsi/themes/default/_media/home_banner_gradient.gif) no-repeat right; padding:0 0 0 25; height:36px; margin:0; color:#FFFFFF;"> <p style="font-family:Times New Roman; font-size:25px; color:#FFFFFF; padding:10 0 0 0; margin:0 0 0 0">Computers & Writing Systems</p> </div> <div style="padding:0 0 0 0; background-color:#000000; color:#FFFFFF"> <table width='100%'> <tr> <td style="padding: 0 0 0 25px"><a class="GlobalNavLink" href="http://www.sil.org/">SIL HOME</a> | <a class="GlobalNavLink" href="https://software.sil.org/products/">SIL SOFTWARE</a> | <a class="GlobalNavLink" href="/support.html">SUPPORT</a> | <a class="GlobalNavLink" href="https://www.givedirect.org/donate/?cid=13536">DONATE</a> | <a class="GlobalNavLink" href="/privacy-policy.html">PRIVACY POLICY</a> </td> <td align='right' width='20%'> <script async src="https://cse.google.com/cse.js?cx=0760bf09a6bff4b0c"></script><style>.gsc-control-cse {padding: 0.6em; min-width: 10em; width: 18em; max-width: 20em} form.gsc-search-box {display: unset;}</style><div class="gcse-search"></div> </td> </tr> </table> </div> <div style="padding:0 25 25 25"> <p class='CategoryPath'>You are here: <a class='CategoryPath' href='/cms/scripts/page.php%3Fid%3Dencoding%26site_id%3Dnrsi.html'>Encoding</a> > <a class='CategoryPath' href='/cms/scripts/page.php%3Fid%3Dconversion%26site_id%3Dnrsi.html'>Conversion</a><br> Short URL: <a href='/utconvert2unicode.html'>https://scripts.sil.org/UTConvert2Unicode</a></p> <!-- --> <!-- <div class='Warning' > <p class='Warning_heading' > Site unavailability </p> <p> Due to essential repairs, this website may be unavailable at times during September 6 (Tue) and 7 (Wed). We apologize for the inconvenience. </p> </div> --> <p><span class='item_series'>When to Convert to Unicode</span></p><h1>When to Convert to Unicode </h1> <p> <span class='author_date_hits'>Albert Bickford, Jim Brase and Lorna Priest, 2007-05-11</span></p><div class='Sidebar'><p><a href='/cms/scripts/page.php%3Fid%3Dutconvert2unicode%26site_id%3Dnrsi.html'>When to Convert to Unicode</a></p> <p><a href='/cms/scripts/page.php%3Fid%3Dutconvertq1%26site_id%3Dnrsi.html'>What is Unicode? and Why do I need to use Unicode?</a></p> <p><a href='/cms/scripts/page.php%3Fid%3Dutconvertq2%26site_id%3Dnrsi.html'>Is Unicode ready for you?</a></p> <p><a href='/cms/scripts/page.php%3Fid%3Dutconvertq3%26site_id%3Dnrsi.html'>Are there fonts available that will work for you?</a></p> <p><a href='/cms/scripts/page.php%3Fid%3Dutconvertq4%26site_id%3Dnrsi.html'>Can you type all the characters you need?</a></p> <p><a href='/cms/scripts/page.php%3Fid%3Dutconvertq5%26site_id%3Dnrsi.html'>Does available Unicode software meet your needs?</a></p> <p><a href='/cms/scripts/page.php%3Fid%3Dutconvertq6%26site_id%3Dnrsi.html'>Is anyone requiring you to use Unicode?</a></p> <p><a href='/cms/scripts/page.php%3Fid%3Dutconvertq7%26site_id%3Dnrsi.html'>Is software going to force Unicode on you?</a></p> <p><a href='/cms/scripts/page.php%3Fid%3Dutconvertq8%26site_id%3Dnrsi.html'>Is it time to archive your data?</a></p> <p><a href='/cms/scripts/page.php%3Fid%3Dutconvertq9%26site_id%3Dnrsi.html'>Is the technical expertise to do the conversion available to you?</a></p> <p><a href='/cms/scripts/page.php%3Fid%3Dutconvertq10%26site_id%3Dnrsi.html'>Are you ready to learn about how to use Unicode?</a></p> <p><a href='/cms/scripts/page.php%3Fid%3Dutconvertq11%26site_id%3Dnrsi.html'>Do your colleagues use Unicode?</a></p> <p><a href='/cms/scripts/page.php%3Fid%3Dutconvertq12%26site_id%3Dnrsi.html'>Are you willing and able to “straddle the fence”?</a></p> </div><p></p> <p>Linguists and others who work with minority language data often still use older special character systems (fonts and keyboards) that work only for certain languages. These older (‘legacy’) systems are gradually being replaced by Unicode (<a href='/cms/scripts/page.php%3Fid%3Dutconvertq1%26site_id%3Dnrsi.html'>What is Unicode?</a>), which attempts to cover all languages in one unified system. </p> <p>So, you are probably wondering: When is the right time to convert to Unicode? That’s what this article is about. The question actually has two parts:</p> <ul class='dListUnordered'> <li>When is it possible to convert?</li> <li>When is it necessary to convert?</li> </ul> <p>There is usually a period of time—a window of opportunity—between the time when it becomes possible and when it becomes necessary. Ideally, you should plan ahead so you can convert your data during that window. You want to find a time when it is convenient to do so, before a conversion is forced on you at perhaps a very inconvenient time. Hopefully your window of opportunity will be large: it will be reasonable for you to convert your data to Unicode at least a year before it becomes necessary. </p> <p>This article will help you recognize whether you and your language data have entered that window of opportunity. It will also help you predict whether you are approaching the end of the window: the point where a conversion is necessary. It will help you decide whether <span class='Em'>now</span> is the time to convert. </p> <p>Likewise, if you are just starting to work with a language, you may need to decide which special character system to use. From now on, most people should plan to start with Unicode, but there may be a few situations where older fonts and special character systems are necessary.</p> <p>This article is written mostly for a general audience, although some of the links get more detailed and technical. At some point, you may need the advice of someone with special technical knowledge to decide when to convert, and most people need help to actually do the conversion and get set-up to use Unicode. This article will also help you decide when you need to get that help, and provide guidance to the technicians who are advising you. Because it is written for two audiences—ordinary users and technicians—the words “you” and “your” sometimes refer to the users/owners of the data and sometimes to the technicians assisting them. You decide whether they apply to you personally.</p> <p>We start by listing the general questions you need to consider. You will need to click on the links to get to the sections that explore these questions in more detail.</p> <a name='basicQ'></a> <a name='f41af467'></a> <h2>Basic questions to consider</h2> <p>In order to answer the two basic questions above and define your window of opportunity, you need to ask yourself several more specific questions. These are listed in the following chart, with links to later sections that discuss them in more detail. </p> <div class='dDataViewTable'> <div class='bd'> <table class='Table_Default'> <tr><th class='Table_Header_Row' align='left' >When are you ready for Unicode?</th><th class='Table_Header_Row' align='left' >When is Unicode going to be forced on you?</th></tr> <tr><td class='Table_Odd_Row' align='left' >If you can answer “yes” to <span class='Em'>all</span> of the questions in this column, then you have entered the window of opportunity—you are ready to convert your data.</td> <td class='Table_Odd_Row' align='left' >If you answer “yes” to <span class='Em'>any</span> of the questions in this column, you have reached the end of the window of opportunity—Unicode has become a necessity for you.</td> </tr> <tr><td class='Table_Even_Row' align='left' ><ul class='dListUnordered'> <li>Is Unicode ready for you and your data? Does it contain all the special characters you need? (see section <a href='/cms/scripts/page.php%3Fid%3Dutconvertq2%26site_id%3Dnrsi.html'>Is Unicode ready for you?</a>)</li> <li>Are there fonts that contain the characters you need, and are suitable in all other respects? (see section <a href='/cms/scripts/page.php%3Fid%3Dutconvertq3%26site_id%3Dnrsi.html'>Are there fonts available that will work for you?</a>)</li> <li>Are there input methods (e.g. keyboards) that make it easy to type the characters you need? (see section <a href='/cms/scripts/page.php%3Fid%3Dutconvertq4%26site_id%3Dnrsi.html'>Can you type all the characters you need?</a>)</li> <li>Is the technical expertise necessary to do the conversion available to you? (see section <a href='/cms/scripts/page.php%3Fid%3Dutconvertq9%26site_id%3Dnrsi.html'>Is the technical expertise to do the conversion available to you?</a>)</li> <li>Are you and others who work with your data ready to learn about how to use Unicode? (see section <a href='/cms/scripts/page.php%3Fid%3Dutconvertq10%26site_id%3Dnrsi.html'>Are you ready to learn about how to use Unicode?</a>)</li> </ul> </td> <td class='Table_Even_Row' align='left' ><ul class='dListUnordered'> <li>Are you working under the authority of an organization that requires you to use Unicode? (see section <a href='/cms/scripts/page.php%3Fid%3Dutconvertq6%26site_id%3Dnrsi.html'>Is anyone requiring you to use Unicode?</a>)</li> </ul> </td> </tr> <tr><td class='Table_Odd_Row' align='left' ><ul class='dListUnordered'> <li>Consider the different things that you want to do with your linguistic data; will your operating system and your application software allow you to do those tasks if you use Unicode? Or, to put it another way, does available software that handles Unicode do everything that you need it to? (see section <a href='/cms/scripts/page.php%3Fid%3Dutconvertq5%26site_id%3Dnrsi.html'>Does available Unicode software meet your needs?</a>)</li> </ul> </td> <td class='Table_Odd_Row' align='left' ><ul class='dListUnordered'> <li>Consider all the software that you need to use: Does any of it require Unicode? (see section <a href='/cms/scripts/page.php%3Fid%3Dutconvertq7%26site_id%3Dnrsi.html'>Is software going to force Unicode on you?</a>)</li> </ul> </td> </tr> <tr><td class='Table_Even_Row' align='left' ><ul class='dListUnordered'> <li>Do you have <span class='Em'>no</span> need to exchange data with other people who are using your ‘legacy’ character set? (see section <a href='/cms/scripts/page.php%3Fid%3Dutconvertq11%26site_id%3Dnrsi.html'>Do your colleagues use Unicode?</a>)</li> </ul> </td> <td class='Table_Even_Row' align='left' ><ul class='dListUnordered'> <li>Do you need to exchange data with other people who use Unicode? (see section <a href='/cms/scripts/page.php%3Fid%3Dutconvertq11%26site_id%3Dnrsi.html'>Do your colleagues use Unicode?</a>)</li> <li>In particular, do you need to archive your data as Unicode for long-term preservation? (see section <a href='/cms/scripts/page.php%3Fid%3Dutconvertq8%26site_id%3Dnrsi.html'>Is it time to archive your data?</a>)</li> </ul> </td> </tr> <tr><td class='Table_Odd_Row' align='left' colspan='2'><ul class='dListUnordered'> <li>Are you willing and able to “straddle the fence”—working with older systems and Unicode at the same time? (see section <a href='/cms/scripts/page.php%3Fid%3Dutconvertq12%26site_id%3Dnrsi.html'>Are you willing and able to “straddle the fence”?</a>)</li> </ul> </td> </tr> </table> </div> </div> <p class='Caption'>Basic questions to consider</p> <p>This table may provide enough detail for some people to decide whether they are ready to convert to Unicode. If so, you can stop here and get on with converting to Unicode. But, if you need more details, read the linked pages.</p> <a name='801ab246'></a> <h2>Download</h2> <p>You may download this (including linked pages) as one whole document.</p> <table border='0' cellpadding='8' cellspacing='0'><tr><td align='left' valign='top'><img src='/cms/assets/icons/type-pdf.gif' border='0' width='32' height='32' style='margin:1px 5px 0 0'></td><td align='left' valign='top'><small><b>When to Convert to Unicode - basic questions</b><br>Albert Bickford, Jim Brase, Lorna Priest, 2007-05-11<br><a href='/cms/sites/nrsi/download/utc2updf/WhentoConverttoUnicode.pdf'>Download "WhentoConverttoUnicode.pdf"</a>, Acrobat PDF document, 285KB [5124 downloads] </small></td></tr></table><p>With the exception of the Table of Contents, most links in the document will take you to the website, rather than to a point in the document.</p> <hr> <p><small>© 2003-2024 <a href='http://www.sil.org/' target='_blank'>SIL International</a>, all rights reserved, unless otherwise noted elsewhere on this page.<br> Provided by SIL's Writing Systems Technology team (formerly known as NRSI). Read our <a href="/privacy-policy.html">Privacy Policy</a>. <a href='/support.html'>Contact us here.</a></small></p> </div> </td> </table> </body> </html>