Tuesday, February 28, 2012

Google Transliteration IME is available for 22 different languages

Google Transliteration IME is available for 22 different languages

Google Transliteration IME is an input method editor which allows users to enter text in one of the supported languages using a roman keyboard. Users can type a word the way it sounds using Latin characters and Google Transliteration IME will convert the word to its native script. Note that this is not the same as translation -- it is the sound of the words that is converted from one alphabet to the other, not their meaning. Converted content will always be in Unicode. For example, typing "hamesha" transliterates into Hindi as: Hindi transliteration example and typing "salaam" transliterates into Persian as: Farsi transliteration example

Google Transliteration IME is available for 22 different languages - Amharic, Arabic, Bengali, Farsi (Persian), Greek, Gujarati, Hebrew, Hindi, Kannada, Malayalam, Marathi, Nepali, Oriya, Punjabi, Russian, Sanskrit, Serbian, Sinhalese, Tamil, Telugu, Tigrinya and Urdu.

Sample status window and the edit window for Hindi IME are shown below. For simplicity, Hindi IME is used as an example at most places in this help page.
Hindi IME

Install

To install, download (If you are on a 64-bit Windows, make sure you have selected the 64-bit radio button in the download page) and run the installer and follow the instructions on screen.

  • More than one language IME can be installed on the same client machine.
  • Requirements are Windows 7/Vista/XP 32-bit/64-bit operating systems.
  • Only one instance of IME will be installed per machine but the configuration is done for each user using IME.

    Install Window

    Install Window

    Install Window

Uninstall

To uninstall, follow these steps:
  1. Click on the "Start" menu.
  2. Select "Control Panel" and then go to "Add or Remove Programs".
  3. In the program list, select the "Google Input", click on "Change/Remove" button.
  4. By default, "Retain my personal data with settings" option will be checked, which will not clean your personal customizations like display font, user correction cache etc., as part of uninstall. Uncheck the option, if you want to clean all your personal settings information.
  5. In the "Uninstall Google Input" dialog box, click "Yes".

    Uninstall Window

If you want to use IME with any application like notepad, you must first open the application and then open our IME. IME can be opened by clicking on the language bar on the desktop and then selecting the IME language icon or through a shortcut key if it's already configured. To close IME, you can change the language in language bar or try right clicking on the current application to see whether it supports 'Close IME' popup menu option or by closing the current application.

Displaying Language Bar

  • In desktop, right click on tool bar and select Toolbars -> Language bar

    Language Bar

Enabling Language Bar

If the 'Language bar' option is not visible in 'Toolbars', then it needs to be enabled through control panel:

Windows 7/Vista
  1. Control Panel -> Regional and Language Options -> Keyboard and Languages tab
  2. Click on Change keyboards... button to open Text services and input languages dialog
  3. Navigate to Language Bar tab
  4. Enable the radio button Docked in the taskbar under Language Bar section
  5. Apply all settings and try to display language bar as mentioned in previous section.
Windows XP
  1. Go to Control Panel -> Regional and Language Options -> Languages tab -> Text services and input languages (Details) -> Advanced Tab
  2. Make sure that under System configuration, option Turn off advanced text services is NOT checked.
  3. Go to Control Panel -> Regional and Language Options -> Languages tab -> Text services and input languages (Details) -> Settings Tab
  4. Click Language Bar
  5. Select Show the Language bar on the desktop. Click OK.
  6. If you are installing the IME for East Asian language or Right-To-Left language, go to Control Panel -> Regional and Language Options -> Languages Tab
  7. Make sure that options Install files for complex scripts and right to left languages and Install files for East Asian languages are checked in the checkboxes. This requires installation of system files and the system will prompt to insert the Operating System Disc.
  8. Apply all settings and try to display language bar as mentioned in previous section.

IME Shortcut

A shortcut key sequence can be applied to the IME as follows which can be used to quickly enable IME for any in-focus application:

Windows 7/Vista
  1. Control Panel -> Regional and Language Options -> Keyboard and Languages tab
  2. Click on Change keyboards... button to open Text services and input languages dialog
  3. Navigate to Advanced Key Settings tab
  4. If Google Input is not listed in Installed Services box, then click Add and in Add Input language dialog box, go to the language for which you want to enable IME in the languages tree and expand the node. Check the checkbox next to Google Input in the list.
  5. In Hot keys for input languages Select To - Google Input
  6. Press Change Key Sequence
  7. Select Enable Key Sequence
  8. Select option like Left ALT + SHIFT + Key 1
  9. Apply all changes
  10. Now opening an application like notepad and pressing Left ALT + SHIFT + Key 1 should open the IME.
Windows XP
  1. Control Panel -> Regional and Language Options -> Languages tab -> Text services and input languages (Details) -> Settings Tab
  2. If either or Google Input not listed in Installed Services box, then click Add and in Add Input language dialog box, select in Input language and Google Input in Keyboard layout/IME. Click OK
  3. Press Key Settings
  4. In Hot keys for input languages, Select Switch to -Google Input
  5. Press Change Key Sequence
  6. Select Enable Key Sequence
  7. Select option like Left ALT + SHIFT + Key 1
  8. Apply all changes
  9. Now opening an application like notepad and pressing Left ALT + SHIFT + Key 1 should open the IME.

Status Window

Whenever you enable our IME for an application either through a shortcut or through language bar, its status window will be visible on the screen. By default it will be placed at the bottom right corner of your desktop which can also be moved around. It is used for more configuration and controlling various features of our IME. It has three to five icons (depending on the language and features available) with leftmost being the application icon. Next to it are IME language indicator button, keyboard button (not present for Amharic and Tigrinya), canonical mode button (This button will be present if you have any canonical schemes associated with the IME) and menu button in that order.

Status Window

Edit Window

When the IME is enabled for an application like notepad and you start typing, IME displays the edit window with the typed text and the corresponding word choices in IME language. In the below example, user has typed 'googl' and the IME has displayed five choices numbered 1 to 5.

Edit Window

Navigation and Selection

By default the leftmost choice will be highlighted indicating active choice. Active choice can be changed by navigating other choices through BOTTOM-ARROW or TAB key - which moves the selection to the right; or through UP-ARROW or SHIFT+TAB keys - which moves the selection to left. To select one of the choices as the new word for your application, use ENTER key which inserts the active choice to the application at its current cursor position. Using SPACE or any other PUNCTUATION CHARACTER also inserts the active choice to the application along with the typed punctuation character. Exception will be when IME thinks that there is a better choice of word containing the punctuation character as part of the word itself. All the above controls only insert the active choice (highlighted) into the application. Another way to insert any choice even if it is not active is using its position number as: CTRL+.

Navigation

Word Completions

When you type a word, the choices displayed in edit window will be either in BLACK or BLUE color. Intuitive partial word choices for the typed content are displayed in BLACK color which will always be grouped on the left side. If there are any better dictionary-based word completions for the typed content, they are displayed in BLUE color and grouped on the right side.

Paging

In the example being discussed, IME displays only 5 choices in the edit window which is the page size setting of the edit window. It can be configured through the menu button of the status window. If there are more choices than the page size set, then it will be displayed in the next page. Page navigation can be done through the small arrow buttons at the right bottom corner of the edit window or through PageUP and PageDown keys. Even when you navigate the choices with arrow or tab keys, the control moves to the next or previous page from either end of the edit window.

Paging

Search

At anytime while typing in edit window, if you click the Google image on top right corner of the edit window, it triggers a web search for the active/highlighted choice in google.com. Search can also be triggered for non-active choice by right-clicking on any choice and then clicking the Search... drop-down button.

Search

User Cache

There will be scenarios where the leftmost choice is not the one you are looking for and you select some other choice (second choice in the below example for typed word 'program') which will be inserted into the application.
User Cache
IME remembers this selection for the current user and when the user next types 'program' again, the last selected choice will now be displayed as the leftmost choice. To share this information across all applications for a user, this is persisted at user level even after closing the application. But this persisting behavior can be disabled through the 'Disable User Cache' option from menu button of status window. Note that caching doesn't apply if you select word completions. It is only for word choices shown in BLACK color.

User Cache

Switch to English

If you want to add both roman alphabet (typically English) content and IME language content to any application, there is an easy way to switch between the two. In one mode IME gives you choices in its language and in another mode it just emits what you have typed. Toggling can be done by clicking the IME language button in status window or through shortcuts: CTRL+G or F12.

Toggle Language

Keyboard

When you don't get some words as expected choices for any rare complex words; you can use the flexible keyboard to input any possible word. You can open the keyboard by clicking the keyboard button on status window or through the shortcut CTRL+K. Mouse is used to enter characters from the IME keyboard which also triggers word completions in the edit window. Keyboard can be closed by clicking the keyboard button again on status window or through shortcut keys: CTRL+K or ESC. There is also option to enter Zero Width Joiner (ZWJ) and Zero Width Non Joiner (ZWNJ) characters.

Keyboard

Customization

You can customize many features through IME's menu. Options available through the menu popup in status window are:

  • Change or activate one of the available canonical schemes (Option available only if there is atleast one scheme in the Schemes directory).
  • Select the font and size to be used to display the choices in IME language inside edit window (Suggestion Font).
  • Select the font and size to be used to display the English/Roman characters inside edit window.
  • Set the page size for edit window to restrict its size through number of choices.
  • Enable or disable the persistence of user cache.
  • Add/Edit or Delete macros using Manage Macros option.
  • Information about the IME like version.
  • Help, linking to this page.
Customization

Special Cases

There are some special cases in how IME behaves for some characters in some languages:

  • Example in Arabic for special case of SPACE where it is part of the typed word.
    SPACE Special
  • Example in Arabic for special case of PUNCTUATION where it is part of the typed word.
    PUNCTUATION Special
  • Example in Greek for special case of CAPITALIZATION where only first character being in uppercase in input retains its casing.
    Camelcase Special
  • Example in Greek for special case of CAPITALIZATION where all uppercase word in input retains its casing.
    Camelcase Special
  • Example in Greek for special case of DIGITS where number is converted digit wise instead of its whole value.
    Digit Special

IME supports adding custom user defined macros. Macros are short character sequences mapped to a word of your choice. When you type a character sequence, if there is a word corresponding to that sequence in the user-defined macros collection then it will be shown as the first choice in the Edit Window. You can manage the macros by choosing the 'Manage Macros...' option from the IME's menu.

Macros Menu

When you select this option the 'Manage Macros' dialog opens, allowing you to add, modify and delete the set of macros.

Macros Dialog

Adding a Macro entry

Click on the 'Add' button. This will add an empty row to the Macros List. You can double click (or select the cell and press F2) on the cell to add the new Macro entry.
  • Macro Text should consist of only alpha-numeric characters and the length should not exceed 100 characters. Macro Text is case-sensitive.
  • Macro Target should not contain any spaces and the length should not exceed 50 characters.

Removing a Macro entry

Select the cells you want to delete (Use Shift+Click to select a range or Ctrl+Click to select multiple disconnnected cells) and click on the 'Delete' button. Note that you cannot delete single cells. If you delete a cell, the full row will be deleted.

Modifying and Saving Macro Entries
  • You can edit any cell by double clicking it (or selecting the cell and pressing F2 key).
  • You can cancel the edit anytime by pressing the 'Esc' key or clicking outside the cell. Press the 'Enter' key to complete the edit.
  • After all edits are completed you need to click on the 'Save' button to save your changes.
  • You can click on the 'Discard' button anytime to discard all the changes made so far (after the previous save action).

Once you have the Macros in place you can use them while typing. For example if 'Mon' is mapped to Macro Text in the Macros, then when you type 'Mon' you will see Macro Text as the first option.

Macros In Action

Transliteration IME supports adding custom transliteration schemes. Defining a scheme is done in a text file with a .scm extension. Once the scheme file is created you need to place it in the Schemes directory and when you start the IME the next time, it will automatically pick up the scheme files and provide you a menu option to choose one of the available schemes.

Canonical Schemes Menu
Defining Schemes

The Scheme files (.scm files) are text files and you can use your favourite text editor to edit them. The scheme file can be saved as UTF-8 text or UTF-16 text. A scheme file consists of two parts, header and the mapping rules section.
  • Header

    The scheme file header specifies multiple attributes. A sample header with all the attributes will look like,
    version: 1.0 name: ITRANS using classes class-delimiters: [ ] wildcard: # stop-char: _
    • version is a mandatory field. Currently we only support 1.0.
    • name is a mandatory field. The name specifies the name of the canonical scheme. The name should not contain more than 30 characters. This is the name displayed in the 'Schemes' menu. So a name descriptive of the scheme is useful. Here in the above example, the Name is set to 'ITRANS' indicating this scheme file implmenents ITRANS scheme (one of the popular schemes for Indic languages).
    • using classes is an optional line. Having this line will give you the ability to use classes (grouping of mappings under common name). Classes are explained below.
    • wildcard is an optional field. Wildcards are used in conjunction with classes
    • class-delimiters is an optional field. These are used in conjunction with classes.
    • stop-char is an optional field. The canonical transliteration engine by default looks for the longest prefix, matching a rule. If the user wants to break it in the middle, then a Stop char should be be used. For example, if 'a', 'i' and 'ai' all have valid mappings and user wants the engine to output mapping for 'a' followed by mapping for 'i', this can only be done by using the input sequence 'a_i' assuming '_' is the stop-char (using 'ai' will lead to the mapping for 'ai' and not what user expected).

  • Class specification

    You can define mutiple classes in your scheme file if you have specified the using classes line in the header. A class definition should look like,
    class ... rules
    The class-begin-delimiter, and class-end-delimiter are the ones specified in the header as mentioned above,
    class-delimiters:
    If you have not specified class-delimiters in the header (but using classes is defined) then '{' and '}' are used as the default class delimiters. The rule specification inside a class is same as rule specification outside the class.

  • Rule specification

    Rules are specified as,
    Rule prefix can consist of a sequence of ASCII characters and optionally a class specifier. A class specifier inside a rule should be of the form,
    name>
    If the Rule prefix has a class specifier, then the Rule target should use the wildcard. Rule Target consists of a sequence of characters in the target language of user's choice and a wildcard (when the Rule Prefix has a class specifier). The wildcard character is the one specified in the header as,
    wildcard:
    If no wildcard is specified in the header (but using classes is defined) '*' is used as the wildcard by default. A sample scheme file with a class (headers omitted),
    class sample { a 1 b 2 c 3 } a{sample} 1* {sample}x *2 ad 14 vd 24
    When the rules are processed each occurence of the class in the Rule Prefix is replaced by the prefixes defined in the class and the '*' (wildcard) in the target is replaced by the corresponding target in the class. So in the above example the rules get expanded into,
    aa 11 ab 12 ac 13 ax 12 bx 22 cx 32 ad 14 bd 24
    The first three rules correspond to the first rule in the rule file, the next three rules are for the second rule and the last two rules are same as the last two rules in the file because it does not use any class specifiers (Note that the rules inside the class are not added to the set of rules for the scheme). Now if you use this scheme file and type "abaxcxad", you will get the suggestion as "12123214" in the IME Window. If you are not using classes the scheme file is just a set of rules with the version and name specified in the header.
    Example of a scheme file with no classes,
    version: 1.0 name: Sample r1 target1 r2 target2 r3 target3
    Now when you use this Scheme file and type the sequence "r1r2r3" you will see the word "target1target2target3" as the suggested option.

Integrating Scheme file with IME

To integrate the scheme file with the IME, place the scheme file (with extension .scm) under the Schemes directory. The Schemes directory is present under the IME installation directory (It is usually under C:/Program Files/Google/Google Input/). Once the scheme file is placed you need to restart the IME and the IME will pick up the scheme file while loading. If there are any errors in the scheme file it will be displayed in a dialog so that you can fix them. If there are no errors, the Scheme will appear under the Schemes menu and you can activate it and start using it. Schemes can be activated in several ways,
  • Selecting the scheme from the Schemes menu.
  • Using the shortcut key Ctrl+M (If there are multiple schemes this will activate the last used Scheme or the first scheme if you are using Schemes for the first time)
  • Using the 'Toggle Canonical Scheme' button in the Status Window.

Your feedback is important and will help us improve the product. You can send your comments to transliteration-support+ime@google.com.


I do not see the language toolbar.
>>> Check configuration section.

The word I want does not appear in any of the choices.
>>> In case the word you are trying to type does not appear in the list of choices in all pages, you can use the Keyboard.

I see only boxes as choices. No character in IME language is displayed in edit widow.
>>> Make sure that you have installed a Unicode font like Arial Unicode MS supporting IME language and customize a font through IME menu.

How to update to a newer version?
>>> Google IME automatically updates to a newer version when one is released. The update process happens silently, whether or not you're using the IME at the time. If Google IME is open at the time of the update, you must close the IME and restart the system for the new version to take effect.