New York State Identification and Intelligence System

The New York State Identification and Intelligence System Phonetic Code, commonly known as NYSIIS, is a phonetic algorithm devised in 1970 as part of the New York State Identification and Intelligence System (now part of the New York State Division of Criminal Justice Services). It is reported to be 2.7% more accurate than the traditional Soundex algorithm.

The algorithm, as described in Name Search Techniques, New York State Identification and Intelligence System Special Report No. 1, by Robert L. Taft, is:

  1. Translate first characters of name: MAC → MCC, KN → NN, K → C, PH → FF, PF → FF, SCH → SSS
  2. Translate last characters of name: EE → Y, IE → Y, DT, RT, RD, NT, ND → D
  3. First character of key = first character of name.
  4. Translate the remaining characters by the following rules, advancing one character at a time:
    1. EV → AF else A, E, I, O, U → A
    2. Q → G, Z → S, M → N
    3. KN → NN else K → C
    4. SCH → SSS, PH → FF
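    5. H → if the previous or next character is not a vowel, replace H with the previous character.
    6. W → if the previous character is a vowel, replace W with the previous character.
    7. Add the current character to the key if it is not the same as the last key character.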
  5. If last character is S, remove it.
  6. If last characters are AY, replace with Y.
  7. If last character is A, remove it.
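
The steps above can be sketched in Python as follows. This is a minimal illustration rather than a reference implementation: the function name nysiis, the input normalisation (upper-casing and dropping non-letters), the choice to apply only the first matching prefix and suffix rule, treating the end of the name as a nonvowel in the H rule, and the guards that keep a one-character key from being emptied are all assumptions not stated in the description above. No length limit is applied to the key.

```python
VOWELS = set("AEIOU")


def nysiis(name: str) -> str:
    """Sketch of the NYSIIS key following the seven steps listed above."""
    # Assumption: keep only the letters A-Z, upper-cased.
    s = "".join(ch for ch in name.upper() if "A" <= ch <= "Z")
    if not s:
        return ""

    # Step 1: translate the first characters (first matching rule only).
    for prefix, repl in (("MAC", "MCC"), ("KN", "NN"), ("K", "C"),
                         ("PH", "FF"), ("PF", "FF"), ("SCH", "SSS")):
        if s.startswith(prefix):
            s = repl + s[len(prefix):]
            break

    # Step 2: translate the last characters (first matching rule only).
    for suffix, repl in (("EE", "Y"), ("IE", "Y"), ("DT", "D"), ("RT", "D"),
                         ("RD", "D"), ("NT", "D"), ("ND", "D")):
        if s.endswith(suffix):
            s = s[:-len(suffix)] + repl
            break

    # Step 3: the key starts with the first character of the name.
    chars = list(s)
    key = [chars[0]]

    # Step 4: rewrite in place, moving forward one character at a time.
    for i in range(1, len(chars)):
        c = chars[i]
        rest = "".join(chars[i:i + 3])
        prev = chars[i - 1]
        nxt = chars[i + 1] if i + 1 < len(chars) else ""
        if rest.startswith("EV"):
            chars[i:i + 2] = "AF"
        elif c in VOWELS:
            chars[i] = "A"
        elif c == "Q":
            chars[i] = "G"
        elif c == "Z":
            chars[i] = "S"
        elif c == "M":
            chars[i] = "N"
        elif rest.startswith("KN"):
            chars[i:i + 2] = "NN"
        elif c == "K":
            chars[i] = "C"
        elif rest.startswith("SCH"):
            chars[i:i + 3] = "SSS"
        elif rest.startswith("PH"):
            chars[i:i + 2] = "FF"
        elif c == "H" and (prev not in VOWELS or nxt not in VOWELS):
            # Assumption: the end of the name counts as a nonvowel.
            chars[i] = prev
        elif c == "W" and prev in VOWELS:
            chars[i] = prev
        # Rule 7: append unless it repeats the last key character.
        if chars[i] != key[-1]:
            key.append(chars[i])

    k = "".join(key)
    # Steps 5-7; the length guards are an assumption to avoid an empty key.
    if len(k) > 1 and k.endswith("S"):
        k = k[:-1]
    if k.endswith("AY"):
        k = k[:-2] + "Y"
    if len(k) > 1 and k.endswith("A"):
        k = k[:-1]
    return k


print(nysiis("Knight"))     # NAGT
print(nysiis("Macintosh"))  # MCANT
```

Under these assumptions, "Knight" becomes NNIGHT in step 1, the vowel I maps to A, and the H takes the value of the preceding G and is then dropped as a repeat, giving NAGT.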

External links