The Soundex Indexing System

Note: Based on National Archives information

The search that is done on surnames in the Alcona County Genealogical Society tables is based on an indexing system called the soundex.

 

The soundex is a coded surname (last name) index based on the way a surname sounds rather than the way it is spelled. Surnames that sound the same, but are spelled differently, like SMITH and SMYTH, have the same code and are filed together. The soundex coding system was developed to help find a surname even though it may have been recorded under various spellings

 

Sometimes, you will also find surnames that don’t sound the same, but have the same soundex code (for example, Ferrett and Ford are both F630). The soundex code often helps you find people, even when variations of the name are used, or their names are misspelled. For example, the surname Ferrett is often misspelled in official documents (Ferrat, Ferret, Feret, Ferat, etc.). All of these variant spellings have the same soundex code.

 


Basic Soundex Coding Rule

Every soundex code consists of a letter and three numbers, such as W252. The letter is always the first letter of the surname. The numbers are assigned to the remaining letters of the surname according to the soundex guide shown below. Zeroes are added at the end if necessary to produce a four-character code. Additional letters are disregarded. Examples:

  • Washington is coded W252 (W, 2 for the S, 5 for the N, 2 for the G, remaining letters disregarded).

·        Lee is coded L000 (L, 000 added).

Soundex Coding Guide

Number

Represents the Letters

1

B, F, P, V

2

C, G, J, K, Q, S, X, Z

3

D, T

4

L

5

M, N

6

R

disregard

A, E, I, O, U, H, W, and Y


Additional Soundex Coding Rules

  1. Names With Double Letters

If the surname has any double letters, they should be treated as one letter. For example:

    • Gutierrez is coded G362 (G, 3 for the T, 6 for the first R, second R ignored, 2 for the Z).
  1. Names with Letters Side-by-Side that have the Same Soundex Code Number

If the surname has different letters side-by-side that have the same number in the soundex coding guide, they should be treated as one letter. Examples:

    • Pfister is coded as P236 (P, F ignored, 2 for the S, 3 for the T, 6 for the R).
    • Jackson is coded as J250 (J, 2 for the C, K ignored, S ignored, 5 for the N, 0 added).
    • Tymczak is coded as T522 (T, 5 for the M, 2 for the C, Z ignored, 2 for the K). Since the vowel "A" separates the Z and K, the K is coded.
  1. Names with Prefixes

If a surname has a prefix, such as Van, Con, De, Di, La, or Le, code both with and without the prefix because the surname might be listed under either code. Note, however, that Mc and Mac are not considered prefixes.

For example, VanDeusen might be coded two ways:

V532 (V, 5 for N, 3 for D, 2 for S)

or

D250 (D, 2 for the S, 5 for the N, 0 added).

  1. Consonant Separators

If a vowel (A, E, I, O, U) separates two consonants that have the same soundex code, the consonant to the right of the vowel is coded. Example:

Tymczak is coded as T522 (T, 5 for the M, 2 for the C, Z ignored (see "Side-by-Side" rule above), 2 for the K). Since the vowel "A" separates the Z and K, the K is coded.

If "H" or "W" separate two consonants that have the same soundex code, the consonant to the right of the vowel is not coded. Example:

Ashcraft is coded A261 (A, 2 for the S, C ignored, 6 for the R, 1 for the F). It is not coded A226.

 

 

  Close Window