The Soundex is a coded surname (last name)
index based on the way a surname sounds rather than the way it
is spelled. Surnames that sound the same, but are spelled
differently, like SMITH and SMYTH, have the same code and are
filed together. The Soundex coding system was developed so
that you can find a surname on the census even though it may
have been recorded under various spellings. This indexing
system was used for the 1880, 1900, and 1920 census records
and is designed to make it easier to locate a specific family
on the regular census pages. Most of the 1890 census has been
lost and only 21 states were Soundexed or Miracoded for 1910.
To search for a particular surname, you must first work out
its code. Basic Soundex coding rules include:
1) Every Soundex code consists of a letter followed by three
numbers, such as W-252. The letter is always the first letter
of the surname. The numbers are assigned to the remaining
letters of the surname according to the following Soundex
coding guide:
1 = B F P V
2 = C G J K Q S X Z
3 = D T
4 = L
5 = M N
6 = R
0 = if no consonants remain
Always disregard the consonants W, H, and Y along with the
vowels
A, E, I, O, U.
2) Zeroes are added at the end if necessary to produce a
four-character code. Extra letters are disregarded. For
example:
* Lee is coded as L-000 (L, no additional consonants so 000 is
added).
* Shea is codes as S-000 (S, no additional consonants so 000
is added).
* Washington is coded as W-252 (W, 2 for the S, 5 for the N, 2
for the G, the remaining letters of the surname are
disregarded).
3) If the surname has any double letters in the surname, they
should be treated as one letter. For example:
* Gutierrez is coded as G-362 (G, 3 for the T, 6 for the first
R, the second R is ignored, 2 for the Z).
* Sheppard is coded as S-163 (S, 1 for the first P, 6 for the
R, and 3 for the D).
* Sherrell is coded as S-640 (S, 6 for the first R, 4 for the
first L, add a 0).
4) If the surname has different consonants side-by-side (two
or more) that have the same number in the soundex coding
guide, they should be treated as one letter. For example:
* Ashcraft is coded as A-261 (A, 2 for the S, the C is
ignored, 6 for the R, 1 for the F). It is not coded A-226.
* Jackson is coded as J-250 (J, 2 for the C, the K is ignored,
the S is ignored, 5 for the N, a 0 is added).
* Tymczak is coded as T-522 (T, 5 for the M, 2 for the C, the
Z is ignored, 2 for the K). Since the vowel "A" separates the
Z and K, the K is also coded.
* Sacks is coded as S-200 (S, 2 for the C, ignore the K and S,
add 00).
5) If the letter following the initial letter has the same
value, it should be disregarded in coding. For example:
* Pfister is coded as P-236 (P, the F is ignored, 2 for the S,
3 for the T, 6 for the R).
* Schechwitz is coded as S-232 (S, 2 for the second C, 3 for
the T, 2 for the Z).
* Szakal is coded as S-240 (S, the Z is ignored, 2 for the K,
4 for the L, add a 0).
* Scklar is coded as S-460 (S, ignore the C and K, 4 for the
L, 6 for the R, add a 0).
6) When a consonant or its equivalent is separated by an A, E,
I, O, U, or W, H, Y, the consonants or their equivalents are
considered separately. For example:
* Staten is coded as S-335 (S, 3 for the first T, 3 for the
second T, 5 for the N).
* Salkiewicz is coded as S- 422 (S, 4 for the L, 2 for the K,
2 for the C).
* Simone is coded as S-550 (S, 5 for the M, 5 for the N, add a
0).
* Sokwzy is coded as S-200 (S, 2 for the K, ignore the Z – see
rule #4, add 00).
* Schkolink is coded as S-452 (S, 4 for the L, 5 for the N, 2
for the K).
7) If a surname has a prefix, such as Van, Con, De, Di, La, or
Le, code both with and without the prefix because the surname
might be listed under either code. Note, however, that Mc and
Mac are not considered prefixes. For example:
* VanDeusen might be coded two ways: as V-532 (V, 5 for the N,
3 for the D, 2 for the S) or as D-250 (D, 2 for the S, 5 for
the N, 0 is added).
All possible spelling variations for surnames should be
considered to avoid missing individuals enumerated on the
various census schedules. For example, names such as Cain and
Kane might be coded as C-600 or K-600 – depending on how the
census enumerator spelled the surname.
The Miracode coding system is very similar to the Soundex
System and was used only for the 1910 federal census. Only 21
states were Soundexed or Miracoded in 1910. Miracode uses the
same phonetic code and abbreviations as the Soundex system.
Each Miracode card lists:
* Volume number
* ED (enumeration district)
* And the visitation (household) number as assigned by the
census enumerator.
In contrast, the Soundex card will show:
* Volume number
* ED
* Page number
* And line number on the census page
After determining the appropriate code (Soundex or Miracode)
for the surname in which you are interested, locate the
individual on the microfilmed indexes. The Soundex is listed
alphabetically by given name within each unique surname code.
A Soundex microfilm heading indicating mixed codes mean
several different surnames with several different Soundex
codes are listed together and are few in number. They still
will be arranged alphabetically by given name. Copy the
following information from the index:
* County
* ED number (enumeration district number)
* Sheet (page) number
* Line or household number
Using this information copied from the Soundex or Miracode
index, you should now be able to easily locate the desired
family on the microfilmed census rolls.
<Back to Family
History Research Tips>