Description of the Census Soundex
and Miracode Indexing Systems 


The Soundex is a coded surname (last name) index based on the way a surname sounds rather than the way it is spelled. Surnames that sound the same, but are spelled differently, like SMITH and SMYTH, have the same code and are filed together. The Soundex coding system was developed so that you can find a surname on the census even though it may have been recorded under various spellings. This indexing system was used for the 1880, 1900, and 1920 census records and is designed to make it easier to locate a specific family on the regular census pages. Most of the 1890 census has been lost and only 21 states were Soundexed or Miracoded for 1910. To search for a particular surname, you must first work out its code. Basic Soundex coding rules include:

1) Every Soundex code consists of a letter followed by three numbers, such as W-252. The letter is always the first letter of the surname. The numbers are assigned to the remaining letters of the surname according to the following Soundex coding guide:

1 = B F P V
2 = C G J K Q S X Z
3 = D T
4 = L
5 = M N
6 = R
0 = if no consonants remain
Always disregard the consonants W, H, and Y along with the vowels
A, E, I, O, U.

2) Zeroes are added at the end if necessary to produce a four-character code. Extra letters are disregarded. For example:

* Lee is coded as L-000 (L, no additional consonants so 000 is added).

* Shea is codes as S-000 (S, no additional consonants so 000 is added).

* Washington is coded as W-252 (W, 2 for the S, 5 for the N, 2 for the G, the remaining letters of the surname are disregarded).

3) If the surname has any double letters in the surname, they should be treated as one letter. For example:

* Gutierrez is coded as G-362 (G, 3 for the T, 6 for the first R, the second R is ignored, 2 for the Z).

* Sheppard is coded as S-163 (S, 1 for the first P, 6 for the R, and 3 for the D).

* Sherrell is coded as S-640 (S, 6 for the first R, 4 for the first L, add a 0).

4) If the surname has different consonants side-by-side (two or more) that have the same number in the soundex coding guide, they should be treated as one letter. For example:

* Ashcraft is coded as A-261 (A, 2 for the S, the C is ignored, 6 for the R, 1 for the F). It is not coded A-226.

* Jackson is coded as J-250 (J, 2 for the C, the K is ignored, the S is ignored, 5 for the N, a 0 is added).

* Tymczak is coded as T-522 (T, 5 for the M, 2 for the C, the Z is ignored, 2 for the K). Since the vowel "A" separates the Z and K, the K is also coded.

* Sacks is coded as S-200 (S, 2 for the C, ignore the K and S, add 00).

5) If the letter following the initial letter has the same value, it should be disregarded in coding. For example:

* Pfister is coded as P-236 (P, the F is ignored, 2 for the S, 3 for the T, 6 for the R).

* Schechwitz is coded as S-232 (S, 2 for the second C, 3 for the T, 2 for the Z).

* Szakal is coded as S-240 (S, the Z is ignored, 2 for the K, 4 for the L, add a 0).

* Scklar is coded as S-460 (S, ignore the C and K, 4 for the L, 6 for the R, add a 0).

6) When a consonant or its equivalent is separated by an A, E, I, O, U, or W, H, Y, the consonants or their equivalents are considered separately. For example:

* Staten is coded as S-335 (S, 3 for the first T, 3 for the second T, 5 for the N).

* Salkiewicz is coded as S- 422 (S, 4 for the L, 2 for the K, 2 for the C).

* Simone is coded as S-550 (S, 5 for the M, 5 for the N, add a 0).

* Sokwzy is coded as S-200 (S, 2 for the K, ignore the Z – see rule #4, add 00).

* Schkolink is coded as S-452 (S, 4 for the L, 5 for the N, 2 for the K).

7) If a surname has a prefix, such as Van, Con, De, Di, La, or Le, code both with and without the prefix because the surname might be listed under either code. Note, however, that Mc and Mac are not considered prefixes. For example:

* VanDeusen might be coded two ways: as V-532 (V, 5 for the N, 3 for the D, 2 for the S) or as D-250 (D, 2 for the S, 5 for the N, 0 is added).

All possible spelling variations for surnames should be considered to avoid missing individuals enumerated on the various census schedules. For example, names such as Cain and Kane might be coded as C-600 or K-600 – depending on how the census enumerator spelled the surname.

The Miracode coding system is very similar to the Soundex System and was used only for the 1910 federal census. Only 21 states were Soundexed or Miracoded in 1910. Miracode uses the same phonetic code and abbreviations as the Soundex system. Each Miracode card lists:

* Volume number

* ED (enumeration district)

* And the visitation (household) number as assigned by the census enumerator.

In contrast, the Soundex card will show:

* Volume number

* ED

* Page number

* And line number on the census page

After determining the appropriate code (Soundex or Miracode) for the surname in which you are interested, locate the individual on the microfilmed indexes. The Soundex is listed alphabetically by given name within each unique surname code. A Soundex microfilm heading indicating mixed codes mean several different surnames with several different Soundex codes are listed together and are few in number. They still will be arranged alphabetically by given name. Copy the following information from the index:

* County

* ED number (enumeration district number)

* Sheet (page) number

* Line or household number

Using this information copied from the Soundex or Miracode index, you should now be able to easily locate the desired family on the microfilmed census rolls.
 

<Back to Family History Research Tips>
 


Email us your questions, comments, or suggestions
info@linkquestusa.com

© Copyright LinkQuest USA 2004