Voice Editing
Use Sound Forge as an example to demonstrate how to edit the sound sources.
STEP 1
Open the original sound file as below
STEP 2
Convert the sound file to the format that we can accept (WAV and MONO)
STEP 3
Normally, we will have high frequency noise introduced when we resample the original sound file. This high frequency noise can be reduced by using the Equalizer function as shown below:
EQ setting example:
60 HZ or below | Depend on situation, sometimes noise will be in this frequency rang, therefore, we need to decrease the gain of this frequency range |
---|---|
150 HZ | Dog bark, Drum |
400 HZ | Male speech |
1000 HZ | High pitch male, female speech |
2400 HZ | High pitch female |
6000 HZ | Children |
15K HZ or above | Usually, we will reduce the gain of this frequency range into minimum because our voice chip cannot response to this. |
The value setting for the above commands are not fixed. User should choose the suitable value for his/her particular sound file.
STEP 4
The sample rate is dependent on which IC body is used. We have the following IC bodies:
OTP Chip | Memory Size | Calculation of Sampling Rate Sample Rate = Memory Size / (voice duration x Data encoding algorithm ) |
---|---|---|
10 sec | 256K | ex sound total 5sec , S.R. = 256K / (5sec x 4bit) ≒ 12.8KHz ( 4bit ADPCM ) ex sound total 5sec , S.R. = 256K / (5sec x 5bit) ≒ 10.2KHz ( 5bit u-Law ) ex sound total 5sec , S.R. = 256K / (5sec x 8bit) ≒ 6.4KHz ( 8bit PCM ) |
21 sec | 512K | ex sound total 8sec S.R. = 512K / (8sec x 4bit) ≒ 16KHz ( 4bit ADPCM ) ex sound total 8sec S.R. = 512K / (8sec x 5bit) ≒ 12.8KHz ( 5bit u-Law ) ex sound total 8sec S.R. = 512K / (8sec x 8bit) ≒ 8KHz ( 8bit PCM ) |
42 sec | 1M | ex sound total 12.5sec S.R. = 1M / (12.5sec x 4bit) ≒ 20KHz ( 4bit ADPCM ) ex sound total 12.5sec S.R. = 1M / (12.5sec x 5bit) ≒ 16KHz ( 5bit u-Law ) ex sound total 12.5sec S.R. = 1M / (12.5sec x 8bit) ≒ 10KHz ( 8bit PCM ) |
85 sec | 2M | ex sound total 28sec S.R. = 2M / (28sec x 4bit) ≒ 18KHz ( 4bit ADPCM ) ex sound total 28sec S.R. = 2M / (28sec x 5bit) ≒ 14.2KHz ( 5bit u-Law ) ex sound total 28sec S.R. = 2M / (28sec x 8bit) ≒ 8.9KHz ( 8bit PCM ) |
170 sec | 4M | ex sound total 70sec S.R. = 4M / (70sec x 4bit) ≒ 14.2KHz ( 4bit ADPCM ) ex sound total 70sec S.R. = 4M / (70sec x 8bit) ≒ 7.1KHz ( 8bit PCM ) |
341 sec | 8M | ex sound total 153sec S.R. = 8M / (153sec x 4bit) ≒ 13KHz ( 4bit ADPCM ) ex sound total 153sec S.R. = 8M / (153sec x 5bit) ≒ 10.4KHz ( 5bit u-Law ) ex sound total 153sec S.R. = 8M / (153sec x 8bit) ≒ 6.5KHz ( 8bit PCM ) |
11 min | 32M | ex sound total 235sec S.R. = 32M / (235sec x 8bit) ≒ 17KHz ( 8bit PCM ) |
After obtaining the sample rate, we should resample the original sound. If Sound Forge is used, and assume the new sample rate is 12KHz, then, we can resample the sound file using ‘Effect’ and ‘Resample’ to modify the sample rate as shown in the figures bellows:
STEP 5
Convert the sound file to the format that we can accept ( 8-bit )
STEP 6
Remove the silence portion at the begin and end of the sound can reduce the chance of “POP” sound to happen.Finally, If there are very low volume part in the middle of the sound, it should be converted to total silence otherwise, the low volume sound maybe become noise sound in the final IC.
STEP 7
save as