Phil Schatzmann – https://www.pschatzmann.ch/home

Identity Management and Authentication for Java Webservices and Javascript GUIs
Tue, 12 Feb 2019 – https://www.pschatzmann.ch/home/2019/02/12/identity-management-and-authentications/

I am currently working on a new project which uses Vue.js on the frontend and JAX-RS web services implemented in Java on the backend. Both the frontend and the backend will be deployed with Docker.

The solution needs to manage customers and users, and provide authentication to protect the web application and the web services. It does not make sense to build my own solution for this, so I decided to base my architecture on existing open source tools.

Open Source Identity Management Tools

There is quite an impressive list of potential open source identity management tools. I also included LDAP-based tools in the evaluation:

Criteria

In order to evaluate the best-fitting solution I defined the following evaluation criteria:

  • Manage our Customers with custom attributes
  • Manage Users with custom attributes
  • Java API to manage Customers and Users
  • Basic Processes: User Registration, Password Reset
  • Basic Authentication functionality with Password Policies, Brute Force Detection
  • Authentication support with OpenID Connect
  • Integration with LDAP
  • Support of external Identity Providers (Microsoft, Google, etc.)
  • Easy to use
  • Integration with Vue.js for Authentication
  • Integration with Java Webservices (JAX-RS) for Authentication
  • Easy to deploy with Docker
  • Integration with WordPress
  • Open Source

Result

LDAP is quite popular, but it is quite painful to extend its schema and it misses most of my defined criteria.

There is basically only one tool which turned out to be a perfect match: Keycloak. Users, roles and user groups support dynamic key/value attributes, so it is easy to store our custom attributes with the help of a Java API or via web services.
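
As an illustration, here is a minimal sketch of how such custom attributes could be stored through the Keycloak Admin Client (org.keycloak:keycloak-admin-client). The server URL, realm name, credentials and the customerId attribute are placeholders for this example and not taken from the actual project:

    import java.util.Collections;
    import javax.ws.rs.core.Response;
    import org.keycloak.admin.client.Keycloak;
    import org.keycloak.admin.client.KeycloakBuilder;
    import org.keycloak.representations.idm.UserRepresentation;

    public class KeycloakUserExample {
        public static void main(String[] args) {
            // Connect to the Keycloak Admin REST API (placeholder URL and admin credentials)
            Keycloak keycloak = KeycloakBuilder.builder()
                    .serverUrl("http://localhost:8080/auth")
                    .realm("master")
                    .clientId("admin-cli")
                    .username("admin")
                    .password("admin")
                    .build();

            // Create a user with a custom attribute (e.g. the customer it belongs to)
            UserRepresentation user = new UserRepresentation();
            user.setUsername("jdoe");
            user.setEnabled(true);
            user.setAttributes(Collections.singletonMap(
                    "customerId", Collections.singletonList("4711")));

            Response response = keycloak.realm("my-realm").users().create(user);
            System.out.println("Create user returned status " + response.getStatus());
        }
    }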

Usually my web services are based on Jersey. Unfortunately it turned out that the Keycloak API is based on RESTEasy, and it did not feel right to have two JAX-RS frameworks in parallel, so I moved my solution to RESTEasy as well.

In order to simplify the deployment of the web services in Docker I usually integrate an embedded server. For RESTEasy I could use the UndertowJaxrsServer, and with two lines of code I had my embedded web server running as well:

    import io.undertow.Undertow;
    import org.jboss.resteasy.plugins.server.undertow.UndertowJaxrsServer;

    // Deploy the JAX-RS application and start the embedded Undertow server
    UndertowJaxrsServer server = new UndertowJaxrsServer().deploy(new Application());
    server.start(Undertow.builder().addHttpListener(Integer.parseInt(port), host));
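
For completeness: the Application above is just a standard JAX-RS application class that registers the resource classes. A minimal sketch might look like the following; the PingResource is only a hypothetical placeholder, not part of the actual project:

    import java.util.HashSet;
    import java.util.Set;
    import javax.ws.rs.GET;
    import javax.ws.rs.Path;

    // Minimal JAX-RS Application that is deployed by the UndertowJaxrsServer above
    public class Application extends javax.ws.rs.core.Application {
        @Override
        public Set<Class<?>> getClasses() {
            Set<Class<?>> classes = new HashSet<>();
            classes.add(PingResource.class);
            return classes;
        }
    }

    // Hypothetical example resource, just so there is something to deploy
    @Path("/ping")
    class PingResource {
        @GET
        public String ping() {
            return "pong";
        }
    }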

Docker and Traefik – A Comprehensive Tutorial
Fri, 18 Jan 2019 – https://www.pschatzmann.ch/home/2019/01/18/docker-and-treafik-a-comprehensive-tutorial/

In Docker you expose functionality by specifying a port mapping, and you then address the application on the physical host via the mapped port.

I have been using Docker for quite some time now, and this was getting a little too tedious because each mapped port needs to be unique and you have to remember the port numbers to reach your applications. So I wanted a simpler way using descriptive host names: e.g. http://svnclient.docker.local should display the svnclient, and this should work from all clients in my local network.

It took me a full day to put all the pieces together into a working solution, so I am writing this down in the hope that it prevents someone else from falling into the same traps. Here is the complete solution:

Traefik Overview

Traefik is a modern HTTP reverse proxy and load balancer that makes deploying microservices easy. It provides some predefined “providers” and Docker is one of them.

1. Network

Traefik automatically sets up the routing from frontends to backends. It is therefore necessary that the Traefik container and the application containers are on the same Docker network so that they can communicate with each other.

So I created my Docker network first:

    docker network create web
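
To double-check that the network exists and, later on, which containers have joined it, the standard Docker commands can be used:

    docker network ls
    docker network inspect web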

2. Traefik Configuration File

I am using the following traefik.toml configuration file:

################################################################
# Global configuration
################################################################

# Enable debug mode
#
# Optional
# Default: false
#
# debug = true

# Log level
#
# Optional
# Default: "ERROR"
#
# logLevel = "DEBUG"

# Entrypoints to be used by frontends that do not specify any entrypoint.
# Each frontend can specify its own entrypoints.
#
# Optional
# Default: ["http"]
#
# defaultEntryPoints = ["http", "https"]

################################################################
# Entrypoints configuration
################################################################

# Entrypoints definition
#
# Optional
# Default:
[entryPoints]
    [entryPoints.http]
    address = ":80"

################################################################
# Traefik logs configuration
################################################################

# Traefik logs
# Enabled by default and log to stdout
#
# Optional
#
# [traefikLog]

# Sets the filepath for the traefik log. If not specified, stdout will be used.
# Intermediate directories are created if necessary.
#
# Optional
# Default: os.Stdout
#
# filePath = "log/traefik.log"

# Format is either "json" or "common".
#
# Optional
# Default: "common"
#
# format = "common"

################################################################
# Access logs configuration
################################################################

# Enable access logs
# By default it will write to stdout and produce logs in the textual
# Common Log Format (CLF), extended with additional fields.
#
# Optional
#
# [accessLog]

# Sets the file path for the access log. If not specified, stdout will be used.
# Intermediate directories are created if necessary.
#
# Optional
# Default: os.Stdout
#
# filePath = "/path/to/log/log.txt"

# Format is either "json" or "common".
#
# Optional
# Default: "common"
#
# format = "common"

################################################################
# API and dashboard configuration
################################################################

# Enable API and dashboard
[api]

  # Name of the related entry point
  #
  # Optional
  # Default: "traefik"
  #
  # entryPoint = "traefik"

  # Enabled Dashboard
  #
  # Optional
  # Default: true
  #
  # dashboard = false

################################################################
# Ping configuration
################################################################

# Enable ping
[ping]

  # Name of the related entry point
  #
  # Optional
  # Default: "traefik"
  #
  # entryPoint = "traefik"

################################################################
# Docker Provider
################################################################

# Enable Docker Provider.
[docker]

# Docker server endpoint. Can be a tcp or a unix socket endpoint.
#
# Required
#
endpoint = "unix:///var/run/docker.sock"

# Default base domain used for the frontend rules.
# Can be overridden by setting the "traefik.domain" label on a container.
#
# Optional
#
domain = "docker.local"

# Enable watch docker changes.
#
# Optional
#
watch = true

# Override default configuration template.
# For advanced users :)
#
# Optional
#
# filename = "docker.tmpl"

# Override template version
# For advanced users :)
#
# Optional
# - "1": previous template version (must be used only with older custom templates, see "filename")
# - "2": current template version (must be used to force template version when "filename" is used)
#
# templateVersion = 2

# Expose containers by default in Traefik.
# If set to false, containers that don't have `traefik.enable=true` will be ignored.
#
# Optional
# Default: true
#
exposedByDefault = false

# Use the IP address from the bound port instead of the inner network one.
#
# In case no IP address is attached to the bound port (or in case
# there is no bind), the inner network one will be used as a fallback.
#
# Optional
# Default: false
#
usebindportip = true

# Use Swarm Mode services as data provider.
#
# Optional
# Default: false
#
swarmMode = false

# Polling interval (in seconds) for Swarm Mode.
#
# Optional
# Default: 15
#
swarmModeRefreshSeconds = 15

# Define a default docker network to use for connections to all containers.
# Can be overridden by the traefik.docker.network label.
#
# Optional
#
network = "web"

# Enable docker TLS connection.
#
# Optional
#
#  [docker.tls]
#  ca = "/etc/ssl/ca.crt"
#  cert = "/etc/ssl/docker.crt"
#  key = "/etc/ssl/docker.key"
#  insecureSkipVerify = true

I would like to highlight the following settings:

  • domain = “docker.local”: this defines the suffix for the generated host names
  • exposedByDefault = false: I think it is good practice to decide explicitly which applications to expose
  • network = “web”: this is the network name that I created in step 1. If you are not happy with this name you can change it here.

3. Docker Compose for Traefik

I am using docker compose to start my containers. So here is my docker-compose.yml:

    version: '3.4'

    services:
        traefik:
          image: traefik:alpine
          container_name: traefik
          ports:
            - "80:80"
            - "8080:8080"
          volumes:
            - /var/run/docker.sock:/var/run/docker.sock
            - /srv/traefik/traefik.toml:/traefik.toml
          labels:
            - traefik.enable=true
            - traefik.port=8080
            - traefik.frontend.rule=Host:traefik.docker.local
          restart: always
          networks:
          - web
          - sparknet

    networks:
      web:
        external:
          name: web
      sparknet:
        external:
          name: sparknet

I started with a default network (as explained in the next chapter) but I changed the network setup to explicitly enumerate the networks that should be used to communicate with the applications.
I will explain the labels in the next chapter, because I just treat Traefik like any other application.

4. Docker Compose for the Subversion Container

Here is the docker-compose.yml for my demo application. I am using a dockerized svnserve and websvn as web application to access the content of Subversion:

    version: '3.4'
    services:
        svnserve:
            image: pschatzmann/svnserve
            container_name: svnserve
            ports:
                - "3690:3690"
            environment:
                - SVN_REPONAME=my-repo
            volumes:
                - /srv/svn:/srv/svn
            restart: always

        websvn:
            image: pschatzmann/websvn
            container_name: websvn
            ports:
                - "8003:80"
            environment:
                - repository=svn://my-secret
                - user=me
                - password=i-dont-tell
            labels:
                - traefik.enable=true
                - traefik.frontend.rule=Host:websvn.docker.local
            restart: always

    networks:
      default:
        external:
          name: web

The key settings for the support of Traefik are the following:

  • traefik.enable=true: activates the Traefik support for the web GUI
  • The default network, which makes sure that Traefik can reach this application
  • traefik.frontend.rule=Host:websvn.docker.local (optional): I did not like the automatically generated address, so I replaced it with a shorter, more descriptive one
  • You could add a label to specify the port: traefik.port=80. But this is only necessary if Traefik cannot pick the right port automatically.

We are ready to start both applications with

    docker-compose up -d

Both applications should now have an entry in the frontend and backend lists of the Traefik dashboard, which is available at http://your_host:8080.
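
Since the [ping] section is enabled in the traefik.toml above and port 8080 is mapped, a quick health check of the Traefik API entry point should simply return OK (my assumption based on this configuration):

    curl http://your_host:8080/ping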

5. Name Resolution

With the current setup the usual tutorials stop and demonstrate that the solution is working by using curl in the following way:

    curl -H Host:websvn.docker.local http://docker.local

But I wanted the websvn.docker.local address to work directly in my browser, which it does not (yet).
So far I have identified the following solution approaches:

  • Use a proxy in your browser, e.g. with SwitchyOmega: this approach depends on your web browser and needs to be set up on each client separately
  • Use DNSMASQ: there are quite a few tutorials available (which depend on your operating system) and usually this is easy to set up – unless you are on a recent version of Ubuntu. Unfortunately I was using Ubuntu and I gave this one up because it was getting too messy.
  • Use a DNS server and set up a wildcard host address.

Setting up a local Bind server had always been a low-priority item on my todo list, so I went for this solution, which I will describe next. Fortunately Webmin has an easy-to-use web GUI which made the process quite painless:
1. Set the other DNS to point to my Zystel router which provides DHCP
2. Create a new master zone: local
3. Create a new address: docker.local with IP address 10.147.17.177
4. Create a new address: nuc.local with IP address 10.147.17.177 (which is the physical server)
5. Add a reverse address mapping from 10.147.17.177 to docker.local
6. Create a name alias: *.docker.local pointing to docker.local
7. Start or restart the Bind DNS server
8. Change the network settings on my local MacBook to use 10.147.17.177 as DNS server.

This generated the following entries in /var/lib/bind/local.hosts:

    $ttl 38400
    local.  IN  SOA nuc.local. phil\.schatzmann.gmail.com. (
                1547743979
                10800
                3600
                604800
                38400 )
    local.  IN  NS  nuc.local.
    docker.local.   IN  A   10.147.17.177
    nuc.local.  IN  A   10.147.17.177
    10.147.17.177.local.    IN  PTR docker
    *.docker.local. IN  CNAME   docker.local.
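
Before pointing the clients to the new DNS server, you can also query Bind directly to verify the wildcard entry; the host name should resolve to 10.147.17.177 via the docker.local CNAME (dig is part of the usual dnsutils/bind-utils packages):

    dig @10.147.17.177 websvn.docker.local +short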

So finally I can test whether the name resolution is working correctly:

    ping websvn.docker.local
    PING docker.local (10.147.17.177): 56 data bytes
    64 bytes from 10.147.17.177: icmp_seq=0 ttl=64 time=8.144 ms
    64 bytes from 10.147.17.177: icmp_seq=1 ttl=64 time=6.181 ms
    64 bytes from 10.147.17.177: icmp_seq=2 ttl=64 time=1.893 ms

And finally, when I enter http://websvn.docker.local in my browser, I get the expected result.

NFS in Docker-Compose
Wed, 16 Jan 2019 – https://www.pschatzmann.ch/home/2019/01/16/nfs-in-docker-compose/

My main Docker deployments are on a single Linux server. I have a central ‘data’ directory where I keep my common data. The application-specific data and settings are stored in /srv/.

But for testing or demos I usually use local Docker installations on my MacBook or MacBook Air under OS X. I use NFS to access the shared central data.

In this blog post I give a quick overview of the setup:

  • Install the NFS server. There are plenty of good tutorials on the Internet that describe how to do this; I was using this one. It would even be an option to run the NFS server as a Docker container.

  • Create exports for the directories on the server. Fortunately I could use the NFS Exports screen in Webmin. Here is the definition for sharing the /srv directory:

I am using a ZeroTier virtual network so that I can also access my local resources from the public Internet. Therefore I defined the corresponding IP network and netmask.
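
For reference, the resulting export roughly corresponds to /etc/exports entries like the following. The 10.147.17.0/24 network and the mount options are assumptions based on the ZeroTier addresses used in this post; adjust them to your own setup:

    # share /srv and /data with the (assumed) ZeroTier network
    /srv    10.147.17.0/255.255.255.0(rw,sync,no_subtree_check)
    /data   10.147.17.0/255.255.255.0(rw,sync,no_subtree_check)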

  • Now we are ready to use the exported directories. Here is the docker-compose.yml file which uses NFS volumes:
    version: '3.2'
    services:
      beakerx:
        image: pschatzmann/data-science:1.2.0
        container_name: beakerx
        hostname: beakerx
        restart: always
        ports:
          - 8007:8888
          - 9000:9000
        volumes:
          - nfs-srv:/home/beakerx
          - nfs-data:/data
        environment:
          - TZ=Europe/Zurich
        dns:
          - 8.8.8.8
        extra_hosts:
          - "nuc.local:10.147.17.177"
    
    volumes:
      nfs-srv:
         driver: local
         driver_opts:
           type: nfs
           device: ":/srv/data-science"
           o: "addr=10.147.17.177,nolock,soft,rw"
    
      nfs-data:
         driver: local
         driver_opts:
           type: nfs
           device: ":/data"
           o: "addr=10.147.17.177,nolock,soft,rw"
    

Using the ‘Smart EDGAR’ Library to Access EDGAR Filings
Tue, 15 Jan 2019 – https://www.pschatzmann.ch/home/2019/01/15/usage-of-smart-edgar-as-library-to-access-edgar-filings/

The initial concept of Smart EDGAR was to provide the tools to download the data from the EDGAR website and
– store the information as local files
– store the information in a SQL database
– provide reporting functionality from this data

We have extended the functionality a little bit so that the Smart EDGAR library can be used without having any data available locally. So we also provide:
– an API which retrieves the information directly from the EDGAR website
– company KPIs retrieved from the EDGAR website or pulled from a REST web service

The details can be found in the following Gist.

MLLib – Prediction of Stock Prices from Financial KPIs
Fri, 11 Jan 2019 – https://www.pschatzmann.ch/home/2019/01/11/mmlib-prediction-of-stock-prices-using-financial-kpis/

Financial KPIs can be used to drive investment decisions. So it was my goal to create a comprehensive set of KPIs across different dimensions. In this document we will use EDGAR to calculate KPIs to measure the following dimensions of a reporting company:
– Profitability
– Liquidity
– Efficiency
– Innovation
– Growth
– Leadership
– Surprises

It is the expectation that the stock price of companies with better KPIs will grow faster than that of their competitors. So in this document we will evaluate the KPIs against the stock prices.
The solution is based on the following functionality

The result can be found in the following Gist.

Smart EDGAR: Definition of a Comprehensive Set of Financial KPIs
Tue, 08 Jan 2019 – https://www.pschatzmann.ch/home/2019/01/08/edgar-definition-of-a-comprehensive-set-of-financial-kpis/

A long time ago, when I was studying “Management and Business Administration” in Switzerland, I thought it would be cool to be able to calculate Financial KPIs in order to compare different companies within one sector or to be able to identify sector specific differences.

Well, in Switzerland we still don’t have any requirement to file financial reports electronically, and unfortunately this will not change anytime soon. Fortunately there are countries which are more progressive, e.g. the US, which makes the information available via EDGAR, or the UK with Companies House. A complete overview of countries which support XBRL can be found here.

This information might be useful to drive investment decisions. So it was my goal to create a comprehensive set of KPIs across different dimensions which can be used for machine learning. In this document we will use EDGAR to calculate KPIs to measure the following dimensions of a reporting company:
– Profitability
– Liquidity
– Efficiency
– Innovation
– Growth
– Leadership
– Surprises

The solution has been implemented with JupyterLab using the BeakerX Scala kernel with the following additional libraries:
– Smart EDGAR (to access the EDGAR data)
– jupyter-jdk-extensions (to display our custom tables in BeakerX)

The result can be found in the following Gist.

It is the expectation that the stock price of companies with better KPIs will grow faster than that of their competitors. So in my next blog post we will evaluate the KPIs against the stock prices.

Smart EDGAR: Calculation of Surprises
Sun, 06 Jan 2019 – https://www.pschatzmann.ch/home/2019/01/06/smart-edgar-calculation-of-surprises/

Financial KPIs can be used to drive investment decisions. So it was my goal to create a comprehensive set of KPIs across different dimensions that are based on the information which can be determined from EDGAR:
– Profitability
– Liquidity
– Efficiency
– Innovation
– Growth
– Leadership
– Surprises

In this document we demonstrate how to calculate the Surprises.

Smart EDGAR: Calculation of Growth Parameters
Sat, 05 Jan 2019 – https://www.pschatzmann.ch/home/2019/01/05/smart-edgar-calculation-of-growth-parameters/

Financial KPIs can be used to drive investment decisions. So it was my goal to create a comprehensive set of KPIs across different dimensions that are based on the information which can be determined from EDGAR:

  • Profitability
  • Liquidity
  • Efficiency
  • Innovation
  • Growth
  • Leadership
  • Surprises

In this document we demonstrate how to calculate the Growth Parameters.

Smart EDGAR File API
Thu, 03 Jan 2019 – https://www.pschatzmann.ch/home/2019/01/03/smart-edgar-file-api/

So far we have seen how to use the database-related API of Smart EDGAR.
In this Gist I give a quick overview of the core functionality of the file-based API, which does not require any DBMS.

Calculating the Market Share of US Companies with the Help of EDGAR
Fri, 28 Dec 2018 – https://www.pschatzmann.ch/home/2018/12/28/calculating-the-market-share-with-edgar/

EDGAR classifies the reporting companies by SIC (Standard Industrial Classification) code. We can use this information to calculate the total sales per sector and then the percentage share of each individual company.

This helps us identify the companies with a large market share.

The result (which uses Spark) can be found in the following Gist. Alternatively, here is a version which relies purely on Scala and Smart EDGAR.
