From e37afd69768117b1b6d82063d0d1abfb11b952bf Mon Sep 17 00:00:00 2001
From: Christian Kenny <christiankennyemail@gmail.com>
Date: Sun, 21 Nov 2021 21:36:08 -0500
Subject: [PATCH] 3rdparty: Upgrade soundtouch lib to 2.3.1

---
 3rdparty/soundtouch/COPYING.TXT               |    4 +-
 3rdparty/soundtouch/README.html               | 1826 +++++++++--------
 .../soundtouch/soundtouch/FIFOSampleBuffer.h  |    3 +
 3rdparty/soundtouch/soundtouch/STTypes.h      |   21 +-
 3rdparty/soundtouch/soundtouch/SoundTouch.h   |    4 +-
 .../source/SoundTouch/BPMDetect.cpp           |    7 +-
 .../source/SoundTouch/FIFOSampleBuffer.cpp    |    8 +
 .../source/SoundTouch/FIRFilter.cpp           |   83 +-
 .../soundtouch/source/SoundTouch/FIRFilter.h  |    1 +
 .../source/SoundTouch/InterpolateCubic.h      |    8 +-
 .../source/SoundTouch/InterpolateLinear.cpp   |    2 +-
 .../source/SoundTouch/InterpolateLinear.h     |   18 +-
 .../source/SoundTouch/InterpolateShannon.h    |    8 +-
 .../source/SoundTouch/PeakFinder.cpp          |    6 +-
 .../source/SoundTouch/RateTransposer.cpp      |   10 +-
 .../source/SoundTouch/RateTransposer.h        |    5 +-
 .../source/SoundTouch/TDStretch.cpp           |  145 +-
 .../source/SoundTouch/sse_optimized.cpp       |    2 +-
 18 files changed, 1141 insertions(+), 1020 deletions(-)
diff --git a/3rdparty/soundtouch/COPYING.TXT b/3rdparty/soundtouch/COPYING.TXT
index bbd24e6430..c04d4d8040 100644
--- a/3rdparty/soundtouch/COPYING.TXT
+++ b/3rdparty/soundtouch/COPYING.TXT
@@ -2,7 +2,7 @@
 		       Version 2.1, February 1999
 
  Copyright (C) 1991, 1999 Free Software Foundation, Inc.
-     59 Temple Place, Suite 330, Boston, MA  02111-1307  USA
+ 51 Franklin Street, Fifth Floor, Boston, MA  02110-1301  USA
  Everyone is permitted to copy and distribute verbatim copies
  of this license document, but changing it is not allowed.
 
@@ -117,7 +117,7 @@ be combined with the library in order to run.
 
   0. This License Agreement applies to any software library or other
 program which contains a notice placed by the copyright holder or
-other authoried party saying it may be distributed under the terms of
+other authorized party saying it may be distributed under the terms of
 this Lesser General Public License (also called "this License").
 Each licensee is addressed as "you".
 
diff --git a/3rdparty/soundtouch/README.html b/3rdparty/soundtouch/README.html
index c86989dc3d..1efe7a6ec2 100644
--- a/3rdparty/soundtouch/README.html
+++ b/3rdparty/soundtouch/README.html
@@ -1,587 +1,651 @@
 <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
 <html>
+
 <head>
   <title>SoundTouch library README</title>
-  <meta http-equiv="Content-Type"
- content="text/html; charset=windows-1252">
   <meta http-equiv="Content-Language" content="en-us">
   <meta name="author" content="Olli Parviainen">
-  <meta name="description"
- content="Readme file for SoundTouch audio processing library">
- <style>
-   body {font-family: Arial, Helvetica; }
- </style>
+  <meta name="description" content="Readme file for SoundTouch audio processing library">
+  <style>
+    body {
+      font-family: Arial, Helvetica;
+    }
+  </style>
 </head>
+
 <body class="normal">
-<hr>
-<h1>SoundTouch audio processing library v2.1.2</h1>
-<p class="normal">SoundTouch library Copyright &copy; Olli Parviainen 2001-2018</p>
-<hr>
-<h2>1. Introduction </h2>
-<p>SoundTouch is an open-source audio processing library that allows
-changing the sound tempo, pitch and playback rate parameters
-independently from each other, i.e.:</p>
-<ul>
-  <li> Sound tempo can be increased or decreased while maintaining the
-original pitch</li>
-  <li> Sound pitch can be increased or decreased while maintaining the
-original tempo</li>
-  <li> Change playback rate that affects both tempo and pitch at the
-same time</li>
-  <li> Choose any combination of tempo/pitch/rate</li>
-</ul>
-<h3>1.1 Contact information </h3>
-<p>Author email: oparviai 'at' iki.fi </p>
-<p>SoundTouch WWW page: <a href="http://soundtouch.surina.net">http://soundtouch.surina.net</a></p>
-<p>SoundTouch git repository: <a href="https://gitlab.com/soundtouch/soundtouch.git">https://gitlab.com/soundtouch/soundtouch.git</a></p>
-<hr>
-<h2>2. Compiling SoundTouch</h2>
-<p>Before compiling, notice that you can choose the sample data format if it's 
-desirable to use floating point sample data instead of 16bit integers. See 
-section &quot;sample data format&quot; for more information.</p>
-<p>Also notice that SoundTouch can use OpenMP instructions for parallel 
-computation to accelerate the runtime processing speed in multi-core systems, 
-however, these improvements need to be separately enabled before compiling. See 
-OpenMP notes in Chapter 3 below.</p>
-<h3>2.1. Building in Microsoft Windows</h3>
-<p>Project files for Microsoft Visual C++ are supplied with the source 
-code package. Go to Microsoft WWW page to download 
-<a href="http://www.visualstudio.com/en-US/products/visual-studio-express-vs">
-Microsoft Visual Studio Express version for free</a>.
-</p>
-<p>To build the binaries with Visual C++ compiler, either run
-"make-win.bat" script, or open the appropriate project files in source
-code directories with Visual Studio. The final executable will appear
-under the "SoundTouch\bin" directory. If using the Visual Studio IDE
-instead of the make-win.bat script, directories bin and lib may need to
-be created manually to the SoundTouch package root for the final
-executables. The make-win.bat script creates these directories
-automatically. </p>
-<p><strong>C# example</strong>: The source code package includes also a C# example 
-    application for Windows that shows how to invoke SoundTouch.dll 
+  <hr>
+  <h1>SoundTouch audio processing library v2.3.1</h1>
+  <p class="normal">SoundTouch library Copyright &copy; Olli Parviainen 2001-2021</p>
+  <hr>
+  <h2>1. Introduction </h2>
+  <p>SoundTouch is an open-source audio processing library that allows
+    changing the sound tempo, pitch and playback rate parameters
+    independently from each other, i.e.:</p>
+  <ul>
+    <li> Sound tempo can be increased or decreased while maintaining the
+      original pitch</li>
+    <li> Sound pitch can be increased or decreased while maintaining the
+      original tempo</li>
+    <li> Change playback rate that affects both tempo and pitch at the
+      same time</li>
+    <li> Choose any combination of tempo/pitch/rate</li>
+  </ul>
+  <h3>1.1 Contact information </h3>
+  <p>Author email: oparviai 'at' iki.fi </p>
+  <p>SoundTouch WWW page: <a href="http://soundtouch.surina.net">http://soundtouch.surina.net</a></p>
+  <p>SoundTouch git repository: <a
+      href="https://gitlab.com/soundtouch/soundtouch.git">https://gitlab.com/soundtouch/soundtouch.git</a></p>
+  <hr>
+  <h2>2. Compiling SoundTouch</h2>
+  <p>Before compiling, notice that you can choose the sample data format if it's
+    desirable to use 16bit integer sample data instead of floating point samples. See
+    section &quot;sample data format&quot; for more information.</p>
+  <p>Also notice that SoundTouch can use OpenMP instructions for parallel
+    computation to accelerate the runtime processing speed in multi-core systems,
+    however, these improvements need to be separately enabled before compiling. See
+    OpenMP notes in Chapter 3 below.</p>
+  <h3>2.1. Building in Microsoft Windows</h3>
+  <p>Project files for Microsoft Visual C++ are supplied with the source
+    code package. Go to Microsoft WWW page to download
+    <a href="http://www.visualstudio.com/en-US/products/visual-studio-express-vs">
+      Microsoft Visual Studio Express version for free</a>.
+  </p>
+  <p>To build the binaries with Visual C++ compiler, either run
+    "make-win.bat" script, or open the appropriate project files in source
+    code directories with Visual Studio. The final executable will appear
+    under the "SoundTouch\bin" directory. If using the Visual Studio IDE
+    instead of the make-win.bat script, directories bin and lib may need to
+    be created manually to the SoundTouch package root for the final
+    executables. The make-win.bat script creates these directories
+    automatically. </p>
+  <p><strong>C# example</strong>: The source code package includes also a C# example
+    application for Windows that shows how to invoke SoundTouch.dll
     dynamic-load library for processing mp3 audio.
-<p><strong>OpenMP NOTE</strong>: If activating the OpenMP parallel computing in 
-the compilation, the target program will require additional vcomp dll library to 
-properly run. In Visual C++ 9.0 these libraries can be found in the following 
-folders.</p>
-<ul>
-    <li>x86 32bit: C:\Program Files (x86)\Microsoft Visual Studio 
-    9.0\VC\redist\x86\Microsoft.VC90.OPENMP\vcomp90.dll</li>
-    <li>x64 64bit: C:\Program Files (x86)\Microsoft Visual Studio 
-    9.0\VC\redist\amd64\Microsoft.VC90.OPENMP\vcomp90.dll</li>
-</ul>
-<p>In Visual Studio 2008, a SP1 version may be required for these libraries. In 
-other VC++ versions the required library will be expectedly found in similar 
-&quot;redist&quot; location.</p>
-<p>Notice that as minor demonstration of a &quot;dll hell&quot; phenomenon both the 32-bit 
-and 64-bit version of vcomp90.dll have the same filename but different contents, 
-thus choose the proper version to allow the program start.</p>
-<h3>2.2. Building in Gnu platforms</h3>
-<p>The SoundTouch library compiles in practically any platform
-supporting GNU compiler (GCC) tools. SoundTouch requires GCC version 4.3 or later.</p>
-<p>To build and install the binaries, run the following commands in 
-/soundtouch directory:</p>
-<table border="0" cellpadding="0" cellspacing="4">
-  <tbody>
-    <tr>
-      <td style="vertical-align: top;">
-      <pre>./bootstrap  -</pre>
-      </td>
-      <td style="vertical-align: top;">Creates "configure" file with
-local autoconf/automake toolset.<br>
-      </td>
-    </tr>
-    <tr valign="top">
-      <td>
-      <pre>./configure  -</pre>
-      </td>
-      <td>
-      <p>Configures the SoundTouch package for the local environment.
-Notice that "configure" file is not available before running the
-"./bootstrap" command as above.<br>
-      </p>
-      </td>
-    </tr>
-    <tr valign="top">
-      <td>
-      <pre>make         -</pre>
-      </td>
-      <td>
-      <p>Builds the SoundTouch library &amp; SoundStretch utility. You can 
-      optionally add &quot;-j&quot; switch after &quot;make&quot; to speed up the compilation in 
-      multi-core systems.</p>
-      </td>
-    </tr>
-    <tr valign="top">
-      <td>
-      <pre>make install -</pre>
-      </td>
-      <td>
-      <p>Installs the SoundTouch &amp; BPM libraries to <b>/usr/local/lib</b>
-and SoundStretch utility to <b>/usr/local/bin</b>. Please notice that
-'root' privileges may be required to install the binaries to the
-destination locations.</p>
-      </td>
-    </tr>
-  </tbody>
-</table>
-<h4><b>2.2.1 Required GNU tools</b></h4>
-<p> <span style="font-weight: bold;">Bash shell</span>, <span
- style="font-weight: bold;">GNU C++ compiler</span>, <span
- style="font-weight: bold;">libtool</span>, <span
- style="font-weight: bold;">autoconf</span> and <span
- style="font-weight: bold;">automake</span> tools
-are required for compiling the SoundTouch library. These are usually
-included with the GNU/Linux distribution, but if not, install these
-packages first. For example, Ubuntu Linux can acquire and install
-these with the following command:</p>
-<pre><b>sudo apt-get install automake autoconf libtool build-essential</b></pre>
-<h4><b>2.2.2 Problems with GCC compiler compatibility</b></h4>
-<p>At the release time the SoundTouch package has been tested to
-compile in GNU/Linux platform. However, If you have problems getting the
-SoundTouch library compiled, try disabling optimizations that are specific for 
-x86 processors by running <b>./configure</b> script with switch
-<blockquote>
-<pre>--enable-x86-optimizations=no</pre>
-</blockquote>
+  <p><strong>OpenMP NOTE</strong>: If activating the OpenMP parallel computing in
+    the compilation, the target program will require additional vcomp dll library to
+    properly run. In Visual C++ 9.0 these libraries can be found in the following
+    folders.</p>
+  <ul>
+    <li>x86 32bit: C:\Program Files (x86)\Microsoft Visual Studio
+      9.0\VC\redist\x86\Microsoft.VC90.OPENMP\vcomp90.dll</li>
+    <li>x64 64bit: C:\Program Files (x86)\Microsoft Visual Studio
+      9.0\VC\redist\amd64\Microsoft.VC90.OPENMP\vcomp90.dll</li>
+  </ul>
+  <p>In other VC++ versions the required library will be expectedly found in similar
+    &quot;redist&quot; location.</p>
+  <p>Notice that as minor demonstration of a &quot;dll hell&quot; phenomenon both the 32-bit
+    and 64-bit version of vcomp90.dll have the same filename but different contents,
+    thus choose the proper version to allow the program to start.</p>
+  <h3>2.2. Building in Gnu platforms</h3>
+  <p>The SoundTouch library compiles in practically any platform
+    supporting GNU compiler (GCC) tools.
+  <h4>2.2.1 Compiling with autotools</h4>
+  <p>To install build prerequisites for 'autotools' tool chain:</p>
+  <pre>    sudo apt-get install automake autoconf libtool build-essential</pre>
+  <p>To build and install the binaries, run the following commands in
+    /soundtouch directory:</p>
+  <table border="0" cellpadding="0" cellspacing="4">
+    <tbody>
+      <tr>
+        <td style="vertical-align: top;">
+          <pre>./bootstrap  -</pre>
+        </td>
+        <td style="vertical-align: top;">Creates "configure" file with
+          local autoconf/automake toolset.<br>
+        </td>
+      </tr>
+      <tr valign="top">
+        <td>
+          <pre>./configure  -</pre>
+        </td>
+        <td>
+          <p>Configures the SoundTouch package for the local environment.
+            Notice that "configure" file is not available before running the
+            "./bootstrap" command as above.<br>
+          </p>
+        </td>
+      </tr>
+      <tr valign="top">
+        <td>
+          <pre>make         -</pre>
+        </td>
+        <td>
+          <p>Builds the SoundTouch library &amp; SoundStretch utility. You can
+            optionally add &quot;-j&quot; switch after &quot;make&quot; to speed up the compilation in
+            multi-core systems.</p>
+        </td>
+      </tr>
+      <tr valign="top">
+        <td>
+          <pre>make install -</pre>
+        </td>
+        <td>
+          <p>Installs the SoundTouch &amp; BPM libraries to <b>/usr/local/lib</b>
+            and SoundStretch utility to <b>/usr/local/bin</b>. Please notice that
+            'root' privileges may be required to install the binaries to the
+            destination locations.</p>
+        </td>
+      </tr>
+    </tbody>
+  </table>
 
-Alternatively, if you don't use GNU Configure system, edit file "include/STTypes.h" 
-directly and remove the following definition:<blockquote>
-  <pre>#define SOUNDTOUCH_ALLOW_X86_OPTIMIZATIONS 1</pre>
-</blockquote>
+  <b>Compiling portable Shared Library / DLL version</b>
+  <p> The GNU autotools compilation does not automatically create a shared-library version of
+    SoundTouch (.so or .dll) that features position-independent code and C-language
+    api that are more suitable for cross-language development than C++ libraries.</p>
+  <p> Use script "make-gnu-dll-sh" to build a portable dynamic library version if such is desired.</p>
 
-<h4><b>2.2.3 Compiling Shared Library / DLL version in Cygwin</b></h4>
-    <p>
-        The GNU compilation does not automatically create a shared-library version of 
-        SoundTouch (.so or .dll). If such is desired, then you can create it as follows 
-        after running the usual compilation:</p>
-    <blockquote>
-  <pre>g++ -shared -static -DDLL_EXPORTS -I../../include -o SoundTouch.dll \
-     SoundTouchDLL.cpp ../SoundTouch/.libs/libSoundTouch.a
-sstrip SoundTouch.dll</pre>
-</blockquote>
+  <h4><b>2.2.2 Compiling with cmake</b></h4>
+  <p>'cmake' build scripts are provided as an alternative to the autotools toolchain.</p>
+  <p>To install cmake build prerequisites:</p>
+  <pre>    sudo apt-get install libtool build-essential cmake</pre>
+  <p>To build:</p>
+  <pre>
+    cmake .
+    make -j
+    make install</pre>
+  <p>To compile the additional portable Shared Library / DLL version with the native C-language API:</p>
+  <pre>
+    cmake . -DSOUNDTOUCH_DLL=ON
+    make -j
+    make install</pre>
 
-<h3>2.3. Building in Android</h3>
-<p>Android compilation instructions are within the 
-    source code package, see file &quot;<b>source/Android-lib/README-SoundTouch-Android.html</b>&quot; 
+  <h3>2.3. Building in Android</h3>
+  <p>Android compilation instructions are within the
+    source code package, see file &quot;<b>source/Android-lib/README-SoundTouch-Android.html</b>&quot;
     in the source code package. </p>
-<p>The Android compilation automatically builds separate .so library binaries 
-for ARM, X86 and MIPS processor architectures. For optimal device support, 
-include all these .so library binaries into the Android .apk application 
-package, so the target Android device can automatically choose the proper 
-library binary version to use.</p>
-<p>The <strong>source/Android-lib</strong> folder includes also an Android 
-example application that processes WAV audio files using SoundTouch library in 
-Android devices.</p>
+  <p>The Android compilation automatically builds separate .so library binaries
+    for ARM, X86 and MIPS processor architectures. For optimal device support,
+    include all these .so library binaries into the Android .apk application
+    package, so the target Android device can automatically choose the proper
+    library binary version to use.</p>
+  <p>The <strong>source/Android-lib</strong> folder includes also an Android
+    example application that processes WAV audio files using SoundTouch library in
+    Android devices.</p>
 
-<hr>
-<h2>3. About implementation &amp; Usage tips <h3>3.1. Supported sample data formats</h3>
-<p>The sample data format can be chosen between 16bit signed integer
-and 32bit floating point values. The default is 32bit floating point format, 
-which will also provide slightly better sound quality over the integer format. </p>
-<p> In Windows environment, the sample data format is chosen in file
-"STTypes.h" by choosing one of the following defines:</p>
-<ul>
-  <li> <span style="font-weight: bold;">#define
-SOUNDTOUCH_INTEGER_SAMPLES</span> for 16bit signed integer</li>
-  <li> <span style="font-weight: bold;">#define </span><span
- style="font-weight: bold;">SOUNDTOUCH_</span><span
- style="font-weight: bold;">FLOAT_SAMPLES</span> for 32bit floating
-point</li>
-</ul>
-<p> In GNU environment, the floating sample format is used by default,
-but integer sample format can be chosen by giving the following switch
-to the configure script: </p>
-<blockquote>
-  <pre>./configure --enable-integer-samples</pre>
-</blockquote>
-<p>The sample data can have either single (mono) or double (stereo)
-audio channel. Stereo data is interleaved so that every other data
-value is for left channel and every second for right channel. Notice
-that while it'd be possible in theory to process stereo sound as two
-separate mono channels, this isn't recommended because processing the
-channels separately would result in losing the phase coherency between
-the channels, which consequently would ruin the stereo effect.</p>
-<p>Sample rates between 8000-48000H are supported.</p>
-<h3>3.2. Processing latency</h3>
-<p>The processing and latency constraints of the SoundTouch library are:</p>
-<ul>
-  <li> Input/output processing latency for the SoundTouch processor is
-around 100 ms. This is when time-stretching is used. If the rate
-transposing effect alone is used, the latency requirement is much
-shorter, see section 'About algorithms'.</li>
-  <li> Processing CD-quality sound (16bit stereo sound with 44100H
-sample rate) in real-time or faster is possible starting from
-processors equivalent to Intel Pentium 133Mh or better, if using the
-"quick" processing algorithm. If not using the "quick" mode or if
-floating point sample data are being used, several times more CPU power
-is typically required.</li>
-</ul>
-<h3>3.3. About algorithms</h3>
-<p>SoundTouch provides three seemingly independent effects: tempo,
-pitch and playback rate control. These three controls are implemented
-as combination of two primary effects, <em>sample rate transposing</em>
-and <em>time-stretching</em>.</p>
-<p><em>Sample rate transposing</em> affects both the audio stream
-duration and pitch. It's implemented simply by converting the original
-audio sample stream to the desired duration by interpolating from
-the original audio samples. In SoundTouch, linear interpolation with
-anti-alias filtering is used. Theoretically a higher-order
-interpolation provide better result than 1st order linear
-interpolation, but in audio application linear interpolation together
-with anti-alias filtering performs subjectively about as well as
-higher-order filtering would.</p>
-<p><em>Time-stretching </em>means changing the audio stream duration
-without affecting it's pitch. SoundTouch uses WSOLA-like
-time-stretching routines that operate in the time domain. Compared to
-sample rate transposing, time-stretching is a much heavier operation
-and also requires a longer processing "window" of sound samples used by
-the processing algorithm, thus increasing the algorithm input/output
-latency. Typical i/o latency for the SoundTouch time-stretch algorithm
-is around 100 ms.</p>
-<p>Sample rate transposing and time-stretching are then used together
-to produce the tempo, pitch and rate controls:</p>
-<ul>
-  <li> <strong>'Tempo'</strong> control is implemented purely by
-time-stretching.</li>
-  <li> <strong>'Rate</strong>' control is implemented purely by sample
-rate transposing.</li>
-  <li> <strong>'Pitch</strong>' control is implemented as a
-combination of time-stretching and sample rate transposing. For
-example, to increase pitch the audio stream is first time-stretched to
-longer duration (without affecting pitch) and then transposed back to
-original duration by sample rate transposing, which simultaneously
-reduces duration and increases pitch. The result is original duration
-but increased pitch.</li>
-</ul>
-<h3>3.4 Tuning the algorithm parameters</h3>
-<p>The time-stretch algorithm has few parameters that can be tuned to
-optimize sound quality for certain application. The current default
-parameters have been chosen by iterative if-then analysis (read: "trial
-and error") to obtain best subjective sound quality in pop/rock music
-processing, but in applications processing different kind of sound the
-default parameter set may result into a sub-optimal result.</p>
-<p>The time-stretch algorithm default parameter values are set by the
-following #defines in file "TDStretch.h":</p>
-<blockquote>
-  <pre>#define DEFAULT_SEQUENCE_MS     AUTOMATIC<br>#define DEFAULT_SEEKWINDOW_MS   AUTOMATIC<br>#define DEFAULT_OVERLAP_MS      8</pre>
-</blockquote>
-<p>These parameters affect to the time-stretch algorithm as follows:</p>
-<ul>
-  <li> <strong>DEFAULT_SEQUENCE_MS</strong>: This is the default
-length of a single processing sequence in milliseconds which determines
-the how the original sound is chopped in the time-stretch algorithm.
-Larger values mean fewer sequences are used in processing. In principle
-a larger value sounds better when slowing down the tempo, but worse
-when increasing the tempo and vice versa.<br>
-    <br>
-By default, this setting value is calculated automatically according to
-tempo value.<br>
- </li>
-  <li> <strong>DEFAULT_SEEKWINDOW_MS</strong>: The seeking window
-default length in milliseconds is for the algorithm that seeks the best
-possible overlapping location. This determines from how wide a sample
-"window" the algorithm can use to find an optimal mixing location when
-the sound sequences are to be linked back together.<br>
-    <br>
-The bigger this window setting is, the higher the possibility to find a
-better mixing position becomes, but at the same time large values may
-cause a "drifting" sound artifact because neighboring sequences can be
-chosen at more uneven intervals. If there's a disturbing artifact that
-sounds as if a constant frequency was drifting around, try reducing
-this setting.<br>
-    <br>
-By default, this setting value is calculated automatically according to
-tempo value.<br>
- </li>
-  <li> <strong>DEFAULT_OVERLAP_MS</strong>: Overlap length in
-milliseconds. When the sound sequences are mixed back together to form
-again a continuous sound stream, this parameter defines how much the
-ends of the consecutive sequences will overlap with each other.<br>
-    <br>
-This shouldn't be that critical parameter. If you reduce the
-DEFAULT_SEQUENCE_MS setting by a large amount, you might wish to try a
-smaller value on this.</li>
-</ul>
-<p>Notice that these parameters can also be set during execution time
-with functions "<strong>TDStretch::setParameters()</strong>" and "<strong>SoundTouch::setSetting()</strong>".</p>
-<p>The table below summaries how the parameters can be adjusted for
-different applications:</p>
-<table border="1">
-  <tbody>
-    <tr>
-      <td valign="top"><strong>Parameter name</strong></td>
-      <td valign="top"><strong>Default value magnitude</strong></td>
-      <td valign="top"><strong>Larger value affects...</strong></td>
-      <td valign="top"><strong>Smaller value affects...</strong></td>
-      <td valign="top"><strong>Effect to CPU burden</strong></td>
-    </tr>
-    <tr>
-      <td valign="top">
-      <pre>SEQUENCE_MS</pre>
-      </td>
-      <td valign="top">Default value is relatively large, chosen for
-slowing down music tempo</td>
-      <td valign="top">Larger value is usually better for slowing down
-tempo. Growing the value decelerates the "echoing" artifact when
-slowing down the tempo.</td>
-      <td valign="top">Smaller value might be better for speeding up
-tempo. Reducing the value accelerates the "echoing" artifact when
-slowing down the tempo </td>
-      <td valign="top">Increasing the parameter value reduces
-computation burden</td>
-    </tr>
-    <tr>
-      <td valign="top">
-      <pre>SEEKWINDOW_MS</pre>
-      </td>
-      <td valign="top">Default value is relatively large, chosen for
-slowing down music tempo</td>
-      <td valign="top">Larger value eases finding a good mixing
-position, but may cause a "drifting" artifact</td>
-      <td valign="top">Smaller reduce possibility to find a good mixing
-position, but reduce the "drifting" artifact.</td>
-      <td valign="top">Increasing the parameter value increases
-computation burden</td>
-    </tr>
-    <tr>
-      <td valign="top">
-      <pre>OVERLAP_MS</pre>
-      </td>
-      <td valign="top">Default value is relatively large, chosen to
-suit with above parameters.</td>
-      <td valign="top"></td>
-      <td valign="top">If you reduce the "sequence ms" setting, you
-might wish to try a smaller value.</td>
-      <td valign="top">Increasing the parameter value increases
-computation burden</td>
-    </tr>
-  </tbody>
-</table>
-<h3>3.5 Performance Optimizations </h3>
-<p><strong>General optimizations:</strong></p>
-<p>The time-stretch routine has a 'quick' mode that substantially
-speeds up the algorithm but may slightly compromise the sound quality. 
-This mode is activated by calling SoundTouch::setSetting()
-function with parameter id of SETTING_USE_QUICKSEEK and value
-"1", i.e. </p>
-<blockquote>
-  <p>setSetting(SETTING_USE_QUICKSEEK, 1);</p>
-</blockquote>
-<p><strong>CPU-specific optimizations:</strong></p>
-<p>Intel x86 specific SIMD optimizations are implemented using compiler 
-intrinsics, providing about a 3x processing speedup for x86 compatible 
-processors vs. non-SIMD implementation:</p>
-<ul>
-    <li> Intel MMX optimized routines are used with x86 CPUs when 16bit integer 
-    sample type is used</li>
-  <li> Intel SSE optimized routines are used with x86 CPUs when 32bit floating 
-  point sample type is used</li>
-</ul>
-<h3>3.5 OpenMP parallel computation</h3>
-<p>SoundTouch 1.9 onwards support running the algorithms parallel in several CPU 
-cores. Based on benchmark the experienced multi-core processing speed-up gain 
-ranges between +30% (on a high-spec dual-core x86 Windows PC) to 215% (on a moderately low-spec 
-quad-core ARM of Raspberry Pi2). </p>
-<p>See an external blog article with more detailed discussion about the
-<a href="http://www.softwarecoven.com/parallel-computing-in-embedded-mobile-devices/">
-SoundTouch OpenMP optimization</a>.</p>
-<p>The parallel computing support is implemented using OpenMP spec 3.0 
-instructions. These instructions are supported by Visual C++ 2008 and later, and 
-GCC v4.2 and later. Compilers that do not supporting OpenMP will ignore these 
-optimizations and routines will still work properly. Possible warnings about 
-unknown #pragmas are related to OpenMP support and can be safely ignored.</p>
-<p>The OpenMP improvements are disabled by default, and need to be enabled by 
-developer during compile-time. Reason for this is that parallel processing adds 
-moderate runtime overhead in managing the multi-threading, so it may not be 
-necessary nor desirable in all applications. For example real-time processing 
-that is not constrained by CPU power will not benefit of speed-up provided by 
-the parallel processing, in the contrary it may increase power consumption due 
-to the increased overhead.</p>
-<p>However, applications that run on low-spec multi-core CPUs and may otherwise 
-have possibly constrained performance will benefit of the OpenMP improvements. 
-This include for example multi-core embedded devices.</p>
-<p>OpenMP parallel computation can be enabled before compiling SoundTouch 
-library as follows:</p>
-<ul>
-    <li><strong>Visual Studio</strong>: Open properties for the <strong>SoundTouch
-    </strong>sub-project, browse to <strong>C/C++</strong> and <strong>Language 
-    </strong>settings. Set 
-    there &quot;<strong>OpenMP support</strong>&quot; to &quot;<strong>Yes</strong>&quot;. Alternatively add 
-    <strong>/openmp</strong> switch to command-line 
-    parameters</li>
-    <li><strong>GNU</strong>: Run the configure script with &quot;<strong>./configure 
-    --enable-openmp</strong>&quot; switch, then run make as usually</li>
-    <li><strong>Android</strong>: Add &quot;<strong>-fopenmp</strong>&quot; switches to compiler &amp; linker 
-    options, see README-SoundTouch-Android.html in the source code package for 
-    more detailed instructions.</li>
-</ul>
-<hr>
-<h2><a name="SoundStretch"></a>4. SoundStretch audio processing utility
-</h2>
-<p>SoundStretch audio processing utility<br>
-    Copyright (c) Olli Parviainen 2002-2015</p>
-<p>SoundStretch is a simple command-line application that can change
-tempo, pitch and playback rates of WAV sound files. This program is
-intended primarily to demonstrate how the "SoundTouch" library can be
-used to process sound in your own program, but it can as well be used
-for processing sound files.</p>
-<h3>4.1. SoundStretch Usage Instructions</h3>
-<p>SoundStretch Usage syntax:</p>
-<blockquote>
-  <pre>soundstretch infilename outfilename [switches]</pre>
-</blockquote>
-<p>Where: </p>
-<table width="100%" border="0" cellpadding="2">
-  <tbody>
-    <tr>
-      <td valign="top">
-      <pre>"infilename"</pre>
-      </td>
-      <td valign="top">Name of the input sound data file (in .WAV audio
-file format). Give "stdin" as filename to use standard input pipe. </td>
-    </tr>
-    <tr>
-      <td valign="top">
-      <pre>"outfilename"</pre>
-      </td>
-      <td valign="top">Name of the output sound file where the
-resulting sound is saved (in .WAV audio file format). This parameter
-may be omitted if you don't want to save the output (e.g. when
-only calculating BPM rate with '-bpm' switch). Give "stdout" as
-filename to use standard output pipe.</td>
-    </tr>
-    <tr>
-      <td valign="top">
-      <pre>[switches]</pre>
-      </td>
-      <td valign="top">Are one or more control switches.</td>
-    </tr>
-  </tbody>
-</table>
-<p>Available control switches are:</p>
-<table width="100%" border="0" cellpadding="2">
-  <tbody>
-    <tr>
-      <td valign="top">
-      <pre>-tempo=n </pre>
-      </td>
-      <td valign="top">Change the sound tempo by n percents (n = -95.0
-.. +5000.0 %) </td>
-    </tr>
-    <tr>
-      <td valign="top">
-      <pre>-pitch=n</pre>
-      </td>
-      <td valign="top">Change the sound pitch by n semitones (n = -60.0
-.. + 60.0 semitones) </td>
-    </tr>
-    <tr>
-      <td valign="top">
-      <pre>-rate=n</pre>
-      </td>
-      <td valign="top">Change the sound playback rate by n percents (n
-= -95.0 .. +5000.0 %) </td>
-    </tr>
-    <tr>
-      <td valign="top">
-      <pre>-bpm=n</pre>
-      </td>
-      <td valign="top">Detect the Beats-Per-Minute (BPM) rate of the
-sound and adjust the tempo to meet 'n' BPMs. When this switch is
-applied, the "-tempo" switch is ignored. If "=n" is omitted, i.e.
-switch "-bpm" is used alone, then the BPM rate is estimated and
-displayed, but tempo not adjusted according to the BPM value. </td>
-    </tr>
-    <tr>
-      <td valign="top">
-      <pre>-quick</pre>
-      </td>
-      <td valign="top">Use quicker tempo change algorithm. Gains speed
-but loses sound quality. </td>
-    </tr>
-    <tr>
-      <td valign="top">
-      <pre>-naa</pre>
-      </td>
-      <td valign="top">Don't use anti-alias filtering in sample rate
-transposing. Gains speed but loses sound quality. </td>
-    </tr>
-    <tr>
-      <td valign="top">
-      <pre>-license</pre>
-      </td>
-      <td valign="top">Displays the program license text (LGPL)</td>
-    </tr>
-  </tbody>
-</table>
-<p>Notes:</p>
-<ul>
-  <li> To use standard input/output pipes for processing, give "stdin"
-and "stdout" as input/output filenames correspondingly. The standard
-input/output pipes will still carry the audio data in .wav audio file
-format.</li>
-  <li> The numerical switches allow both integer (e.g. "-tempo=123")
-and decimal (e.g. "-tempo=123.45") numbers.</li>
-  <li> The "-naa" and/or "-quick" switches can be used to reduce CPU
-usage while compromising some sound quality</li>
-  <li> The BPM detection algorithm works by detecting repeating bass or
-drum patterns at low frequencies of &lt;250Hz. A lower-than-expected
-BPM figure may be reported for music with uneven or complex bass
-patterns.</li>
-</ul>
-<h3>4.2. SoundStretch usage examples </h3>
-<p><strong>Example 1</strong></p>
-<p>The following command increases tempo of the sound file
-"originalfile.wav" by 12.5% and stores result to file
-"destinationfile.wav":</p>
-<blockquote>
-  <pre>soundstretch originalfile.wav destinationfile.wav -tempo=12.5</pre>
-</blockquote>
-<p><strong>Example 2</strong></p>
-<p>The following command decreases the sound pitch (key) of the sound
-file "orig.wav" by two semitones and stores the result to file
-"dest.wav":</p>
-<blockquote>
-  <pre>soundstretch orig.wav dest.wav -pitch=-2</pre>
-</blockquote>
-<p><strong>Example 3</strong></p>
-<p>The following command processes the file "orig.wav" by decreasing
-the sound tempo by 25.3% and increasing the sound pitch (key) by 1.5
-semitones. Resulting .wav audio data is directed to standard output
-pipe:</p>
-<blockquote>
-  <pre>soundstretch orig.wav stdout -tempo=-25.3 -pitch=1.5</pre>
-</blockquote>
-<p><strong>Example 4</strong></p>
-<p>The following command detects the BPM rate of the file "orig.wav"
-and adjusts the tempo to match 100 beats per minute. Result is stored
-to file "dest.wav":</p>
-<blockquote>
-  <pre>soundstretch orig.wav dest.wav -bpm=100</pre>
-</blockquote>
-<p><strong>Example 5</strong></p>
-<p>The following command reads .wav sound data from standard input pipe
-and estimates the BPM rate:</p>
-<blockquote>
-  <pre>soundstretch stdin -bpm</pre>
-</blockquote>
-<p><strong>Example 6</strong></p>
-<p>The following command tunes song from original 440Hz tuning to 432Hz tuning: 
-this corresponds to lowering the pitch by -0.318 semitones:</p>
-<blockquote>
-  <pre>soundstretch original.wav output.wav -pitch=-0.318</pre>
-</blockquote>
-<hr>
-<h2>5. Change History</h2>
-<h3>5.1. SoundTouch library Change History </h3>
+  <h3>2.4. Building in Mac</h3>
+  <p>Install autoconf tool as instructed in <a
+      href="http://macappstore.org/autoconf/">http://macappstore.org/autoconf/</a>, or alternatively the 'cmake' toolchain.</p>
+  <p>Then, build as described above in section "Building in Gnu platforms".</p>
+
+  <hr>
+  <h2>3. About implementation &amp; Usage tips <h3>3.1. Supported sample data formats</h3>
+    <p>The sample data format can be chosen between 16bit signed integer
+      and 32bit floating point values.</p>
+    </p> The default sample type is 32bit floating point format,
+    which also provides better sound quality than integer format because
+    integer algorithms need to scale already intermediate calculation results to
+    avoid integer overflows. These early integer scalings can slightly degrade
+    output quality.</p>
+    <p> In Windows environment, the sample data format is chosen in file
+      "STTypes.h" by choosing one of the following defines:</p>
+    <ul>
+      <li> <span style="font-weight: bold;">#define
+          SOUNDTOUCH_INTEGER_SAMPLES</span> for 16bit signed integer</li>
+      <li> <span style="font-weight: bold;">#define </span><span style="font-weight: bold;">SOUNDTOUCH_</span><span
+          style="font-weight: bold;">FLOAT_SAMPLES</span> for 32bit floating
+        point</li>
+    </ul>
+    <p> In GNU environment, the floating sample format is used by default,
+      but integer sample format can be chosen by giving the following switch
+      to the configure script: </p>
+    <blockquote>
+      <pre>./configure --enable-integer-samples</pre>
+    </blockquote>
+    <p>The sample data can have either single (mono) or double (stereo)
+      audio channel. Stereo data is interleaved so that every other data
+      value is for left channel and every second for right channel. Notice
+      that while it'd be possible in theory to process stereo sound as two
+      separate mono channels, this isn't recommended because processing the
+      channels separately would result in losing the phase coherency between
+      the channels, which consequently would ruin the stereo effect.</p>
+    <p>Sample rates between 8000-48000H are supported.</p>
+    <h3>3.2. Processing latency</h3>
+    <p>The processing and latency constraints of the SoundTouch library are:</p>
+    <ul>
+      <li> Input/output processing latency for the SoundTouch processor is
+        around 100 ms. This is when time-stretching is used. If the rate
+        transposing effect alone is used, the latency requirement is much
+        shorter, see section 'About algorithms'.</li>
+      <li> Processing CD-quality sound (16bit stereo sound with 44100H
+        sample rate) in real-time or faster is possible starting from
+        processors equivalent to Intel Pentium 133Mh or better, if using the
+        "quick" processing algorithm. If not using the "quick" mode or if
+        floating point sample data are being used, several times more CPU power
+        is typically required.</li>
+    </ul>
+    <h3>3.3. About algorithms</h3>
+    <p>SoundTouch provides three seemingly independent effects: tempo,
+      pitch and playback rate control. These three controls are implemented
+      as combination of two primary effects, <em>sample rate transposing</em>
+      and <em>time-stretching</em>.</p>
+    <p><em>Sample rate transposing</em> affects both the audio stream
+      duration and pitch. It's implemented simply by converting the original
+      audio sample stream to the desired duration by interpolating from
+      the original audio samples. In SoundTouch, linear interpolation with
+      anti-alias filtering is used. Theoretically a higher-order
+      interpolation provide better result than 1st order linear
+      interpolation, but in audio application linear interpolation together
+      with anti-alias filtering performs subjectively about as well as
+      higher-order filtering would.</p>
+    <p><em>Time-stretching </em>means changing the audio stream duration
+      without affecting it's pitch. SoundTouch uses WSOLA-like
+      time-stretching routines that operate in the time domain. Compared to
+      sample rate transposing, time-stretching is a much heavier operation
+      and also requires a longer processing "window" of sound samples used by
+      the processing algorithm, thus increasing the algorithm input/output
+      latency. Typical i/o latency for the SoundTouch time-stretch algorithm
+      is around 100 ms.</p>
+    <p>Sample rate transposing and time-stretching are then used together
+      to produce the tempo, pitch and rate controls:</p>
+    <ul>
+      <li> <strong>'Tempo'</strong> control is implemented purely by
+        time-stretching.</li>
+      <li> <strong>'Rate</strong>' control is implemented purely by sample
+        rate transposing.</li>
+      <li> <strong>'Pitch</strong>' control is implemented as a
+        combination of time-stretching and sample rate transposing. For
+        example, to increase pitch the audio stream is first time-stretched to
+        longer duration (without affecting pitch) and then transposed back to
+        original duration by sample rate transposing, which simultaneously
+        reduces duration and increases pitch. The result is original duration
+        but increased pitch.</li>
+    </ul>
+    <h3>3.4 Tuning the algorithm parameters</h3>
+    <p>The time-stretch algorithm has few parameters that can be tuned to
+      optimize sound quality for certain application. The current default
+      parameters have been chosen by iterative if-then analysis (read: "trial
+      and error") to obtain best subjective sound quality in pop/rock music
+      processing, but in applications processing different kind of sound the
+      default parameter set may result into a sub-optimal result.</p>
+    <p>The time-stretch algorithm default parameter values are set by the
+      following #defines in file "TDStretch.h":</p>
+    <blockquote>
+      <pre>#define DEFAULT_SEQUENCE_MS     AUTOMATIC<br>#define DEFAULT_SEEKWINDOW_MS   AUTOMATIC<br>#define DEFAULT_OVERLAP_MS      8</pre>
+    </blockquote>
+    <p>These parameters affect to the time-stretch algorithm as follows:</p>
+    <ul>
+      <li> <strong>DEFAULT_SEQUENCE_MS</strong>: This is the default
+        length of a single processing sequence in milliseconds which determines
+        the how the original sound is chopped in the time-stretch algorithm.
+        Larger values mean fewer sequences are used in processing. In principle
+        a larger value sounds better when slowing down the tempo, but worse
+        when increasing the tempo and vice versa.<br>
+        <br>
+        By default, this setting value is calculated automatically according to
+        tempo value.<br>
+      </li>
+      <li> <strong>DEFAULT_SEEKWINDOW_MS</strong>: The seeking window
+        default length in milliseconds is for the algorithm that seeks the best
+        possible overlapping location. This determines from how wide a sample
+        "window" the algorithm can use to find an optimal mixing location when
+        the sound sequences are to be linked back together.<br>
+        <br>
+        The bigger this window setting is, the higher the possibility to find a
+        better mixing position becomes, but at the same time large values may
+        cause a "drifting" sound artifact because neighboring sequences can be
+        chosen at more uneven intervals. If there's a disturbing artifact that
+        sounds as if a constant frequency was drifting around, try reducing
+        this setting.<br>
+        <br>
+        By default, this setting value is calculated automatically according to
+        tempo value.<br>
+      </li>
+      <li> <strong>DEFAULT_OVERLAP_MS</strong>: Overlap length in
+        milliseconds. When the sound sequences are mixed back together to form
+        again a continuous sound stream, this parameter defines how much the
+        ends of the consecutive sequences will overlap with each other.<br>
+        <br>
+        This shouldn't be that critical parameter. If you reduce the
+        DEFAULT_SEQUENCE_MS setting by a large amount, you might wish to try a
+        smaller value on this.
+      </li>
+    </ul>
+    <p>Notice that these parameters can also be set during execution time
+      with functions "<strong>TDStretch::setParameters()</strong>" and "<strong>SoundTouch::setSetting()</strong>".</p>
+    <p>The table below summaries how the parameters can be adjusted for
+      different applications:</p>
+    <table border="1">
+      <tbody>
+        <tr>
+          <td valign="top"><strong>Parameter name</strong></td>
+          <td valign="top"><strong>Default value magnitude</strong></td>
+          <td valign="top"><strong>Larger value affects...</strong></td>
+          <td valign="top"><strong>Smaller value affects...</strong></td>
+          <td valign="top"><strong>Effect to CPU burden</strong></td>
+        </tr>
+        <tr>
+          <td valign="top">
+            <pre>SEQUENCE_MS</pre>
+          </td>
+          <td valign="top">Default value is relatively large, chosen for
+            slowing down music tempo</td>
+          <td valign="top">Larger value is usually better for slowing down
+            tempo. Growing the value decelerates the "echoing" artifact when
+            slowing down the tempo.</td>
+          <td valign="top">Smaller value might be better for speeding up
+            tempo. Reducing the value accelerates the "echoing" artifact when
+            slowing down the tempo </td>
+          <td valign="top">Increasing the parameter value reduces
+            computation burden</td>
+        </tr>
+        <tr>
+          <td valign="top">
+            <pre>SEEKWINDOW_MS</pre>
+          </td>
+          <td valign="top">Default value is relatively large, chosen for
+            slowing down music tempo</td>
+          <td valign="top">Larger value eases finding a good mixing
+            position, but may cause a "drifting" artifact</td>
+          <td valign="top">Smaller reduce possibility to find a good mixing
+            position, but reduce the "drifting" artifact.</td>
+          <td valign="top">Increasing the parameter value increases
+            computation burden</td>
+        </tr>
+        <tr>
+          <td valign="top">
+            <pre>OVERLAP_MS</pre>
+          </td>
+          <td valign="top">Default value is relatively large, chosen to
+            suit with above parameters.</td>
+          <td valign="top"></td>
+          <td valign="top">If you reduce the "sequence ms" setting, you
+            might wish to try a smaller value.</td>
+          <td valign="top">Increasing the parameter value increases
+            computation burden</td>
+        </tr>
+      </tbody>
+    </table>
+    <h3>3.5 Performance Optimizations </h3>
+    <p><strong>Integer vs floating point:</strong></p>
+    <p>Floating point sample type is generally recommended because it provides
+      better sound quality.</p>
+
+    <p>However, execution speed difference between integer and floating point processing
+      depends on the CPU architecture. As rule of thumb,
+    <ul>
+      <li>in 32-bit x86 floating point and integer are roughly equally fast</li>
+      <li>in 64-bit x86/x64 floating point can be significantly faster than integer
+        version, because MMX integer optimizations are not available in the x64 architecture.
+        That depends on the compiler however, so that gcc can autovectorize integer routines
+        to work equally fast as floating point, where as Visual C++ (2017) does not
+        perform equally well and produces integer code that runs some 3x slower than
+        SSE-optimized floating poing code.
+      </li>
+      <li>in ARMv7 integer routines are twice as fast as floating point. Their
+        relative difference is roughly the same both with and without NEON; NEON
+        vfpu can however bring 2.4x speed improvement.
+      </li>
+      <li>in other platforms: try out if the execution time performance makes a
+        big difference</li>
+    </ul>
+    </p>
+    <p><strong>General optimizations:</strong></p>
+    <p>The time-stretch routine has a 'quick' mode that substantially
+      speeds up the algorithm but may slightly compromise the sound quality.
+      This mode is activated by calling SoundTouch::setSetting()
+      function with parameter id of SETTING_USE_QUICKSEEK and value
+      "1", i.e. </p>
+    <blockquote>
+      <p>setSetting(SETTING_USE_QUICKSEEK, 1);</p>
+    </blockquote>
+    <p><strong>CPU-specific optimizations:</strong></p>
+    <p>Intel x86 specific SIMD optimizations are implemented using compiler
+      intrinsics, providing about a 3x processing speedup for x86 compatible
+      processors vs. non-SIMD implementation:</p>
+    <ul>
+      <li> MMX optimized routines are used in 32-bit x86 build when 16bit integer
+        sample type is used</li>
+      <li> SSE optimized routines are used in 32- and 64-bit x86 CPUs when 32bit
+        floating point sample type is used</li>
+    </ul>
+    <p>The algorithms are tuned to utilize autovectorization efficiently
+      also in other CPU architectures, for example ARM cpus see approx 2.4x processing
+      speedup when NEON SIMD support is present.
+    </p>
+    <h3>3.5 OpenMP parallel computation</h3>
+    <p>SoundTouch 1.9 onwards support running the algorithms parallel in several CPU
+      cores. Based on benchmark the experienced multi-core processing speed-up gain
+      ranges between +30% (on a high-spec dual-core x86 Windows PC) to 215% (on a moderately low-spec
+      quad-core ARM of Raspberry Pi2). </p>
+    <p>See an external blog article with more detailed discussion about the
+      <a href="http://www.softwarecoven.com/parallel-computing-in-embedded-mobile-devices/">
+        SoundTouch OpenMP optimization</a>.
+    </p>
+    <p>The parallel computing support is implemented using OpenMP spec 3.0
+      instructions. These instructions are supported by Visual C++ 2008 and later, and
+      GCC v4.2 and later. Compilers that do not supporting OpenMP will ignore these
+      optimizations and routines will still work properly. Possible warnings about
+      unknown #pragmas are related to OpenMP support and can be safely ignored.</p>
+    <p>The OpenMP improvements are disabled by default, and need to be enabled by
+      developer during compile-time. Reason for this is that parallel processing adds
+      moderate runtime overhead in managing the multi-threading, so it may not be
+      necessary nor desirable in all applications. For example real-time processing
+      that is not constrained by CPU power will not benefit of speed-up provided by
+      the parallel processing, in the contrary it may increase power consumption due
+      to the increased overhead.</p>
+    <p>However, applications that run on low-spec multi-core CPUs and may otherwise
+      have possibly constrained performance will benefit of the OpenMP improvements.
+      This include for example multi-core embedded devices.</p>
+    <p>OpenMP parallel computation can be enabled before compiling SoundTouch
+      library as follows:</p>
+    <ul>
+      <li><strong>Visual Studio</strong>: Open properties for the <strong>SoundTouch
+        </strong>sub-project, browse to <strong>C/C++</strong> and <strong>Language
+        </strong>settings. Set
+        there &quot;<strong>OpenMP support</strong>&quot; to &quot;<strong>Yes</strong>&quot;. Alternatively add
+        <strong>/openmp</strong> switch to command-line
+        parameters
+      </li>
+      <li><strong>GNU</strong>: Run the configure script with &quot;<strong>./configure
+          --enable-openmp</strong>&quot; switch, then run make as usually</li>
+      <li><strong>Android</strong>: Add &quot;<strong>-fopenmp</strong>&quot; switches to compiler &amp; linker
+        options, see README-SoundTouch-Android.html in the source code package for
+        more detailed instructions.</li>
+    </ul>
+    <hr>
+    <h2><a name="SoundStretch"></a>4. SoundStretch audio processing utility
+    </h2>
+    <p>SoundStretch audio processing utility<br>
+      Copyright (c) Olli Parviainen 2002-2015</p>
+    <p>SoundStretch is a simple command-line application that can change
+      tempo, pitch and playback rates of WAV sound files. This program is
+      intended primarily to demonstrate how the "SoundTouch" library can be
+      used to process sound in your own program, but it can as well be used
+      for processing sound files.</p>
+    <h3>4.1. SoundStretch Usage Instructions</h3>
+    <p>SoundStretch Usage syntax:</p>
+    <blockquote>
+      <pre>soundstretch infilename outfilename [switches]</pre>
+    </blockquote>
+    <p>Where: </p>
+    <table width="100%" border="0" cellpadding="2">
+      <tbody>
+        <tr>
+          <td valign="top">
+            <pre>"infilename"</pre>
+          </td>
+          <td valign="top">Name of the input sound data file (in .WAV audio
+            file format). Give "stdin" as filename to use standard input pipe. </td>
+        </tr>
+        <tr>
+          <td valign="top">
+            <pre>"outfilename"</pre>
+          </td>
+          <td valign="top">Name of the output sound file where the
+            resulting sound is saved (in .WAV audio file format). This parameter
+            may be omitted if you don't want to save the output (e.g. when
+            only calculating BPM rate with '-bpm' switch). Give "stdout" as
+            filename to use standard output pipe.</td>
+        </tr>
+        <tr>
+          <td valign="top">
+            <pre>[switches]</pre>
+          </td>
+          <td valign="top">Are one or more control switches.</td>
+        </tr>
+      </tbody>
+    </table>
+    <p>Available control switches are:</p>
+    <table width="100%" border="0" cellpadding="2">
+      <tbody>
+        <tr>
+          <td valign="top">
+            <pre>-tempo=n </pre>
+          </td>
+          <td valign="top">Change the sound tempo by n percents (n = -95.0
+            .. +5000.0 %) </td>
+        </tr>
+        <tr>
+          <td valign="top">
+            <pre>-pitch=n</pre>
+          </td>
+          <td valign="top">Change the sound pitch by n semitones (n = -60.0
+            .. + 60.0 semitones) </td>
+        </tr>
+        <tr>
+          <td valign="top">
+            <pre>-rate=n</pre>
+          </td>
+          <td valign="top">Change the sound playback rate by n percents (n
+            = -95.0 .. +5000.0 %) </td>
+        </tr>
+        <tr>
+          <td valign="top">
+            <pre>-bpm=n</pre>
+          </td>
+          <td valign="top">Detect the Beats-Per-Minute (BPM) rate of the
+            sound and adjust the tempo to meet 'n' BPMs. When this switch is
+            applied, the "-tempo" switch is ignored. If "=n" is omitted, i.e.
+            switch "-bpm" is used alone, then the BPM rate is estimated and
+            displayed, but tempo not adjusted according to the BPM value. </td>
+        </tr>
+        <tr>
+          <td valign="top">
+            <pre>-quick</pre>
+          </td>
+          <td valign="top">Use quicker tempo change algorithm. Gains speed
+            but loses sound quality. </td>
+        </tr>
+        <tr>
+          <td valign="top">
+            <pre>-naa</pre>
+          </td>
+          <td valign="top">Don't use anti-alias filtering in sample rate
+            transposing. Gains speed but loses sound quality. </td>
+        </tr>
+        <tr>
+          <td valign="top">
+            <pre>-license</pre>
+          </td>
+          <td valign="top">Displays the program license text (LGPL)</td>
+        </tr>
+      </tbody>
+    </table>
+    <p>Notes:</p>
+    <ul>
+      <li> To use standard input/output pipes for processing, give "stdin"
+        and "stdout" as input/output filenames correspondingly. The standard
+        input/output pipes will still carry the audio data in .wav audio file
+        format.</li>
+      <li> The numerical switches allow both integer (e.g. "-tempo=123")
+        and decimal (e.g. "-tempo=123.45") numbers.</li>
+      <li> The "-naa" and/or "-quick" switches can be used to reduce CPU
+        usage while compromising some sound quality</li>
+      <li> The BPM detection algorithm works by detecting repeating bass or
+        drum patterns at low frequencies of &lt;250Hz. A lower-than-expected
+        BPM figure may be reported for music with uneven or complex bass
+        patterns.</li>
+    </ul>
+    <h3>4.2. SoundStretch usage examples </h3>
+    <p><strong>Example 1</strong></p>
+    <p>The following command increases tempo of the sound file
+      "originalfile.wav" by 12.5% and stores result to file
+      "destinationfile.wav":</p>
+    <blockquote>
+      <pre>soundstretch originalfile.wav destinationfile.wav -tempo=12.5</pre>
+    </blockquote>
+    <p><strong>Example 2</strong></p>
+    <p>The following command decreases the sound pitch (key) of the sound
+      file "orig.wav" by two semitones and stores the result to file
+      "dest.wav":</p>
+    <blockquote>
+      <pre>soundstretch orig.wav dest.wav -pitch=-2</pre>
+    </blockquote>
+    <p><strong>Example 3</strong></p>
+    <p>The following command processes the file "orig.wav" by decreasing
+      the sound tempo by 25.3% and increasing the sound pitch (key) by 1.5
+      semitones. Resulting .wav audio data is directed to standard output
+      pipe:</p>
+    <blockquote>
+      <pre>soundstretch orig.wav stdout -tempo=-25.3 -pitch=1.5</pre>
+    </blockquote>
+    <p><strong>Example 4</strong></p>
+    <p>The following command detects the BPM rate of the file "orig.wav"
+      and adjusts the tempo to match 100 beats per minute. Result is stored
+      to file "dest.wav":</p>
+    <blockquote>
+      <pre>soundstretch orig.wav dest.wav -bpm=100</pre>
+    </blockquote>
+    <p><strong>Example 5</strong></p>
+    <p>The following command reads .wav sound data from standard input pipe
+      and estimates the BPM rate:</p>
+    <blockquote>
+      <pre>soundstretch stdin -bpm</pre>
+    </blockquote>
+    <p><strong>Example 6</strong></p>
+    <p>The following command tunes song from original 440Hz tuning to 432Hz tuning:
+      this corresponds to lowering the pitch by -0.318 semitones:</p>
+    <blockquote>
+      <pre>soundstretch original.wav output.wav -pitch=-0.318</pre>
+    </blockquote>
+    <hr>
+    <h2>5. Change History</h2>
+    <h3>5.1. SoundTouch library Change History </h3>
+    <p><b>2.3.1:</b></p>
+    <ul>
+      <li>Adjusted cmake build settings and header files that cmake installs</li>
+    </ul>
+    <p><b>2.3.0:</b></p>
+    <ul>
+      <li>Disable setting "SOUNDTOUCH_ALLOW_NONEXACT_SIMD_OPTIMIZATION" by default. The original
+        purpose of this setting was to avoid performance penalty due to unaligned SIMD memory
+        accesses in old CPUs, but that is not any more issue in concurrent CPU SIMD implementations
+        and having this setting enabled can cause slight compromise in result quality.
+      </li>
+      <li>Bugfix: soundtouch.clear() to really clear whole processing pipeline state. Earlier
+        individual variables were left uncleared, which caused slightly different result if
+        the same audio stream were processed again after calling clear().
+      </li>
+      <li>Bugfix: TDstretch to align initial offset position to be in middle of correlation search
+        window. This ensures that with zero tempo change the output will be same as input.
+      </li>
+      <li>Bugfix: Fix a bug in TDstrectch with too small initial skipFract value that occurred
+        with certain processing parameter settings: Replace assert with assignment that 
+        corrects the situation.
+      </li>
+      <li>Remove OpenMP "_init_threading" workaround from Android build as it's not needed with concurrent
+        Android SDKs any more.</li>
+    </ul>
+    <p><b>2.2:</b></p>
+    <ul>
+      <li>Improved source codes so that compiler can autovectorize them more effectively.
+        This brings remarkable improvement e.g. ARM cpus equipped with NEON vfpu: Bencmarked
+        2.4x improvement in execution speed in ARMv7l vs the previous SoundTouch version
+        for both integer and floating point sample types.
+      </li>
+      <li>Bugfix: Resolved bad sound quality when using integer sample types in non-x86 CPU</li>
+      <li>Bugfix: Fixed possible reading past end of array in BPM peak detection algorithm</li>
+    </ul>
     <p><b>2.1.2:</b></p>
     <ul>
       <li>Bump version to 2.1.2 also in configure.ac. The earlier release had old version info for GNU autotools.</li>
     </ul>
     <p><b>2.1.1:</b></p>
     <ul>
-      <li>Bugfixes: Fixed potential buffer overwrite bugs in WavFile routines. Replaced asserts with runtime exceptions.</li>
+      <li>Bugfixes: Fixed potential buffer overwrite bugs in WavFile routines. Replaced asserts with runtime exceptions.
+      </li>
       <li>Android: Migrated the SoundTouch Android example to new Android Studio</li>
       <li>Automake: unset ACLOCAL in bootstrap script in case earlier build script has set it</li>
 
@@ -589,10 +653,10 @@ this corresponds to lowering the pitch by -0.318 semitones:</p>
     <p><b>2.1:</b></p>
     <ul>
       <li>Refactored C# interface example</li>
-      <li>Disable anti-alias filter when switch 
-      SOUNDTOUCH_PREVENT_CLICK_AT_RATE_CROSSOVER defined because anti-alias 
-      filter cause slight click if the rate change crosses zero during 
-      processing</li>
+      <li>Disable anti-alias filter when switch
+        SOUNDTOUCH_PREVENT_CLICK_AT_RATE_CROSSOVER defined because anti-alias
+        filter cause slight click if the rate change crosses zero during
+        processing</li>
       <li>Added script for building SoundTouchDll dynamic-link-library for GNU platforms</li>
       <li>Rewrote Beats-per-Minute analysis algorithm for more reliable BPM detection</li>
       <li>Added BPM functions to SoundTouchDll API</li>
@@ -602,326 +666,334 @@ this corresponds to lowering the pitch by -0.318 semitones:</p>
     </ul>
     <p><b>2.0:</b></p>
     <ul>
-        <li>Added functions to get initial processing latency, duration ratio between the original input and processed output tracks, and clarified reporting of input/output batch sizes</li>
-        <li>Fixed issue that added brief sequence of silence to beginning of output audio</li>
-        <li>Adjusted algorithm parameters to reduce reverberating effect at tempo slowdown</li>
-        <li>Bugfix: Fixed a glitch that could cause negative array indexing in quick seek algorithm</li>
-        <li>Bugfix: flush() didn't properly flush final samples from the pipeline on 2nd time in case that soundtouch object instance was recycled and used for processing a second audio stream.</li>
-        <li>Bugfix: Pi value had incorrect 9th/10th decimals</li>
-        <li>Added C# example application that uses SoundTouch dll library for processing MP3 files</li>
+      <li>Added functions to get initial processing latency, duration ratio between the original input and processed
+        output tracks, and clarified reporting of input/output batch sizes</li>
+      <li>Fixed issue that added brief sequence of silence to beginning of output audio</li>
+      <li>Adjusted algorithm parameters to reduce reverberating effect at tempo slowdown</li>
+      <li>Bugfix: Fixed a glitch that could cause negative array indexing in quick seek algorithm</li>
+      <li>Bugfix: flush() didn't properly flush final samples from the pipeline on 2nd time in case that soundtouch
+        object instance was recycled and used for processing a second audio stream.</li>
+      <li>Bugfix: Pi value had incorrect 9th/10th decimals</li>
+      <li>Added C# example application that uses SoundTouch dll library for processing MP3 files</li>
     </ul>
     <p><b>1.9.2:</b></p>
     <ul>
-        <li>Fix in GNU package configuration</li>
+      <li>Fix in GNU package configuration</li>
     </ul>
     <p><b>1.9.1:</b></p>
     <ul>
-        <li>Improved SoundTouch::flush() function so that it returns precisely the desired amount of samples for exact output duration control</li>
-        <li>Redesigned quickseek algorithm for improved sound quality when using the quickseek mode. The new quickseek algorithm can find 99% as good results as the 
-        default full-scan mode, while the quickseek algorithm is remarkable less 
+      <li>Improved SoundTouch::flush() function so that it returns precisely the desired amount of samples for exact
+        output duration control</li>
+      <li>Redesigned quickseek algorithm for improved sound quality when using the quickseek mode. The new quickseek
+        algorithm can find 99% as good results as the
+        default full-scan mode, while the quickseek algorithm is remarkable less
         CPU intensive.</li>
-        <li>Added adaptive integer divider scaling for improved sound quality when using integer processing algorithm
-        </li>
+      <li>Added adaptive integer divider scaling for improved sound quality when using integer processing algorithm
+      </li>
     </ul>
-<p><b>1.9:</b></p>
-<ul>
-    <li>Added support for parallel computation support via OpenMP primitives for better performance in multicore systems. 
-        Benchmarks show that achieved parallel processing speedup improvement 
-    typically range from +30% (x86 dual-core) to +180% (ARM quad-core). The 
-    OpenMP optimizations are disabled by default, see OpenMP notes above in this 
-    readme file how to enabled these optimizations.</li>
-    <li>Android: Added support for Android devices featuring X86 and MIPS CPUs, 
-    in addition to ARM CPUs.</li>
-    <li>Android: More versatile Android example application that processes WAV 
-    audio files with SoundTouch library</li>
-    <li>Replaced Windows-like 'BOOL' types with native 'bool'</li>
-    <li>Changed documentation token to "dist_doc_DATA" in Makefile.am file</li>
-    <li>Miscellaneous small fixes and improvements</li>
-</ul>
-<p><b>1.8.0:</b></p>
-<ul>
-    <li>Added support for multi-channel audio processing</li>
-    <li>Added support for <b>cubic</b> and <b>shannon</b> interpolation for rate and pitch shift effects besides 
-        the original <b>linear</b> interpolation, to reduce aliasing at high frequencies due to interpolation.
-        Cubic interpolation is used as default for floating point processing, and linear interpolation for integer 
-        processing.</li>
-    <li>Fixed bug in anti-alias filtering that limited stop-band attenuation to -10 dB instead of <-50dB, and
-        increased filter length from 32 to 64 taps to further reduce aliasing due to frequency folding.</li>
-    <li>Performance improvements in cross-correlation algorithm</li>
-    <li>Other bug and compatibility fixes</li>
-</ul>
-<p><b>1.7.1:</b></p>
-<ul>
-    <li>Added files for Android compilation
-</ul>
-<p><b>1.7.0:</b></p>
-<ul>
-    <li>Sound quality improvements/li>
-    <li>Improved flush() to adjust output sound stream duration to match better with 
-        ideal duration</li>
-    <li>Rewrote x86 cpu feature check to resolve compatibility problems</li>
-    <li>Configure script automatically checks if CPU supports mmx & sse compatibility for GNU platform, and
-    the script support now "--enable-x86-optimizations" switch to allow disabling x86-specific optimizations.</li>
-    <li>Revised #define conditions for 32bit/64bit compatibility</li>
-    <li>gnu autoconf/automake script compatibility fixes</li>
-    <li>Tuned beat-per-minute detection algorithm</li>
-</ul>
-<p><b>1.6.0:</b></p>
-<ul>
-  <li> Added automatic cutoff threshold adaptation to beat detection
-routine to better adapt BPM calculation to different types of music</li>
-  <li> Retired 3DNow! optimization support as 3DNow! is nowadays
-obsoleted and assembler code is nuisance to maintain</li>
-  <li>Retired "configure" file from source code package due to
-autoconf/automake versio conflicts, so that it is from now on to be
-generated by invoking "boostrap" script that uses locally available
-toolchain version for generating the "configure" file</li>
-  <li>Resolved namespace/label naming conflicts with other libraries by
-replacing global labels such as INTEGER_SAMPLES with more specific
-SOUNDTOUCH_INTEGER_SAMPLES etc.<br>
- </li>
-  <li>Updated windows build scripts &amp; project files for Visual
-Studio 2008 support</li>
-  <li> Updated SoundTouch.dll API for .NET compatibility</li>
-  <li> Added API for querying nominal processing input &amp; output
-sample batch sizes</li>
-</ul>
-<p><strong>1.5.0:</strong></p>
-<ul>
-  <li> Added normalization to correlation calculation and improvement
-automatic seek/sequence parameter calculation to improve sound quality</li>
-  <li> Bugfixes:
+    <p><b>1.9:</b></p>
     <ul>
-      <li> Fixed negative array indexing in quick seek algorithm</li>
-      <li> FIR autoalias filter running too far in processing buffer</li>
-      <li> Check against zero sample count in rate transposing</li>
-      <li> Fix for x86-64 support: Removed pop/push instructions from
-the cpu detection algorithm.</li>
-      <li> Check against empty buffers in FIFOSampleBuffer</li>
-      <li> Other minor fixes &amp; code cleanup</li>
+      <li>Added support for parallel computation support via OpenMP primitives for better performance in multicore
+        systems.
+        Benchmarks show that achieved parallel processing speedup improvement
+        typically range from +30% (x86 dual-core) to +180% (ARM quad-core). The
+        OpenMP optimizations are disabled by default, see OpenMP notes above in this
+        readme file how to enabled these optimizations.</li>
+      <li>Android: Added support for Android devices featuring X86 and MIPS CPUs,
+        in addition to ARM CPUs.</li>
+      <li>Android: More versatile Android example application that processes WAV
+        audio files with SoundTouch library</li>
+      <li>Replaced Windows-like 'BOOL' types with native 'bool'</li>
+      <li>Changed documentation token to "dist_doc_DATA" in Makefile.am file</li>
+      <li>Miscellaneous small fixes and improvements</li>
+    </ul>
+    <p><b>1.8.0:</b></p>
+    <ul>
+      <li>Added support for multi-channel audio processing</li>
+      <li>Added support for <b>cubic</b> and <b>shannon</b> interpolation for rate and pitch shift effects besides
+        the original <b>linear</b> interpolation, to reduce aliasing at high frequencies due to interpolation.
+        Cubic interpolation is used as default for floating point processing, and linear interpolation for integer
+        processing.</li>
+      <li>Fixed bug in anti-alias filtering that limited stop-band attenuation to -10 dB instead of <-50dB, and
+          increased filter length from 32 to 64 taps to further reduce aliasing due to frequency folding.</li>
+      <li>Performance improvements in cross-correlation algorithm</li>
+      <li>Other bug and compatibility fixes</li>
+    </ul>
+    <p><b>1.7.1:</b></p>
+    <ul>
+      <li>Added files for Android compilation
+    </ul>
+    <p><b>1.7.0:</b></p>
+    <ul>
+      <li>Sound quality improvements/li>
+      <li>Improved flush() to adjust output sound stream duration to match better with
+        ideal duration</li>
+      <li>Rewrote x86 cpu feature check to resolve compatibility problems</li>
+      <li>Configure script automatically checks if CPU supports mmx & sse compatibility for GNU platform, and
+        the script support now "--enable-x86-optimizations" switch to allow disabling x86-specific optimizations.</li>
+      <li>Revised #define conditions for 32bit/64bit compatibility</li>
+      <li>gnu autoconf/automake script compatibility fixes</li>
+      <li>Tuned beat-per-minute detection algorithm</li>
+    </ul>
+    <p><b>1.6.0:</b></p>
+    <ul>
+      <li> Added automatic cutoff threshold adaptation to beat detection
+        routine to better adapt BPM calculation to different types of music</li>
+      <li> Retired 3DNow! optimization support as 3DNow! is nowadays
+        obsoleted and assembler code is nuisance to maintain</li>
+      <li>Retired "configure" file from source code package due to
+        autoconf/automake versio conflicts, so that it is from now on to be
+        generated by invoking "boostrap" script that uses locally available
+        toolchain version for generating the "configure" file</li>
+      <li>Resolved namespace/label naming conflicts with other libraries by
+        replacing global labels such as INTEGER_SAMPLES with more specific
+        SOUNDTOUCH_INTEGER_SAMPLES etc.<br>
+      </li>
+      <li>Updated windows build scripts &amp; project files for Visual
+        Studio 2008 support</li>
+      <li> Updated SoundTouch.dll API for .NET compatibility</li>
+      <li> Added API for querying nominal processing input &amp; output
+        sample batch sizes</li>
+    </ul>
+    <p><strong>1.5.0:</strong></p>
+    <ul>
+      <li> Added normalization to correlation calculation and improvement
+        automatic seek/sequence parameter calculation to improve sound quality</li>
+      <li> Bugfixes:
+        <ul>
+          <li> Fixed negative array indexing in quick seek algorithm</li>
+          <li> FIR autoalias filter running too far in processing buffer</li>
+          <li> Check against zero sample count in rate transposing</li>
+          <li> Fix for x86-64 support: Removed pop/push instructions from
+            the cpu detection algorithm.</li>
+          <li> Check against empty buffers in FIFOSampleBuffer</li>
+          <li> Other minor fixes &amp; code cleanup</li>
+        </ul>
+      </li>
+      <li> Fixes in compilation scripts for non-Intel platforms</li>
+      <li> Added Dynamic-Link-Library (DLL) version of SoundTouch library
+        build, provided with Delphi/Pascal wrapper for calling the dll routines
+      </li>
+      <li> Added #define PREVENT_CLICK_AT_RATE_CROSSOVER that prevents a
+        click artifact when crossing the nominal pitch from either positive to
+        negative side or vice versa</li>
+    </ul>
+    <p><strong>1.4.1:</strong></p>
+    <ul>
+      <li> Fixed a buffer overflow bug in BPM detect algorithm routines if
+        processing more than 2048 samples at one call</li>
+    </ul>
+    <p><strong>1.4.0:</strong></p>
+    <ul>
+      <li> Improved sound quality by automatic calculation of time stretch
+        algorithm processing parameters according to tempo setting</li>
+      <li> Moved BPM detection routines from SoundStretch application into
+        SoundTouch library</li>
+      <li> Bugfixes: Usage of uninitialied variables, GNU build scripts,
+        compiler errors due to 'const' keyword mismatch.</li>
+      <li> Source code cleanup</li>
+    </ul>
+    <p><strong>1.3.1: </strong> </p>
+    <ul>
+      <li> Changed static class declaration to GCC 4.x compiler compatible
+        syntax.</li>
+      <li> Enabled MMX/SSE-optimized routines also for GCC compilers.
+        Earlier the MMX/SSE-optimized routines were written in
+        compiler-specific inline assembler, now these routines are migrated to
+        use compiler intrinsic syntax which allows compiling the same
+        MMX/SSE-optimized source code with both Visual C++ and GCC compilers.</li>
+      <li> Set floating point as the default sample format and added switch
+        to the GNU configure script for selecting the other sample format.</li>
+    </ul>
+    <p><strong>1.3.0: </strong> </p>
+    <ul>
+      <li> Fixed tempo routine output duration inaccuracy due to rounding
+        error</li>
+      <li> Implemented separate processing routines for integer and
+        floating arithmetic to allow improvements to floating point routines
+        (earlier used algorithms mostly optimized for integer arithmetic also
+        for floating point samples)</li>
+      <li> Fixed a bug that distorts sound if sample rate changes during
+        the sound stream</li>
+      <li> Fixed a memory leak that appeared in MMX/SSE/3DNow! optimized
+        routines</li>
+      <li> Reduced redundant code pieces in MMX/SSE/3DNow! optimized
+        routines vs. the standard C routines.</li>
+      <li> MMX routine incompatibility with new gcc compiler versions</li>
+      <li> Other miscellaneous bug fixes</li>
+    </ul>
+    <p><strong>1.2.1: </strong> </p>
+    <ul>
+      <li> Added automake/autoconf scripts for GNU platforms (in courtesy
+        of David Durham)</li>
+      <li> Fixed SCALE overflow bug in rate transposer routine.</li>
+      <li> Fixed 64bit address space bugs.</li>
+      <li> Created a 'soundtouch' namespace for SAMPLETYPE definitions.</li>
+    </ul>
+    <p><strong>1.2.0: </strong> </p>
+    <ul>
+      <li> Added support for 32bit floating point sample data type with
+        SSE/3DNow! optimizations for Win32 platform (SSE/3DNow! optimizations
+        currently not supported in GCC environment)</li>
+      <li> Replaced 'make-gcc' script for GNU environment by master
+        Makefile</li>
+      <li> Added time-stretch routine configurability to SoundTouch main
+        class</li>
+      <li> Bugfixes</li>
+    </ul>
+    <p><strong>1.1.1: </strong> </p>
+    <ul>
+      <li> Moved SoundTouch under lesser GPL license (LGPL). This allows
+        using SoundTouch library in programs that aren't released under GPL
+        license.</li>
+      <li> Changed MMX routine organiation so that MMX optimized routines
+        are now implemented in classes that are derived from the basic classes
+        having the standard non-mmx routines.</li>
+      <li> MMX routines to support gcc version 3.</li>
+      <li> Replaced windows makefiles by script using the .dsw files</li>
+    </ul>
+    <p><strong>1.0.1: </strong> </p>
+    <ul>
+      <li> "mmx_gcc.cpp": Added "using namespace std" and removed "return
+        0" from a function with void return value to fix compiler errors when
+        compiling the library in Solaris environment.</li>
+      <li> Moved file "FIFOSampleBuffer.h" to "include" directory to allow
+        accessing the FIFOSampleBuffer class from external files.</li>
+    </ul>
+    <p><strong>1.0: </strong> </p>
+    <ul>
+      <li> Initial release</li>
+    </ul>
+    <h3>5.2. SoundStretch application Change History </h3>
+    <p><b>1.9:</b></p>
+    <ul>
+      <li>Added support for WAV file 'fact' information chunk.</li>
     </ul>
- </li>
-  <li> Fixes in compilation scripts for non-Intel platforms</li>
-  <li> Added Dynamic-Link-Library (DLL) version of SoundTouch library
-build, provided with Delphi/Pascal wrapper for calling the dll routines
- </li>
-  <li> Added #define PREVENT_CLICK_AT_RATE_CROSSOVER that prevents a
-click artifact when crossing the nominal pitch from either positive to
-negative side or vice versa</li>
-</ul>
-<p><strong>1.4.1:</strong></p>
-<ul>
-  <li> Fixed a buffer overflow bug in BPM detect algorithm routines if
-processing more than 2048 samples at one call</li>
-</ul>
-<p><strong>1.4.0:</strong></p>
-<ul>
-  <li> Improved sound quality by automatic calculation of time stretch
-algorithm processing parameters according to tempo setting</li>
-  <li> Moved BPM detection routines from SoundStretch application into
-SoundTouch library</li>
-  <li> Bugfixes: Usage of uninitialied variables, GNU build scripts,
-compiler errors due to 'const' keyword mismatch.</li>
-  <li> Source code cleanup</li>
-</ul>
-<p><strong>1.3.1: </strong> </p>
-<ul>
-  <li> Changed static class declaration to GCC 4.x compiler compatible
-syntax.</li>
-  <li> Enabled MMX/SSE-optimized routines also for GCC compilers.
-Earlier the MMX/SSE-optimized routines were written in
-compiler-specific inline assembler, now these routines are migrated to
-use compiler intrinsic syntax which allows compiling the same
-MMX/SSE-optimized source code with both Visual C++ and GCC compilers.</li>
-  <li> Set floating point as the default sample format and added switch
-to the GNU configure script for selecting the other sample format.</li>
-</ul>
-<p><strong>1.3.0: </strong> </p>
-<ul>
-  <li> Fixed tempo routine output duration inaccuracy due to rounding
-error</li>
-  <li> Implemented separate processing routines for integer and
-floating arithmetic to allow improvements to floating point routines
-(earlier used algorithms mostly optimized for integer arithmetic also
-for floating point samples)</li>
-  <li> Fixed a bug that distorts sound if sample rate changes during
-the sound stream</li>
-  <li> Fixed a memory leak that appeared in MMX/SSE/3DNow! optimized
-routines</li>
-  <li> Reduced redundant code pieces in MMX/SSE/3DNow! optimized
-routines vs. the standard C routines.</li>
-  <li> MMX routine incompatibility with new gcc compiler versions</li>
-  <li> Other miscellaneous bug fixes</li>
-</ul>
-<p><strong>1.2.1: </strong> </p>
-<ul>
-  <li> Added automake/autoconf scripts for GNU platforms (in courtesy
-of David Durham)</li>
-  <li> Fixed SCALE overflow bug in rate transposer routine.</li>
-  <li> Fixed 64bit address space bugs.</li>
-  <li> Created a 'soundtouch' namespace for SAMPLETYPE definitions.</li>
-</ul>
-<p><strong>1.2.0: </strong> </p>
-<ul>
-  <li> Added support for 32bit floating point sample data type with
-SSE/3DNow! optimizations for Win32 platform (SSE/3DNow! optimizations
-currently not supported in GCC environment)</li>
-  <li> Replaced 'make-gcc' script for GNU environment by master
-Makefile</li>
-  <li> Added time-stretch routine configurability to SoundTouch main
-class</li>
-  <li> Bugfixes</li>
-</ul>
-<p><strong>1.1.1: </strong> </p>
-<ul>
-  <li> Moved SoundTouch under lesser GPL license (LGPL). This allows
-using SoundTouch library in programs that aren't released under GPL
-license.</li>
-  <li> Changed MMX routine organiation so that MMX optimized routines
-are now implemented in classes that are derived from the basic classes
-having the standard non-mmx routines.</li>
-  <li> MMX routines to support gcc version 3.</li>
-  <li> Replaced windows makefiles by script using the .dsw files</li>
-</ul>
-<p><strong>1.0.1: </strong> </p>
-<ul>
-  <li> "mmx_gcc.cpp": Added "using namespace std" and removed "return
-0" from a function with void return value to fix compiler errors when
-compiling the library in Solaris environment.</li>
-  <li> Moved file "FIFOSampleBuffer.h" to "include" directory to allow
-accessing the FIFOSampleBuffer class from external files.</li>
-</ul>
-<p><strong>1.0: </strong> </p>
-<ul>
-  <li> Initial release</li>
-</ul>
-<h3>5.2. SoundStretch application Change History </h3>
-<p><b>1.9:</b></p>
-<ul>
-    <li>Added support for WAV file 'fact' information chunk.</li>
-</ul>
 
-<p><b>1.7.0:</b></p>
-<ul>
-    <li>Bugfixes in Wavfile: exception string formatting, avoid getLengthMs() integer 
+    <p><b>1.7.0:</b></p>
+    <ul>
+      <li>Bugfixes in Wavfile: exception string formatting, avoid getLengthMs() integer
         precision overflow, support WAV files using 24/32bit sample format.</li>
-</ul>
+    </ul>
     <p><b>1.5.0:</b></p>
-<ul>
-  <li> Added "-speech" switch to activate algorithm parameters more
-suitable for speech processing than the default parameters tuned for
-music processing.</li>
-</ul>
-<p><strong>1.4.0:</strong></p>
-<ul>
-  <li> Moved BPM detection routines from SoundStretch application into
-SoundTouch library</li>
-  <li> Allow using standard input/output pipes as audio processing
-input/output streams</li>
-</ul>
-<p><strong>1.3.0:</strong></p>
-<ul>
-  <li> Simplified accessing WAV files with floating point sample
-format.</li>
-</ul>
-<p><strong>1.2.1: </strong> </p>
-<ul>
-  <li> Fixed 64bit address space bugs.</li>
-</ul>
-<p><strong>1.2.0: </strong> </p>
-<ul>
-  <li> Added support for 32bit floating point sample data type</li>
-  <li> Restructured the BPM routines into separate library</li>
-  <li> Fixed big-endian conversion bugs in WAV file routines (hopefully
-:)</li>
-</ul>
-<p><strong>1.1.1: </strong> </p>
-<ul>
-  <li> Fixed bugs in WAV file reading &amp; added byte-order conversion
-for big-endian processors.</li>
-  <li> Moved SoundStretch source code under 'example' directory to
-highlight difference from SoundTouch stuff.</li>
-  <li> Replaced windows makefiles by script using the .dsw files</li>
-  <li> Output file name isn't required if output isn't desired (e.g. if
-using the switch '-bpm' in plain format only)</li>
-</ul>
-<p><strong>1.1:</strong></p>
-<ul>
-  <li> Fixed "Release" settings in Microsoft Visual C++ project file
-(.dsp)</li>
-  <li> Added beats-per-minute (BPM) detection routine and command-line
-switch "-bpm"</li>
-</ul>
-<p><strong>1.01: </strong> </p>
-<ul>
-  <li> Initial release</li>
-</ul>
-<hr>
-<h2>6. Acknowledgements </h2>
-<p>Kudos for these people who have contributed to development or
-submitted bugfixes:</p>
-<ul>
-  <li> Arthur A</li>
-  <li> Paul Adenot</li>
-  <li> Richard Ash</li>
-  <li> Stanislav Brabec</li>
-  <li> Christian Budde</li>
-  <li> Jamie Bullock</li>
-  <li> Chris Bryan</li>  
-  <li> Jacek Caban</li>
-  <li> Brian Cameron</li>
-  <li> Jason Champion</li>
-  <li> David Clark</li>
-  <li> Patrick Colis</li>
-  <li> Miquel Colon</li>
-  <li> Jim Credland</li>
-  <li> Sandro Cumerlato</li>
-  <li> Gerry Fan</li>
-  <li> Justin Frankel</li>
-  <li> Masa H.</li>
-  <li> Jason Garland</li>
-  <li> Takashi Iwai</li>
-  <li> Thomas Klausner</li>
-  <li> Lu Zhihe</li>
-	<li> Luzpaz</li>
-  <li> Tony Mechelynck </li>
-  <li> Mathias M&ouml;hl</li>
-  <li> Yuval Naveh</li>
-  <li> Mats Palmgren </li>
-  <li> Chandni Patel</li>
-  <li> Paulo Pizarro</li>
-  <li> Andrey Ponomarenko</li>
-  <li> Blaise Potard</li>
-  <li> Michael Pruett</li>
-  <li> Rajeev Puran</li>
-  <li> RJ Ryan</li>
-  <li> John Sheehy</li>
-  <li> Tim Shuttleworth</li>
-  <li> Albert Sirvent</li>
-  <li> Tyson Smith</li>
-  <li> John Stumpo</li>
-  <li> Mario di Vece</li>
-  <li> Katja Vetter</li>
-  <li> Wu Q.</li>
-</ul>
-<p>Moral greetings to all other contributors and users also!</p>
-<hr>
-<h2>7. LICENSE </h2>
-<p>SoundTouch audio processing library<br>
-Copyright (c) Olli Parviainen</p>
-<p>This library is free software; you can redistribute it and/or modify
-it under the terms of the GNU Lesser General Public License version 2.1
-as published by the Free Software Foundation.</p>
-<p>This library is distributed in the hope that it will be useful, but
-WITHOUT ANY WARRANTY; without even the implied warranty of
-MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser
-General Public License for more details.</p>
-<p>You should have received a copy of the GNU Lesser General Public
-License along with this library; if not, write to the Free Software
-Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA</p>
-<p>---</p>
-<p>commercial license alternative also available, contact author for details.</p>
-<hr>
-<p><i>README.html file updated in November-2018</i></p>
+    <ul>
+      <li> Added "-speech" switch to activate algorithm parameters more
+        suitable for speech processing than the default parameters tuned for
+        music processing.</li>
+    </ul>
+    <p><strong>1.4.0:</strong></p>
+    <ul>
+      <li> Moved BPM detection routines from SoundStretch application into
+        SoundTouch library</li>
+      <li> Allow using standard input/output pipes as audio processing
+        input/output streams</li>
+    </ul>
+    <p><strong>1.3.0:</strong></p>
+    <ul>
+      <li> Simplified accessing WAV files with floating point sample
+        format.</li>
+    </ul>
+    <p><strong>1.2.1: </strong> </p>
+    <ul>
+      <li> Fixed 64bit address space bugs.</li>
+    </ul>
+    <p><strong>1.2.0: </strong> </p>
+    <ul>
+      <li> Added support for 32bit floating point sample data type</li>
+      <li> Restructured the BPM routines into separate library</li>
+      <li> Fixed big-endian conversion bugs in WAV file routines (hopefully
+        :)</li>
+    </ul>
+    <p><strong>1.1.1: </strong> </p>
+    <ul>
+      <li> Fixed bugs in WAV file reading &amp; added byte-order conversion
+        for big-endian processors.</li>
+      <li> Moved SoundStretch source code under 'example' directory to
+        highlight difference from SoundTouch stuff.</li>
+      <li> Replaced windows makefiles by script using the .dsw files</li>
+      <li> Output file name isn't required if output isn't desired (e.g. if
+        using the switch '-bpm' in plain format only)</li>
+    </ul>
+    <p><strong>1.1:</strong></p>
+    <ul>
+      <li> Fixed "Release" settings in Microsoft Visual C++ project file
+        (.dsp)</li>
+      <li> Added beats-per-minute (BPM) detection routine and command-line
+        switch "-bpm"</li>
+    </ul>
+    <p><strong>1.01: </strong> </p>
+    <ul>
+      <li> Initial release</li>
+    </ul>
+    <hr>
+    <h2>6. Acknowledgements </h2>
+    <p>Kudos for these people who have contributed to development or
+      submitted bugfixes:</p>
+    <ul>
+      <li> Arthur A</li>
+      <li> Paul Adenot</li>
+      <li> Richard Ash</li>
+      <li> Stanislav Brabec</li>
+      <li> Christian Budde</li>
+      <li> Jamie Bullock</li>
+      <li> Chris Bryan</li>
+      <li> Jacek Caban</li>
+      <li> Marketa Calabkova</li>
+      <li> Brian Cameron</li>
+      <li> Jason Champion</li>
+      <li> Giuseppe Cigala</li>
+      <li> David Clark</li>
+      <li> Patrick Colis</li>
+      <li> Miquel Colon</li>
+      <li> Jim Credland</li>
+      <li> Sandro Cumerlato</li>
+      <li> Gerry Fan</li>
+      <li> Justin Frankel</li>
+      <li> Masa H.</li>
+      <li> Jason Garland</li>
+      <li> Takashi Iwai</li>
+      <li> Thomas Klausner</li>
+      <li> Lu Zhihe</li>
+      <li> Luzpaz</li>
+      <li> Tony Mechelynck </li>
+      <li> Mathias M&ouml;hl</li>
+      <li> Yuval Naveh</li>
+      <li> Mats Palmgren </li>
+      <li> Chandni Patel</li>
+      <li> Paulo Pizarro</li>
+      <li> Andrey Ponomarenko</li>
+      <li> Blaise Potard</li>
+      <li> Michael Pruett</li>
+      <li> Rajeev Puran</li>
+      <li> RJ Ryan</li>
+      <li> John Sheehy</li>
+      <li> Tim Shuttleworth</li>
+      <li> Albert Sirvent</li>
+      <li> Tyson Smith</li>
+      <li> John Stumpo</li>
+      <li> Mario di Vece</li>
+      <li> Rémi Verschelde</li>
+      <li> Katja Vetter</li>
+      <li> Wu Q.</li>
+    </ul>
+    <p>Moral greetings to all other contributors and users also!</p>
+    <hr>
+    <h2>7. LICENSE </h2>
+    <p>SoundTouch audio processing library<br>
+      Copyright (c) Olli Parviainen</p>
+    <p>This library is free software; you can redistribute it and/or modify
+      it under the terms of the GNU Lesser General Public License version 2.1
+      as published by the Free Software Foundation.</p>
+    <p>This library is distributed in the hope that it will be useful, but
+      WITHOUT ANY WARRANTY; without even the implied warranty of
+      MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser
+      General Public License for more details.</p>
+    <p>You should have received a copy of the GNU Lesser General Public
+      License along with this library; if not, write to the Free Software
+      Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA</p>
+    <p>---</p>
+    <p>commercial license alternative also available, contact author for details.</p>
+    <hr>
 </body>
-</html>
+
+</html>
\ No newline at end of file
diff --git a/3rdparty/soundtouch/soundtouch/FIFOSampleBuffer.h b/3rdparty/soundtouch/soundtouch/FIFOSampleBuffer.h
index f7623f2a88..537a7b8722 100644
--- a/3rdparty/soundtouch/soundtouch/FIFOSampleBuffer.h
+++ b/3rdparty/soundtouch/soundtouch/FIFOSampleBuffer.h
@@ -170,6 +170,9 @@ public:
     /// allow trimming (downwards) amount of samples in pipeline.
     /// Returns adjusted amount of samples
     uint adjustAmountOfSamples(uint numSamples);
+
+    /// Add silence to end of buffer
+    void addSilent(uint nSamples);
 };
 
 }
diff --git a/3rdparty/soundtouch/soundtouch/STTypes.h b/3rdparty/soundtouch/soundtouch/STTypes.h
index 423f51b087..dc964b0222 100644
--- a/3rdparty/soundtouch/soundtouch/STTypes.h
+++ b/3rdparty/soundtouch/soundtouch/STTypes.h
@@ -121,10 +121,10 @@ namespace soundtouch
 
     #endif
 
-    // If defined, allows the SIMD-optimized routines to take minor shortcuts 
-    // for improved performance. Undefine to require faithfully similar SIMD 
-    // calculations as in normal C implementation.
-    #define SOUNDTOUCH_ALLOW_NONEXACT_SIMD_OPTIMIZATION    1
+    // If defined, allows the SIMD-optimized routines to skip unevenly aligned
+    // memory offsets that can cause performance penalty in some SIMD implementations.
+    // Causes slight compromise in sound quality.
+    // #define SOUNDTOUCH_ALLOW_NONEXACT_SIMD_OPTIMIZATION    1
 
 
     #ifdef SOUNDTOUCH_INTEGER_SAMPLES
@@ -149,8 +149,9 @@ namespace soundtouch
 
         // floating point samples
         typedef float  SAMPLETYPE;
-        // data type for sample accumulation: Use double to utilize full precision.
-        typedef double LONG_SAMPLETYPE;
+        // data type for sample accumulation: Use float also here to enable
+        // efficient autovectorization
+        typedef float LONG_SAMPLETYPE;
 
         #ifdef SOUNDTOUCH_ALLOW_X86_OPTIMIZATIONS
             // Allow SSE optimizations
@@ -159,7 +160,13 @@ namespace soundtouch
 
     #endif  // SOUNDTOUCH_INTEGER_SAMPLES
 
-};
+    #if ((SOUNDTOUCH_ALLOW_SSE) || (__SSE__) || (SOUNDTOUCH_USE_NEON))
+        #if SOUNDTOUCH_ALLOW_NONEXACT_SIMD_OPTIMIZATION
+            #define ST_SIMD_AVOID_UNALIGNED
+        #endif
+    #endif
+
+}
 
 // define ST_NO_EXCEPTION_HANDLING switch to disable throwing std exceptions:
 // #define ST_NO_EXCEPTION_HANDLING    1
diff --git a/3rdparty/soundtouch/soundtouch/SoundTouch.h b/3rdparty/soundtouch/soundtouch/SoundTouch.h
index abac5371dc..f5cc5d6bec 100644
--- a/3rdparty/soundtouch/soundtouch/SoundTouch.h
+++ b/3rdparty/soundtouch/soundtouch/SoundTouch.h
@@ -72,10 +72,10 @@ namespace soundtouch
 {
 
 /// Soundtouch library version string
-#define SOUNDTOUCH_VERSION          "2.1.2"
+#define SOUNDTOUCH_VERSION          "2.3.1"
 
 /// SoundTouch library version id
-#define SOUNDTOUCH_VERSION_ID       (20102)
+#define SOUNDTOUCH_VERSION_ID       (20301)
 
 //
 // Available setting IDs for the 'setSetting' & 'get_setting' functions:
diff --git a/3rdparty/soundtouch/source/SoundTouch/BPMDetect.cpp b/3rdparty/soundtouch/source/SoundTouch/BPMDetect.cpp
index e28cc93007..959f36e3f7 100644
--- a/3rdparty/soundtouch/source/SoundTouch/BPMDetect.cpp
+++ b/3rdparty/soundtouch/source/SoundTouch/BPMDetect.cpp
@@ -313,7 +313,7 @@ void BPMDetect::updateXCorr(int process_samples)
     #pragma omp parallel for
     for (offs = windowStart; offs < windowLen; offs ++) 
     {
-        double sum;
+        float sum;
         int i;
 
         sum = 0;
@@ -341,7 +341,6 @@ void BPMDetect::updateBeatPos(int process_samples)
     //    static double thr = 0.0003;
     double posScale = (double)this->decimateBy / (double)this->sampleRate;
     int resetDur = (int)(0.12 / posScale + 0.5);
-    double corrScale = 1.0 / (double)(windowLen - windowStart);
 
     // prescale pbuffer
     float tmp[XCORR_UPDATE_SEQUENCE / 2];
@@ -353,7 +352,7 @@ void BPMDetect::updateBeatPos(int process_samples)
     #pragma omp parallel for
     for (int offs = windowStart; offs < windowLen; offs++)
     {
-        double sum = 0;
+        float sum = 0;
         for (int i = 0; i < process_samples; i++)
         {
             sum += tmp[i] * pBuffer[offs + i];
@@ -562,7 +561,7 @@ float BPMDetect::getBpm()
 /// \return number of beats in the arrays.
 int BPMDetect::getBeats(float *pos, float *values, int max_num)
 {
-    int num = beats.size();
+    int num = (int)beats.size();
     if ((!pos) || (!values)) return num;    // pos or values NULL, return just size
 
     for (int i = 0; (i < num) && (i < max_num); i++)
diff --git a/3rdparty/soundtouch/source/SoundTouch/FIFOSampleBuffer.cpp b/3rdparty/soundtouch/source/SoundTouch/FIFOSampleBuffer.cpp
index 5206746c0b..ad36875466 100644
--- a/3rdparty/soundtouch/source/SoundTouch/FIFOSampleBuffer.cpp
+++ b/3rdparty/soundtouch/source/SoundTouch/FIFOSampleBuffer.cpp
@@ -265,3 +265,11 @@ uint FIFOSampleBuffer::adjustAmountOfSamples(uint numSamples)
     }
     return samplesInBuffer;
 }
+
+
+/// Add silence to end of buffer
+void FIFOSampleBuffer::addSilent(uint nSamples)
+{
+    memset(ptrEnd(nSamples), 0, sizeof(SAMPLETYPE) * nSamples * channels);
+    samplesInBuffer += nSamples;
+}
diff --git a/3rdparty/soundtouch/source/SoundTouch/FIRFilter.cpp b/3rdparty/soundtouch/source/SoundTouch/FIRFilter.cpp
index 0af11852bf..24cc33bf6f 100644
--- a/3rdparty/soundtouch/source/SoundTouch/FIRFilter.cpp
+++ b/3rdparty/soundtouch/source/SoundTouch/FIRFilter.cpp
@@ -60,12 +60,14 @@ FIRFilter::FIRFilter()
     length = 0;
     lengthDiv8 = 0;
     filterCoeffs = NULL;
+    filterCoeffsStereo = NULL;
 }
 
 
 FIRFilter::~FIRFilter()
 {
     delete[] filterCoeffs;
+    delete[] filterCoeffsStereo;
 }
 
 
@@ -78,35 +80,26 @@ uint FIRFilter::evaluateFilterStereo(SAMPLETYPE *dest, const SAMPLETYPE *src, ui
     // because division is much slower operation than multiplying.
     double dScaler = 1.0 / (double)resultDivider;
 #endif
+    // hint compiler autovectorization that loop length is divisible by 8
+    int ilength = length & -8;
 
-    assert(length != 0);
-    assert(src != NULL);
-    assert(dest != NULL);
-    assert(filterCoeffs != NULL);
+    assert((length != 0) && (length == ilength) && (src != NULL) && (dest != NULL) && (filterCoeffs != NULL));
 
-    end = 2 * (numSamples - length);
+    end = 2 * (numSamples - ilength);
 
     #pragma omp parallel for
     for (j = 0; j < end; j += 2) 
     {
         const SAMPLETYPE *ptr;
         LONG_SAMPLETYPE suml, sumr;
-        uint i;
 
         suml = sumr = 0;
         ptr = src + j;
 
-        for (i = 0; i < length; i += 4) 
+        for (int i = 0; i < ilength; i ++)
         {
-            // loop is unrolled by factor of 4 here for efficiency
-            suml += ptr[2 * i + 0] * filterCoeffs[i + 0] +
-                    ptr[2 * i + 2] * filterCoeffs[i + 1] +
-                    ptr[2 * i + 4] * filterCoeffs[i + 2] +
-                    ptr[2 * i + 6] * filterCoeffs[i + 3];
-            sumr += ptr[2 * i + 1] * filterCoeffs[i + 0] +
-                    ptr[2 * i + 3] * filterCoeffs[i + 1] +
-                    ptr[2 * i + 5] * filterCoeffs[i + 2] +
-                    ptr[2 * i + 7] * filterCoeffs[i + 3];
+            suml += ptr[2 * i] * filterCoeffsStereo[2 * i];
+            sumr += ptr[2 * i + 1] * filterCoeffsStereo[2 * i + 1];
         }
 
 #ifdef SOUNDTOUCH_INTEGER_SAMPLES
@@ -116,14 +109,11 @@ uint FIRFilter::evaluateFilterStereo(SAMPLETYPE *dest, const SAMPLETYPE *src, ui
         suml = (suml < -32768) ? -32768 : (suml > 32767) ? 32767 : suml;
         // saturate to 16 bit integer limits
         sumr = (sumr < -32768) ? -32768 : (sumr > 32767) ? 32767 : sumr;
-#else
-        suml *= dScaler;
-        sumr *= dScaler;
 #endif // SOUNDTOUCH_INTEGER_SAMPLES
         dest[j] = (SAMPLETYPE)suml;
         dest[j + 1] = (SAMPLETYPE)sumr;
     }
-    return numSamples - length;
+    return numSamples - ilength;
 }
 
 
@@ -137,31 +127,28 @@ uint FIRFilter::evaluateFilterMono(SAMPLETYPE *dest, const SAMPLETYPE *src, uint
     double dScaler = 1.0 / (double)resultDivider;
 #endif
 
-    assert(length != 0);
+    // hint compiler autovectorization that loop length is divisible by 8
+    int ilength = length & -8;
 
-    end = numSamples - length;
+    assert(ilength != 0);
+
+    end = numSamples - ilength;
     #pragma omp parallel for
-    for (j = 0; j < end; j ++) 
+    for (j = 0; j < end; j ++)
     {
         const SAMPLETYPE *pSrc = src + j;
         LONG_SAMPLETYPE sum;
-        uint i;
+        int i;
 
         sum = 0;
-        for (i = 0; i < length; i += 4) 
+        for (i = 0; i < ilength; i ++)
         {
-            // loop is unrolled by factor of 4 here for efficiency
-            sum += pSrc[i + 0] * filterCoeffs[i + 0] + 
-                   pSrc[i + 1] * filterCoeffs[i + 1] + 
-                   pSrc[i + 2] * filterCoeffs[i + 2] + 
-                   pSrc[i + 3] * filterCoeffs[i + 3];
+            sum += pSrc[i] * filterCoeffs[i];
         }
 #ifdef SOUNDTOUCH_INTEGER_SAMPLES
         sum >>= resultDivFactor;
         // saturate to 16 bit integer limits
         sum = (sum < -32768) ? -32768 : (sum > 32767) ? 32767 : sum;
-#else
-        sum *= dScaler;
 #endif // SOUNDTOUCH_INTEGER_SAMPLES
         dest[j] = (SAMPLETYPE)sum;
     }
@@ -185,14 +172,18 @@ uint FIRFilter::evaluateFilterMulti(SAMPLETYPE *dest, const SAMPLETYPE *src, uin
     assert(filterCoeffs != NULL);
     assert(numChannels < 16);
 
-    end = numChannels * (numSamples - length);
+    // hint compiler autovectorization that loop length is divisible by 8
+    int ilength = length & -8;
+
+    end = numChannels * (numSamples - ilength);
 
     #pragma omp parallel for
     for (j = 0; j < end; j += numChannels)
     {
         const SAMPLETYPE *ptr;
         LONG_SAMPLETYPE sums[16];
-        uint c, i;
+        uint c;
+        int i;
 
         for (c = 0; c < numChannels; c ++)
         {
@@ -201,7 +192,7 @@ uint FIRFilter::evaluateFilterMulti(SAMPLETYPE *dest, const SAMPLETYPE *src, uin
 
         ptr = src + j;
 
-        for (i = 0; i < length; i ++)
+        for (i = 0; i < ilength; i ++)
         {
             SAMPLETYPE coef=filterCoeffs[i];
             for (c = 0; c < numChannels; c ++)
@@ -215,13 +206,11 @@ uint FIRFilter::evaluateFilterMulti(SAMPLETYPE *dest, const SAMPLETYPE *src, uin
         {
 #ifdef SOUNDTOUCH_INTEGER_SAMPLES
             sums[c] >>= resultDivFactor;
-#else
-            sums[c] *= dScaler;
 #endif // SOUNDTOUCH_INTEGER_SAMPLES
             dest[j+c] = (SAMPLETYPE)sums[c];
         }
     }
-    return numSamples - length;
+    return numSamples - ilength;
 }
 
 
@@ -233,6 +222,13 @@ void FIRFilter::setCoefficients(const SAMPLETYPE *coeffs, uint newLength, uint u
     assert(newLength > 0);
     if (newLength % 8) ST_THROW_RT_ERROR("FIR filter length not divisible by 8");
 
+    #ifdef SOUNDTOUCH_FLOAT_SAMPLES
+        // scale coefficients already here if using floating samples
+        double scale = 1.0 / resultDivider;
+    #else
+        short scale = 1;
+    #endif
+
     lengthDiv8 = newLength / 8;
     length = lengthDiv8 * 8;
     assert(length == newLength);
@@ -242,7 +238,16 @@ void FIRFilter::setCoefficients(const SAMPLETYPE *coeffs, uint newLength, uint u
 
     delete[] filterCoeffs;
     filterCoeffs = new SAMPLETYPE[length];
-    memcpy(filterCoeffs, coeffs, length * sizeof(SAMPLETYPE));
+    delete[] filterCoeffsStereo;
+    filterCoeffsStereo = new SAMPLETYPE[length*2];
+    for (uint i = 0; i < length; i ++)
+    {
+        filterCoeffs[i] = (SAMPLETYPE)(coeffs[i] * scale);
+        // create also stereo set of filter coefficients: this allows compiler
+        // to autovectorize filter evaluation much more efficiently
+        filterCoeffsStereo[2 * i] = (SAMPLETYPE)(coeffs[i] * scale);
+        filterCoeffsStereo[2 * i + 1] = (SAMPLETYPE)(coeffs[i] * scale);
+    }
 }
 
 
diff --git a/3rdparty/soundtouch/source/SoundTouch/FIRFilter.h b/3rdparty/soundtouch/source/SoundTouch/FIRFilter.h
index 6408302770..39c2cc7542 100644
--- a/3rdparty/soundtouch/source/SoundTouch/FIRFilter.h
+++ b/3rdparty/soundtouch/source/SoundTouch/FIRFilter.h
@@ -57,6 +57,7 @@ protected:
 
     // Memory for filter coefficients
     SAMPLETYPE *filterCoeffs;
+    SAMPLETYPE *filterCoeffsStereo;
 
     virtual uint evaluateFilterStereo(SAMPLETYPE *dest, 
                                       const SAMPLETYPE *src, 
diff --git a/3rdparty/soundtouch/source/SoundTouch/InterpolateCubic.h b/3rdparty/soundtouch/source/SoundTouch/InterpolateCubic.h
index 7f84f28024..481abd64bc 100644
--- a/3rdparty/soundtouch/source/SoundTouch/InterpolateCubic.h
+++ b/3rdparty/soundtouch/source/SoundTouch/InterpolateCubic.h
@@ -41,7 +41,6 @@ namespace soundtouch
 class InterpolateCubic : public TransposerBase
 {
 protected:
-    virtual void resetRegisters();
     virtual int transposeMono(SAMPLETYPE *dest, 
                         const SAMPLETYPE *src, 
                         int &srcSamples);
@@ -56,6 +55,13 @@ protected:
 
 public:
     InterpolateCubic();
+
+    virtual void resetRegisters();
+
+    int getLatency() const
+    {
+        return 1;
+    }
 };
 
 }
diff --git a/3rdparty/soundtouch/source/SoundTouch/InterpolateLinear.cpp b/3rdparty/soundtouch/source/SoundTouch/InterpolateLinear.cpp
index edfb8d10c0..9533e79b79 100644
--- a/3rdparty/soundtouch/source/SoundTouch/InterpolateLinear.cpp
+++ b/3rdparty/soundtouch/source/SoundTouch/InterpolateLinear.cpp
@@ -142,7 +142,7 @@ int InterpolateLinearInteger::transposeMulti(SAMPLETYPE *dest, const SAMPLETYPE
         LONG_SAMPLETYPE temp, vol1;
     
         assert(iFract < SCALE);
-        vol1 = (SCALE - iFract);
+        vol1 = (LONG_SAMPLETYPE)(SCALE - iFract);
         for (int c = 0; c < numChannels; c ++)
         {
             temp = vol1 * src[c] + iFract * src[c + numChannels];
diff --git a/3rdparty/soundtouch/source/SoundTouch/InterpolateLinear.h b/3rdparty/soundtouch/source/SoundTouch/InterpolateLinear.h
index 087dc2aecf..ff362e84b1 100644
--- a/3rdparty/soundtouch/source/SoundTouch/InterpolateLinear.h
+++ b/3rdparty/soundtouch/source/SoundTouch/InterpolateLinear.h
@@ -45,8 +45,6 @@ protected:
     int iFract;
     int iRate;
 
-    virtual void resetRegisters();
-
     virtual int transposeMono(SAMPLETYPE *dest, 
                        const SAMPLETYPE *src, 
                        int &srcSamples);
@@ -60,6 +58,13 @@ public:
     /// Sets new target rate. Normal rate = 1.0, smaller values represent slower 
     /// rate, larger faster rates.
     virtual void setRate(double newRate);
+
+    virtual void resetRegisters();
+
+    int getLatency() const
+    {
+        return 0;
+    }
 };
 
 
@@ -69,8 +74,6 @@ class InterpolateLinearFloat : public TransposerBase
 protected:
     double fract;
 
-    virtual void resetRegisters();
-
     virtual int transposeMono(SAMPLETYPE *dest, 
                        const SAMPLETYPE *src, 
                        int &srcSamples);
@@ -81,6 +84,13 @@ protected:
 
 public:
     InterpolateLinearFloat();
+
+    virtual void resetRegisters();
+
+    int getLatency() const
+    {
+        return 0;
+    }
 };
 
 }
diff --git a/3rdparty/soundtouch/source/SoundTouch/InterpolateShannon.h b/3rdparty/soundtouch/source/SoundTouch/InterpolateShannon.h
index 2a59a84f19..72ab0b526d 100644
--- a/3rdparty/soundtouch/source/SoundTouch/InterpolateShannon.h
+++ b/3rdparty/soundtouch/source/SoundTouch/InterpolateShannon.h
@@ -46,7 +46,6 @@ namespace soundtouch
 class InterpolateShannon : public TransposerBase
 {
 protected:
-    void resetRegisters();
     int transposeMono(SAMPLETYPE *dest, 
                         const SAMPLETYPE *src, 
                         int &srcSamples);
@@ -61,6 +60,13 @@ protected:
 
 public:
     InterpolateShannon();
+
+    void resetRegisters();
+
+    int getLatency() const
+    {
+        return 3;
+    }
 };
 
 }
diff --git a/3rdparty/soundtouch/source/SoundTouch/PeakFinder.cpp b/3rdparty/soundtouch/source/SoundTouch/PeakFinder.cpp
index 91cb1696d9..44dc937ae7 100644
--- a/3rdparty/soundtouch/source/SoundTouch/PeakFinder.cpp
+++ b/3rdparty/soundtouch/source/SoundTouch/PeakFinder.cpp
@@ -57,7 +57,7 @@ int PeakFinder::findTop(const float *data, int peakpos) const
 
     refvalue = data[peakpos];
 
-    // seek within �10 points
+    // seek within ±10 points
     start = peakpos - 10;
     if (start < minPos) start = minPos;
     end = peakpos + 10;
@@ -142,7 +142,7 @@ int PeakFinder::findCrossingLevel(const float *data, float level, int peakpos, i
     peaklevel = data[peakpos];
     assert(peaklevel >= level);
     pos = peakpos;
-    while ((pos >= minPos) && (pos < maxPos))
+    while ((pos >= minPos) && (pos + direction < maxPos))
     {
         if (data[pos + direction] < level) return pos;   // crossing found
         pos += direction;
@@ -256,7 +256,7 @@ double PeakFinder::detectPeak(const float *data, int aminPos, int amaxPos)
 
         // accept harmonic peak if 
         // (a) it is found
-        // (b) is within �4% of the expected harmonic interval
+        // (b) is within ±4% of the expected harmonic interval
         // (c) has at least half x-corr value of the max. peak
 
         double diff = harmonic * peaktmp / highPeak;
diff --git a/3rdparty/soundtouch/source/SoundTouch/RateTransposer.cpp b/3rdparty/soundtouch/source/SoundTouch/RateTransposer.cpp
index 24d8f36b34..4c202391e0 100644
--- a/3rdparty/soundtouch/source/SoundTouch/RateTransposer.cpp
+++ b/3rdparty/soundtouch/source/SoundTouch/RateTransposer.cpp
@@ -61,6 +61,7 @@ RateTransposer::RateTransposer() : FIFOProcessor(&outputBuffer)
     // Instantiates the anti-alias filter
     pAAFilter = new AAFilter(64);
     pTransposer = TransposerBase::newInstance();
+    clear();
 }
 
 
@@ -77,6 +78,7 @@ void RateTransposer::enableAAFilter(bool newMode)
 #ifndef SOUNDTOUCH_PREVENT_CLICK_AT_RATE_CROSSOVER
     // Disable Anti-alias filter if desirable to avoid click at rate change zero value crossover
     bUseAAFilter = newMode;
+    clear();
 #endif
 }
 
@@ -192,6 +194,11 @@ void RateTransposer::clear()
     outputBuffer.clear();
     midBuffer.clear();
     inputBuffer.clear();
+    pTransposer->resetRegisters();
+
+    // prefill buffer to avoid losing first samples at beginning of stream
+    int prefill = getLatency();
+    inputBuffer.addSilent(prefill);
 }
 
 
@@ -209,7 +216,8 @@ int RateTransposer::isEmpty() const
 /// Return approximate initial input-output latency
 int RateTransposer::getLatency() const
 {
-    return (bUseAAFilter) ? pAAFilter->getLength() : 0;
+    return pTransposer->getLatency() +
+        ((bUseAAFilter) ? (pAAFilter->getLength() / 2) : 0);
 }
 
 
diff --git a/3rdparty/soundtouch/source/SoundTouch/RateTransposer.h b/3rdparty/soundtouch/source/SoundTouch/RateTransposer.h
index 5f31d1a6dd..59381fab5f 100644
--- a/3rdparty/soundtouch/source/SoundTouch/RateTransposer.h
+++ b/3rdparty/soundtouch/source/SoundTouch/RateTransposer.h
@@ -59,8 +59,6 @@ public:
     };
 
 protected:
-    virtual void resetRegisters() = 0;
-
     virtual int transposeMono(SAMPLETYPE *dest, 
                         const SAMPLETYPE *src, 
                         int &srcSamples)  = 0;
@@ -83,6 +81,9 @@ public:
     virtual int transpose(FIFOSampleBuffer &dest, FIFOSampleBuffer &src);
     virtual void setRate(double newRate);
     virtual void setChannels(int channels);
+    virtual int getLatency() const = 0;
+
+    virtual void resetRegisters() = 0;
 
     // static factory function
     static TransposerBase *newInstance();
diff --git a/3rdparty/soundtouch/source/SoundTouch/TDStretch.cpp b/3rdparty/soundtouch/source/SoundTouch/TDStretch.cpp
index b37985fbd4..c903a92f6b 100644
--- a/3rdparty/soundtouch/source/SoundTouch/TDStretch.cpp
+++ b/3rdparty/soundtouch/source/SoundTouch/TDStretch.cpp
@@ -1,4 +1,4 @@
-////////////////////////////////////////////////////////////////////////////////
+///////////////////////////////////////////////////////////////////////////////
 /// 
 /// Sampled sound tempo changer/time stretch algorithm. Changes the sound tempo 
 /// while maintaining the original pitch by using a time domain WSOLA-like 
@@ -54,7 +54,6 @@ using namespace soundtouch;
 
 #define max(x, y) (((x) > (y)) ? (x) : (y))
 
-
 /*****************************************************************************
  *
  * Constant definitions
@@ -93,11 +92,6 @@ TDStretch::TDStretch() : FIFOProcessor(&outputBuffer)
     bAutoSeqSetting = true;
     bAutoSeekSetting = true;
 
-    maxnorm = 0;
-    maxnormf = 1e8;
-
-    skipFract = 0;
-
     tempo = 1.0f;
     setParameters(44100, DEFAULT_SEQUENCE_MS, DEFAULT_SEEKWINDOW_MS, DEFAULT_OVERLAP_MS);
     setTempo(1.0f);
@@ -203,7 +197,7 @@ void TDStretch::overlapMono(SAMPLETYPE *pOutput, const SAMPLETYPE *pInput) const
     m1 = (SAMPLETYPE)0;
     m2 = (SAMPLETYPE)overlapLength;
 
-    for (i = 0; i < overlapLength ; i ++) 
+    for (i = 0; i < overlapLength ; i ++)
     {
         pOutput[i] = (pInput[i] * m1 + pMidBuffer[i] * m2 ) / overlapLength;
         m1 += 1;
@@ -224,6 +218,9 @@ void TDStretch::clearInput()
     inputBuffer.clear();
     clearMidBuffer();
     isBeginning = true;
+    maxnorm = 0;
+    maxnormf = 1e8;
+    skipFract = 0;
 }
 
 
@@ -311,13 +308,14 @@ int TDStretch::seekBestOverlapPositionFull(const SAMPLETYPE *refPos)
     bestCorr = (bestCorr + 0.1) * 0.75;
 
     #pragma omp parallel for
-    for (i = 1; i < seekLength; i ++) 
+    for (i = 1; i < seekLength; i ++)
     {
         double corr;
         // Calculates correlation value for the mixing position corresponding to 'i'
-#ifdef _OPENMP
+#if defined(_OPENMP) || defined(ST_SIMD_AVOID_UNALIGNED)
         // in parallel OpenMP mode, can't use norm accumulator version as parallel executor won't
         // iterate the loop in sequential order
+        // in SIMD mode, avoid accumulator version to allow avoiding unaligned positions
         corr = calcCrossCorr(refPos + channels * i, pMidBuffer, norm);
 #else
         // In non-parallel version call "calcCrossCorrAccumulate" that is otherwise same
@@ -675,23 +673,24 @@ void TDStretch::processSamples()
             // Adjust processing offset at beginning of track by not perform initial overlapping
             // and compensating that in the 'input buffer skip' calculation
             isBeginning = false;
-            int skip = (int)(tempo * overlapLength + 0.5);
+            int skip = (int)(tempo * overlapLength + 0.5 * seekLength + 0.5);
 
-            #ifdef SOUNDTOUCH_ALLOW_NONEXACT_SIMD_OPTIMIZATION
-                #ifdef SOUNDTOUCH_ALLOW_SSE
-                // if SSE mode, round the skip amount to value corresponding to aligned memory address
-                if (channels == 1)
-                {
-                    skip &= -4;
-                }
-                else if (channels == 2)
-                {
-                    skip &= -2;
-                }
-                #endif
+            #ifdef ST_SIMD_AVOID_UNALIGNED
+            // in SIMD mode, round the skip amount to value corresponding to aligned memory address
+            if (channels == 1)
+            {
+                skip &= -4;
+            }
+            else if (channels == 2)
+            {
+                skip &= -2;
+            }
             #endif
             skipFract -= skip;
-            assert(nominalSkip >= -skipFract);
+            if (skipFract <= -nominalSkip)
+            {
+                skipFract = -nominalSkip;
+            }
         }
 
         // ... then copy sequence samples from 'inputBuffer' to output:
@@ -818,7 +817,7 @@ void TDStretch::overlapStereo(short *poutput, const short *input) const
     short temp;
     int cnt2;
 
-    for (i = 0; i < overlapLength ; i ++) 
+    for (i = 0; i < overlapLength ; i ++)
     {
         temp = (short)(overlapLength - i);
         cnt2 = 2 * i;
@@ -830,21 +829,19 @@ void TDStretch::overlapStereo(short *poutput, const short *input) const
 
 // Overlaps samples in 'midBuffer' with the samples in 'input'. The 'Multi'
 // version of the routine.
-void TDStretch::overlapMulti(SAMPLETYPE *poutput, const SAMPLETYPE *input) const
+void TDStretch::overlapMulti(short *poutput, const short *input) const
 {
-    SAMPLETYPE m1=(SAMPLETYPE)0;
-    SAMPLETYPE m2;
-    int i=0;
+    short m1;
+    int i = 0;
 
-    for (m2 = (SAMPLETYPE)overlapLength; m2; m2 --)
+    for (m1 = 0; m1 < overlapLength; m1 ++)
     {
+        short m2 = (short)(overlapLength - m1);
         for (int c = 0; c < channels; c ++)
         {
             poutput[i] = (input[i] * m1 + pMidBuffer[i] * m2)  / overlapLength;
             i++;
         }
-
-        m1++;
     }
 }
 
@@ -889,20 +886,23 @@ double TDStretch::calcCrossCorr(const short *mixingPos, const short *compare, do
     unsigned long lnorm;
     int i;
 
+    #ifdef ST_SIMD_AVOID_UNALIGNED
+        // in SIMD mode skip 'mixingPos' positions that aren't aligned to 16-byte boundary
+        if (((ulongptr)mixingPos) & 15) return -1e50;
+    #endif
+
+    // hint compiler autovectorization that loop length is divisible by 8
+    int ilength = (channels * overlapLength) & -8;
+
     corr = lnorm = 0;
-    // Same routine for stereo and mono. For stereo, unroll loop for better
-    // efficiency and gives slightly better resolution against rounding. 
-    // For mono it same routine, just  unrolls loop by factor of 4
-    for (i = 0; i < channels * overlapLength; i += 4) 
+    // Same routine for stereo and mono
+    for (i = 0; i < ilength; i += 2)
     {
         corr += (mixingPos[i] * compare[i] + 
-                 mixingPos[i + 1] * compare[i + 1]) >> overlapDividerBitsNorm;  // notice: do intermediate division here to avoid integer overflow
-        corr += (mixingPos[i + 2] * compare[i + 2] + 
-                mixingPos[i + 3] * compare[i + 3]) >> overlapDividerBitsNorm;
+                 mixingPos[i + 1] * compare[i + 1]) >> overlapDividerBitsNorm;
         lnorm += (mixingPos[i] * mixingPos[i] + 
-                mixingPos[i + 1] * mixingPos[i + 1]) >> overlapDividerBitsNorm; // notice: do intermediate division here to avoid integer overflow
-        lnorm += (mixingPos[i + 2] * mixingPos[i + 2] + 
-                mixingPos[i + 3] * mixingPos[i + 3]) >> overlapDividerBitsNorm;
+                  mixingPos[i + 1] * mixingPos[i + 1]) >> overlapDividerBitsNorm;
+        // do intermediate scalings to avoid integer overflow
     }
 
     if (lnorm > maxnorm)
@@ -925,9 +925,12 @@ double TDStretch::calcCrossCorr(const short *mixingPos, const short *compare, do
 double TDStretch::calcCrossCorrAccumulate(const short *mixingPos, const short *compare, double &norm)
 {
     long corr;
-    unsigned long lnorm;
+    long lnorm;
     int i;
 
+    // hint compiler autovectorization that loop length is divisible by 8
+    int ilength = (channels * overlapLength) & -8;
+
     // cancel first normalizer tap from previous round
     lnorm = 0;
     for (i = 1; i <= channels; i ++)
@@ -936,15 +939,11 @@ double TDStretch::calcCrossCorrAccumulate(const short *mixingPos, const short *c
     }
 
     corr = 0;
-    // Same routine for stereo and mono. For stereo, unroll loop for better
-    // efficiency and gives slightly better resolution against rounding. 
-    // For mono it same routine, just  unrolls loop by factor of 4
-    for (i = 0; i < channels * overlapLength; i += 4) 
+    // Same routine for stereo and mono.
+    for (i = 0; i < ilength; i += 2) 
     {
         corr += (mixingPos[i] * compare[i] + 
-                 mixingPos[i + 1] * compare[i + 1]) >> overlapDividerBitsNorm;  // notice: do intermediate division here to avoid integer overflow
-        corr += (mixingPos[i + 2] * compare[i + 2] + 
-                 mixingPos[i + 3] * compare[i + 3]) >> overlapDividerBitsNorm;
+                 mixingPos[i + 1] * compare[i + 1]) >> overlapDividerBitsNorm;
     }
 
     // update normalizer with last samples of this round
@@ -1045,27 +1044,24 @@ void TDStretch::calculateOverlapLength(int overlapInMsec)
 /// Calculate cross-correlation
 double TDStretch::calcCrossCorr(const float *mixingPos, const float *compare, double &anorm)
 {
-    double corr;
-    double norm;
+    float corr;
+    float norm;
     int i;
 
+    #ifdef ST_SIMD_AVOID_UNALIGNED
+        // in SIMD mode skip 'mixingPos' positions that aren't aligned to 16-byte boundary
+        if (((ulongptr)mixingPos) & 15) return -1e50;
+    #endif
+
+    // hint compiler autovectorization that loop length is divisible by 8
+    int ilength = (channels * overlapLength) & -8;
+
     corr = norm = 0;
-    // Same routine for stereo and mono. For Stereo, unroll by factor of 2.
-    // For mono it's same routine yet unrollsd by factor of 4.
-    for (i = 0; i < channels * overlapLength; i += 4) 
+    // Same routine for stereo and mono
+    for (i = 0; i < ilength; i ++)
     {
-        corr += mixingPos[i] * compare[i] +
-                mixingPos[i + 1] * compare[i + 1];
-
-        norm += mixingPos[i] * mixingPos[i] + 
-                mixingPos[i + 1] * mixingPos[i + 1];
-
-        // unroll the loop for better CPU efficiency:
-        corr += mixingPos[i + 2] * compare[i + 2] +
-                mixingPos[i + 3] * compare[i + 3];
-
-        norm += mixingPos[i + 2] * mixingPos[i + 2] +
-                mixingPos[i + 3] * mixingPos[i + 3];
+        corr += mixingPos[i] * compare[i];
+        norm += mixingPos[i] * mixingPos[i];
     }
 
     anorm = norm;
@@ -1076,7 +1072,7 @@ double TDStretch::calcCrossCorr(const float *mixingPos, const float *compare, do
 /// Update cross-correlation by accumulating "norm" coefficient by previously calculated value
 double TDStretch::calcCrossCorrAccumulate(const float *mixingPos, const float *compare, double &norm)
 {
-    double corr;
+    float corr;
     int i;
 
     corr = 0;
@@ -1087,14 +1083,13 @@ double TDStretch::calcCrossCorrAccumulate(const float *mixingPos, const float *c
         norm -= mixingPos[-i] * mixingPos[-i];
     }
 
-    // Same routine for stereo and mono. For Stereo, unroll by factor of 2.
-    // For mono it's same routine yet unrollsd by factor of 4.
-    for (i = 0; i < channels * overlapLength; i += 4) 
+    // hint compiler autovectorization that loop length is divisible by 8
+    int ilength = (channels * overlapLength) & -8;
+
+    // Same routine for stereo and mono
+    for (i = 0; i < ilength; i ++)
     {
-        corr += mixingPos[i] * compare[i] +
-                mixingPos[i + 1] * compare[i + 1] +
-                mixingPos[i + 2] * compare[i + 2] +
-                mixingPos[i + 3] * compare[i + 3];
+        corr += mixingPos[i] * compare[i];
     }
 
     // update normalizer with last samples of this round
diff --git a/3rdparty/soundtouch/source/SoundTouch/sse_optimized.cpp b/3rdparty/soundtouch/source/SoundTouch/sse_optimized.cpp
index 38a4b243e1..9c16ea8f89 100644
--- a/3rdparty/soundtouch/source/SoundTouch/sse_optimized.cpp
+++ b/3rdparty/soundtouch/source/SoundTouch/sse_optimized.cpp
@@ -80,7 +80,7 @@ double TDStretchSSE::calcCrossCorr(const float *pV1, const float *pV2, double &a
     // Compile-time define SOUNDTOUCH_ALLOW_NONEXACT_SIMD_OPTIMIZATION is provided
     // for choosing if this little cheating is allowed.
 
-#ifdef SOUNDTOUCH_ALLOW_NONEXACT_SIMD_OPTIMIZATION
+#ifdef ST_SIMD_AVOID_UNALIGNED
     // Little cheating allowed, return valid correlation only for 
     // aligned locations, meaning every second round for stereo sound.
 

- ./bootstrap - -	Creates "configure" file with -local autoconf/automake toolset. -
- ./configure - -	- Configures the SoundTouch package for the local environment. -Notice that "configure" file is not available before running the -"./bootstrap" command as above. - -
- make - -	- Builds the SoundTouch library & SoundStretch utility. You can - optionally add "-j" switch after "make" to speed up the compilation in - multi-core systems. -
- make install - -	- Installs the SoundTouch & BPM libraries to /usr/local/lib -and SoundStretch utility to /usr/local/bin. Please notice that -'root' privileges may be required to install the binaries to the -destination locations. -
+ ./bootstrap - +	Creates "configure" file with + local autoconf/automake toolset. +
+ ./configure - +	+ Configures the SoundTouch package for the local environment. + Notice that "configure" file is not available before running the + "./bootstrap" command as above. + +
+ make - +	+ Builds the SoundTouch library & SoundStretch utility. You can + optionally add "-j" switch after "make" to speed up the compilation in + multi-core systems. +
+ make install - +	+ Installs the SoundTouch & BPM libraries to /usr/local/lib + and SoundStretch utility to /usr/local/bin. Please notice that + 'root' privileges may be required to install the binaries to the + destination locations. +
Parameter name	Default value magnitude	Larger value affects...	Smaller value affects...	Effect to CPU burden
- SEQUENCE_MS -	Default value is relatively large, chosen for -slowing down music tempo	Larger value is usually better for slowing down -tempo. Growing the value decelerates the "echoing" artifact when -slowing down the tempo.	Smaller value might be better for speeding up -tempo. Reducing the value accelerates the "echoing" artifact when -slowing down the tempo	Increasing the parameter value reduces -computation burden
- SEEKWINDOW_MS -	Default value is relatively large, chosen for -slowing down music tempo	Larger value eases finding a good mixing -position, but may cause a "drifting" artifact	Smaller reduce possibility to find a good mixing -position, but reduce the "drifting" artifact.	Increasing the parameter value increases -computation burden
- OVERLAP_MS -	Default value is relatively large, chosen to -suit with above parameters.		If you reduce the "sequence ms" setting, you -might wish to try a smaller value.	Increasing the parameter value increases -computation burden
- "infilename" -	Name of the input sound data file (in .WAV audio -file format). Give "stdin" as filename to use standard input pipe.
- "outfilename" -	Name of the output sound file where the -resulting sound is saved (in .WAV audio file format). This parameter -may be omitted if you don't want to save the output (e.g. when -only calculating BPM rate with '-bpm' switch). Give "stdout" as -filename to use standard output pipe.
- [switches] -	Are one or more control switches.
- -tempo=n -	Change the sound tempo by n percents (n = -95.0 -.. +5000.0 %)
- -pitch=n -	Change the sound pitch by n semitones (n = -60.0 -.. + 60.0 semitones)
- -rate=n -	Change the sound playback rate by n percents (n -= -95.0 .. +5000.0 %)
- -bpm=n -	Detect the Beats-Per-Minute (BPM) rate of the -sound and adjust the tempo to meet 'n' BPMs. When this switch is -applied, the "-tempo" switch is ignored. If "=n" is omitted, i.e. -switch "-bpm" is used alone, then the BPM rate is estimated and -displayed, but tempo not adjusted according to the BPM value.
- -quick -	Use quicker tempo change algorithm. Gains speed -but loses sound quality.
- -naa -	Don't use anti-alias filtering in sample rate -transposing. Gains speed but loses sound quality.
- -license -	Displays the program license text (LGPL)
+ "infilename" +	Name of the input sound data file (in .WAV audio + file format). Give "stdin" as filename to use standard input pipe.
+ "outfilename" +	Name of the output sound file where the + resulting sound is saved (in .WAV audio file format). This parameter + may be omitted if you don't want to save the output (e.g. when + only calculating BPM rate with '-bpm' switch). Give "stdout" as + filename to use standard output pipe.
+ [switches] +	Are one or more control switches.
+ -tempo=n +	Change the sound tempo by n percents (n = -95.0 + .. +5000.0 %)
+ -pitch=n +	Change the sound pitch by n semitones (n = -60.0 + .. + 60.0 semitones)
+ -rate=n +	Change the sound playback rate by n percents (n + = -95.0 .. +5000.0 %)
+ -bpm=n +	Detect the Beats-Per-Minute (BPM) rate of the + sound and adjust the tempo to meet 'n' BPMs. When this switch is + applied, the "-tempo" switch is ignored. If "=n" is omitted, i.e. + switch "-bpm" is used alone, then the BPM rate is estimated and + displayed, but tempo not adjusted according to the BPM value.
+ -quick +	Use quicker tempo change algorithm. Gains speed + but loses sound quality.
+ -naa +	Don't use anti-alias filtering in sample rate + transposing. Gains speed but loses sound quality.
+ -license +	Displays the program license text (LGPL)