lucstechblog: 2026

Friday, March 6, 2026

Make Gemini speak

For an index to all my stories click this text.

In the previous story I showed how to build a webpage on which you could type a question. The question was then sent to a simple AI system and the answer was spoken out.
You can read that story here: https://lucstechblog.blogspot.com/2026/02/2a-speaking-ai.html

It works but has some flaws. It uses a special service to avoid a CORS error. And that service is sometimes so busy that you need to retry sending your question a few times. Which is annoying.

I had better luck with Google's Gemini AI.
I know that there are people out there that hate Google. But I actually like them. And their AI has a great free tier, and it works great.
So I am going to rebuild the webpage with Gemini.

To use the following program you will need to obtain an API key from VoiceRSS. Read this story that tells you how to get it: https://lucstechblog.blogspot.com/2026/02/text-to-speech-with-voicerss.html

And you'll need an API key for using Google's Gemini. Read this story that tells how to get it: https://lucstechblog.blogspot.com/2026/02/build-webpage-with-gemini-ai-as-your.html

Sidenote.

Javascript is a fun language which is not very difficult to learn. And you get instant results in your web browser. The language is useful for all kinds of projects, including IOT projects of which you will find several on this weblog.
To make programming in Javascript easier I collected over 500 tips and tricks and bundled them in a book. The book is distributed worldwide by amazon.com.

Click here to learn more about this book or to order it.

So what we are going to do is build a webpage on which you can type a question. That question is sent to Gemini and the answer is spoken out loud. So turn up the volume of your computer's speakers and let's go.

And here is the complete program.

<!DOCTYPE html>
<html lang="en">
<head>
  <meta charset="UTF-8">
  <title>Gemini Web Chat Demo</title>
  <style>
    body {
      font-family: system-ui, sans-serif;
      background: #f7f7f7;
      padding: 2rem;
    }
    #container {
      background: #fff;
      padding: 1.5rem;
      border-radius: 8px;
      box-shadow: 0 0 10px rgba(0,0,0,0.1);
      max-width: 600px;
      margin: auto;
    }
    textarea {
      width: 100%;
      font-family: inherit;
      font-size: 1rem;
      padding: 0.6rem;
      border: 1px solid #ccc;
      border-radius: 6px;
      resize: none;           /* user can’t drag resize */
      overflow: hidden;       /* hide scrollbar */
      min-height: 2.5rem;
    }
    button {
      margin-top: 0.5rem;
      padding: 0.4rem 1rem;
    }
    #response {
      margin-top: 1rem;
      white-space: pre-wrap;
      background: #f0f0f0;
      padding: 1rem;
      border-radius: 6px;
      min-height: 100px;
    }
  </style>
</head>
<body>
  <div id="container">
    <h2>Ask Gemini</h2>
    <textarea id="userInput" placeholder="Type your question..."></textarea>
    <br>
    <button id="sendBtn">Send</button>

    <h3>Response:</h3>
    <div id="response" contenteditable="true"></div>

    <div id="output"></div>
    <audio id="audioPlayer" controls></audio>

  </div>

  <script>
    const Google_API_KEY = "PUT-YOUR-GEMINI-API-KEY-HERE"; // Replace with your key
    const MODEL = "gemini-2.5-flash";//works
    const textarea = document.getElementById("userInput");
    const sendBtn = document.getElementById("sendBtn");

    // Auto-resize textarea
    textarea.addEventListener("input", () => {
      textarea.style.height = "auto";
      textarea.style.height = textarea.scrollHeight + "px";
    });

    sendBtn.addEventListener("click", async () => {
      const input = textarea.value.trim();
      if (!input) return alert("Please enter a question.");
      const responseDiv = document.getElementById("response");
      responseDiv.textContent = "Loading...";

      try {
        const res = await fetch(
          `https://generativelanguage.googleapis.com/v1beta/models/${MODEL}:generateContent?key=${Google_API_KEY}`,
          {
            method: "POST",
            headers: { "Content-Type": "application/json" },
            body: JSON.stringify({
              contents: [{ parts: [{ text: input }] }] // ✅ send user input
            })
          }
        );


        const data = await res.json();
        const text = data?.candidates?.[0]?.content?.parts?.[0]?.text || "(No response)";
        responseDiv.textContent = text;


        let rec_ans = text || "";
        rec_ans = rec_ans.replace(/[^A-Za-z0-9 \n.,!?\\*+\-%@$&:<>()[\]{}"`]/g, "").trim();

        //document.getElementById("output").textContent = rec_ans;
        //console.log("Received Answer:", rec_ans);

        const VoiceRSS_API_KEY = 'PUT-YOUR-VOICERSS-API-KEY-HERE'; // Replace with your key
        const url = 'https://api.voicerss.org/';

        const params = new URLSearchParams({
          key: VoiceRSS_API_KEY,
          hl: 'en-us',
          v: 'Amy',
          f: '8khz_16bit_mono',
          src: rec_ans
        });

        const ttsResponse = await fetch(url, {
          method: 'POST',
          headers: { 'Content-Type': 'application/x-www-form-urlencoded' },
          body: params
        });

        if (!ttsResponse.ok) {
          console.error('❌ TTS request failed:', ttsResponse.status);
          return;
        }

        const audioBlob = await ttsResponse.blob();
        const audioUrl = URL.createObjectURL(audioBlob);

        const audioPlayer = document.getElementById("audioPlayer");
        audioPlayer.src = audioUrl;

        audioPlayer.play().catch(err => console.error("🎧 Playback error:", err));

        console.log("🎵 Playing audio...");


        // ✅ Automatically trigger audio file download (no visible button)
        // No codec (c) is set above, so the audio should be in VoiceRSS's default WAV format
        const downloadLink = document.createElement('a');
        downloadLink.href = audioUrl;
        downloadLink.download = `response_${Date.now()}.wav`;
        document.body.appendChild(downloadLink);
        downloadLink.click();
        downloadLink.remove(); // clean up the temporary link

      } // end of try part

      catch (err) {
        responseDiv.textContent = "Error: " + err.message;
      }
    });
  </script>
</body>
</html>

I do not think there are any real pitfalls here, and the code is derived from the code in the previous stories. So I will not go into details here. Please check those previous stories for clarification of the code.
You can also send me a message if you want an explanation of a certain part of the code.
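
One part that might deserve a closer look is how the answer is pulled out of the JSON that Gemini sends back. Below is a minimal sketch of that step as a separate function. The name extractGeminiText is made up for this example, and I am assuming that a failed request comes back with an "error" object that has a "message" field.

```javascript
// Sketch: get the answer text (or an error message) out of a parsed
// generateContent response. extractGeminiText is a hypothetical helper name.
function extractGeminiText(data) {
  // Assumption: on failure the API returns an object with an "error" field
  if (data?.error) {
    return "Error: " + (data.error.message || "unknown error");
  }
  const parts = data?.candidates?.[0]?.content?.parts || [];
  // An answer can be split over several parts, so join them all
  const text = parts.map(p => p.text || "").join("").trim();
  return text || "(No response)";
}
```

In the program you could then write const text = extractGeminiText(data); instead of the long chain of question marks.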

How to use this.

Copy the complete code and paste it in your favorite editor. Then save it as gemini.html. You can change that name to anything you want as long as it ends in .html
Then open the directory where you saved the program and click on its icon.
Your default web browser will open with the webpage.

You can now type your question. Then press the Send button.
The answer will be written as text in the Response field and also spoken.
The audio file is also saved to your downloads folder, so you can use it for different projects.
Pressing the play button at the bottom of the screen will replay that audio. So if you misheard the answer you can play it again without sending the question to Gemini again.

You can restrict the answer from Gemini by adding to your question: "answer in x lines without giving an explanation", where x can be any number of your choice.
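
If you do not want to type that restriction every time, the program could add it for you. This is just a small sketch. The function name limitAnswer is something I made up, it is not part of the program above.

```javascript
// Sketch: append the length restriction to whatever the user typed.
// limitAnswer is a hypothetical helper, not part of the original program.
function limitAnswer(question, lines) {
  return `${question.trim()} Answer in ${lines} lines without giving an explanation.`;
}

console.log(limitAnswer("Why is the sky blue?", 2));
```

In the fetch call you would then send limitAnswer(input, 2) instead of input.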

Have fun playing with this. I sure am !!!

Till next time

Luc Volders

Friday, February 27, 2026

Speaking AI

Let's do something fun.
This story shows how you can send a question to an AI chatbot and have the answer spoken out !! This story falls back on two previous stories.

The first one is building a webpage with JavaScript that accesses a simple AI system.
You can read that story here : https://lucstechblog.blogspot.com/2026/02/easy-ai-with-javascript.html

The second story shows how to use the VoiceRSS service to make your webpage generate spoken text.
You can read that story here: https://lucstechblog.blogspot.com/2026/02/text-to-speech-with-voicerss.html

How about typing a text on a webpage, sending that text to an AI system and then sending the answer to VoiceRSS? Our webpage then plays the audio received from VoiceRSS.

Actually this is quite simple to achieve. The only thing we need to do is to combine the programs of the two previous (above mentioned) stories.

First thing to do.

The first thing to do is to get your own API key from VoiceRSS. This was discussed in the following story:
https://lucstechblog.blogspot.com/2026/02/text-to-speech-with-voicerss.html
So make sure to carefully read that story and get your own API Key.

Once you have the key you can fill it in in the next program.

The basic program.

<!DOCTYPE html>
<html>
<head>
    <title></title>
    <meta charset='utf-8' />
    <script src='voicerss-tts.min.js'></script>
</head>
<body>
    <script>
        VoiceRSS.speech({
            key: 'PUT-YOUR-API-KEY-HERE',
            src: 'Hello, world!',
            hl: 'en-us',
            v: 'Linda',
            r: 0,
            c: 'mp3',
            f: '44khz_16bit_stereo',
            ssml: false
        });
    </script>
</body>
</html>

Actually this program works and speaks out the words "Hello, world!"
So copy this and paste it in your favorite editor and then save it as an HTML file. The voicerss-tts.min.js library has to be in the same folder, as explained in the VoiceRSS story.
Do not forget to paste in your own API key where it says: PUT-YOUR-API-KEY-HERE

Click on the icon and your web browser will open and after a few seconds the text will be spoken. So make sure to turn the volume of your speakers up.

Sidenote.

The script (program) inside the webpage is Javascript. Javascript is actually not very complicated to use. And the best part is that you can get immediate results on a simple webpage. However, Javascript is very versatile and can be used for all kinds of projects. To make programming with Javascript a lot easier I collected more than 500 programming tips and tricks and put them in a book. You can buy this book at Amazon and they ship worldwide.

Click here to learn more or buy this book.

The program looks and is very simple but that is because it imports a library from the internet.
The point is that I actually do not want to import a library.
Why ???
Well because it is totally unnecessary. You can use VoiceRSS with a simple fetch command.

So I wrote a complete program that is a webpage which has an input field in which you can type your question.

When you then press the send button the question is sent to a simple AI system. The answer that the AI system sends back is then sent to VoiceRSS, and the audio that comes back is spoken out.
The audio is also saved in your downloads directory, as a wav file. This way you can use that audio file in other programs or projects.

And there is an audio player at the bottom of the page that can replay the text that was spoken. It just plays the audio that was already received, so nothing has to be sent again.

Here is the complete program:

<!DOCTYPE html>
<html lang="en">
<head>
  <meta charset="UTF-8">
  <title>Simple AI Fetch</title>
  <style>
    body {
      font-family: Arial, sans-serif;
      margin: 20px;
    }

    textarea {
      width: 100%;
      min-height: 100px;
      resize: vertical;
      box-sizing: border-box;
      padding: 10px;
      font-size: 16px;
      border: 1px solid #ccc;
      border-radius: 5px;
      overflow-y: auto;
    }

    button {
      margin-top: 10px;
      padding: 10px 20px;
      font-size: 16px;
      cursor: pointer;
    }

    pre {
      background: #f9f9f9;
      border: 1px solid #ddd;
      padding: 10px;
      white-space: pre-wrap;
      word-wrap: break-word;
    }

    audio {
      margin-top: 10px;
      width: 100%;
    }
  </style>
</head>
<body>
  <h2>Ask a Question</h2>

  <!-- Replaced input with textarea -->
  <textarea id="userInput" placeholder="Type your question..."></textarea>
  <br>
  <button onclick="sendRequest()">Send</button>

  <h3>Response:</h3>
  <pre id="output"></pre>

  <audio id="audioPlayer" controls></audio>

  <script>
    async function sendRequest() {
      const userText = document.getElementById("userInput").value.trim();
      if (!userText) return alert("Please enter a question!");

      const targetUrl = "http://nimaartman.ir/science/L1.php?text=" +
                        encodeURIComponent("answerinenglishnoutf8anywhere.:" + userText);

      const proxyUrl = "https://api.allorigins.win/raw?url=" + encodeURIComponent(targetUrl);

      try {
        const response = await fetch(proxyUrl);
        const text = await response.text();

        let data;
        try {
          data = JSON.parse(text);
        } catch {
          data = { data: text };
        }

        let rec_ans = data.data || "";
        rec_ans = rec_ans.replace(/[^A-Za-z0-9 \n.,!?\\*+\-%@$&:<>()[\]{}"`]/g, "").trim();

        document.getElementById("output").textContent = rec_ans;
        console.log("Received Answer:", rec_ans);

        const API_KEY = 'YOUR-OWN-API-KEY-HERE';
        const url = 'https://api.voicerss.org/';

        const params = new URLSearchParams({
          key: API_KEY,
          hl: 'en-us',
          v: 'Amy',
          f: '8khz_16bit_mono',
          src: rec_ans || "I have turned the lights off"
        });

        const ttsResponse = await fetch(url, {
          method: 'POST',
          headers: { 'Content-Type': 'application/x-www-form-urlencoded' },
          body: params
        });

        if (!ttsResponse.ok) {
          console.error('❌ TTS request failed:', ttsResponse.status);
          return;
        }

        const audioBlob = await ttsResponse.blob();
        const audioUrl = URL.createObjectURL(audioBlob);

        const audioPlayer = document.getElementById("audioPlayer");
        audioPlayer.src = audioUrl;

        audioPlayer.play().catch(err => console.error("🎧 Playback error:", err));

        console.log("🎵 Playing audio...");


        // ✅ Automatically trigger audio file download (no visible button)
        // No codec (c) is set above, so the audio should be in VoiceRSS's default WAV format
        const downloadLink = document.createElement('a');
        downloadLink.href = audioUrl;
        downloadLink.download = `response_${Date.now()}.wav`;
        document.body.appendChild(downloadLink);
        downloadLink.click();
        downloadLink.remove(); // clean up the temporary link


      } catch (error) {
        console.error('⚠️ Request failed:', error);
      }
    }
  </script>
</body>
</html>

Nothing dramatically complicated here.
I will just highlight a few sections.

      const targetUrl = "http://nimaartman.ir/science/L1.php?text=" +
                        encodeURIComponent("answerinenglishnoutf8anywhere.:" + userText);

This is the URL for the AI we are going to use for asking our questions.
But using this in a fetch command will throw a CORS error.
I have written about that in this story : https://lucstechblog.blogspot.com/2026/01/solving-cors-error-with-javascript.html

      const proxyUrl = "https://api.allorigins.win/raw?url=" + encodeURIComponent(targetUrl);

To avoid the CORS error we are going to use an external service called AllOrigins.
So we are going to wrap our fetch command into an AllOrigins fetch. Please read the story about this that I mentioned above.
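
To see what that wrapping really does, here is a small sketch that builds the proxy URL and then reads the wrapped address back out of it. The function name buildProxyUrl is made up for this example. The two URLs are the same ones used in the program.

```javascript
// Sketch: wrap the AI address inside an AllOrigins address.
// buildProxyUrl is a hypothetical helper name.
function buildProxyUrl(userText) {
  // First encoding: the question becomes a safe value for the text parameter
  const targetUrl = "http://nimaartman.ir/science/L1.php?text=" +
                    encodeURIComponent("answerinenglishnoutf8anywhere.:" + userText);
  // Second encoding: the whole AI address becomes a safe value for the url parameter
  return "https://api.allorigins.win/raw?url=" + encodeURIComponent(targetUrl);
}

const exampleProxyUrl = buildProxyUrl("What is CORS?");
// Reading the url parameter back gives the original AI address again
const wrappedUrl = new URL(exampleProxyUrl).searchParams.get("url");
console.log(wrappedUrl);
```

The question is encoded twice: once for the AI address and once more because that whole address travels as a parameter of the AllOrigins address.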

The AI system does not need you to log in or make an account. So you also do not need an API key for that.

But VoiceRSS does need an API key.
I wrote how to get that key in this story:
https://lucstechblog.blogspot.com/2026/02/text-to-speech-with-voicerss.html
So follow that to get your own key.

        const API_KEY = 'YOUR-OWN-API-KEY-HERE';

This is where you fill in your own API key for VoiceRSS.

        const params = new URLSearchParams({
          key: API_KEY,
          hl: 'en-us',
          v: 'Amy',
          f: '8khz_16bit_mono',
          src: rec_ans || "I have turned the lights off"
        });

This is where you set the language and voice. You can find all supported languages and voices on https://www.voicerss.org/api/
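
If you want to switch voices without editing the parameter block every time, you could put it in a small function. This is only a sketch. The name buildTtsParams is made up, and the voice Linda is simply the one from the library example earlier in this story.

```javascript
// Sketch: build the VoiceRSS form parameters with a selectable language and voice.
// buildTtsParams is a hypothetical helper name.
function buildTtsParams(apiKey, text, language = "en-us", voice = "Amy") {
  return new URLSearchParams({
    key: apiKey,
    hl: language,          // language code
    v: voice,              // voice name, must belong to the chosen language
    f: "8khz_16bit_mono",  // audio format, same as in the program
    src: text              // the text to speak
  });
}

const lindaParams = buildTtsParams("YOUR-OWN-API-KEY-HERE", "Hello, world!", "en-us", "Linda");
console.log(lindaParams.toString());
```

The result can be used as the body of the fetch command, just like params in the program.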

The rest is just the fetch command and the code for playing the audio.

How to use this.

Just copy the above code and paste it in your favorite editor. Then save it as spoken-ai.html or give it any name you like. Then open the directory where you saved it and click on the icon. Your default browser will open and show the webpage.

Put your question in the input field and press the send button. After a second or two the answer to your question will be spoken out. So turn up the volume of your speakers.

The audio file with the spoken text is saved to your download directory. And pressing the play button replays the audio (speaking the text) without the need to press the send button again.

Things to consider.

It takes a while before the program starts speaking the answer to your question. Sometimes it takes just a second and sometimes it takes a few seconds.
This is because the program first has to access the AllOrigins API, then the AI and then VoiceRSS. And sometimes these services (especially AllOrigins) can get a bit busy.

And just sometimes it will not work at all.
You can tell that AllOrigins is not working because you'll get a CORS error.
You can see that error if you open the console window in your browser (in more tools - developer tools).
The only thing left to do then is to retry sending your question a few times.
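
The retrying could also be done by the program itself. Below is a minimal sketch of that idea. The function fetchWithRetry, the number of attempts and the waiting time are all my own choices for this example, they are not part of the program above.

```javascript
// Sketch: try a fetch a few times before giving up.
// fetchWithRetry is a hypothetical helper. fetchFn can be replaced for testing.
async function fetchWithRetry(url, attempts = 3, delayMs = 1000, fetchFn = fetch) {
  let lastError;
  for (let i = 1; i <= attempts; i++) {
    try {
      const response = await fetchFn(url);
      if (response.ok) return response;        // success, stop retrying
      lastError = new Error("HTTP status " + response.status);
    } catch (err) {
      lastError = err;                         // network or CORS failure
    }
    if (i < attempts) {
      // wait a moment before the next attempt
      await new Promise(resolve => setTimeout(resolve, delayMs));
    }
  }
  throw lastError;                             // all attempts failed
}
```

In the program you would replace fetch(proxyUrl) with fetchWithRetry(proxyUrl). If all attempts fail the error still ends up in the existing catch block.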

You could solve this by building your own local AllOrigins server: their Github page tells you how. But that is maybe asking too much.

So maybe we should stick to the Gemini version..........
I am going to show you how to add voice to that next time.
Till that time you really can play and have fun with this.

Till next time then.
Have fun

Luc Volders

Friday, February 20, 2026

Text to Speech with voicerss

This story shows how to convert text to spoken word using voicerss. This is the first in a series. This story tells how to use this service in your browser.

Some background.

I love playing with text to speech programs and services. I also think this can be a valuable addition to your projects. In the most dramatic scenario you can give spoken word feedback in your IOT projects to a blind person or a person who is visually impaired.
But there are more projects where a spoken word feedback can be valuable. And besides that it is just fun to hear your computer or microcontroller speak to you. There are several speech related projects on this weblog, but they use your phone for speech conversion.

In this and the upcoming story we are going to use a service called voicerss. And this will work on your computer but also on a microcontroller !! I will start with the computer version.

Voicerss.

Voicerss is a company that offers text to speech conversion. It is a commercial company but they have a free tier. And that tier is really generous.
You can make 350 free conversions per day and each one can contain no less than 100K of text. That is an awful lot for a free tier.

Even for small messages this is a lot.
350 messages a day is about 14 messages per hour. But you are not likely to be awake 24 hours a day. So if you, for example, use this 10 hours a day you can send 35 messages per hour, which is roughly one every 2 minutes. Well, you can do the calculations yourself.
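
If you would rather let the computer do those calculations, here is a tiny sketch. The only number taken from VoiceRSS is the 350 free conversions per day. The function name quota is made up.

```javascript
// Sketch: how far do 350 free conversions per day get you?
const FREE_CONVERSIONS_PER_DAY = 350;

// quota is a hypothetical helper name
function quota(hoursPerDay) {
  const perHour = FREE_CONVERSIONS_PER_DAY / hoursPerDay; // messages per hour
  const minutesBetween = 60 / perHour;                    // minutes between two messages
  return { perHour, minutesBetween };
}

console.log(quota(24)); // used around the clock
console.log(quota(10)); // used 10 hours a day
```

Used 10 hours a day this works out to 35 messages per hour, so a bit less than 2 minutes between messages.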

Create an account

To use voicerss there is a very simple API that can be called from a large variety of computer languages.
You do need to make an account to obtain your personal API key.

Start with visiting the Voicerss site: https://www.voicerss.org/

Choose login from the menu and at the bottom choose registration.

Fill in all the fields. You do not need to give a company name. But do fill in your real e-mail address. And of course choose an appropriate password.
Then check "I am not a robot" and press the register button at the bottom.

A confirmation email will be sent to your email address. So open your email program, look for that mail and click on the link.

The profile screen now shows at the bottom that your account is active. And it shows your API key.

The voicerss API

The API is what we need to let our computer (or microcontroller) communicate with voicerss. It is really very easy.

Look at the page with the API info.

If you scroll a bit down you can see the examples.
You can copy any one of them and just paste it into your browser's URL field.
But before pressing enter change key=1234567890QWERTY.
After the = fill in your own API key.

Then press enter. And hear the magic happening.

Experiment with this by changing the text to anything you like.
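
You can also let Javascript build such a URL for you, so that spaces and special characters in your text are encoded properly. This is a minimal sketch. The function name buildVoiceRssUrl is made up, and the key is the dummy key from the examples, so replace it with your own.

```javascript
// Sketch: build a VoiceRSS URL that can be pasted into the browser's URL field.
// buildVoiceRssUrl is a hypothetical helper name.
function buildVoiceRssUrl(apiKey, text, language = "en-us") {
  const params = new URLSearchParams({ key: apiKey, hl: language, src: text });
  return "https://api.voicerss.org/?" + params.toString();
}

const testUrl = buildVoiceRssUrl("1234567890QWERTY", "Hello, world!");
console.log(testUrl);
```

Run this in the console of your browser, copy the printed URL and paste it in the URL field.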

Javascript example

There are several examples at the SDK page. Just pick your preferred language and click on it.

This is the page that shows how to use voicerss with Javascript.

Start with clicking on Download Javascript text-to-speech sdk.

The download starts immediately. And a zip file will be downloaded in your computer's download folder. I transferred it to a folder called voicerss that I made for this article.

Clicking on that zip file reveals that it contains just a small javascript library called voicerss-tts.min.js. Extract that to the same folder where you are going to put your own program. This is important. The voicerss-tts.min.js library should be in the same folder as your javascript program. If you do not put it there your program will not be able to find the library.

At the bottom of the Javascript SDK page there is also a small example program. I downloaded that also and called it voicetest-minimal.html

<!DOCTYPE html>
<html>
<head>
    <title></title>
    <meta charset='utf-8' />
    <script src='voicerss-tts.min.js'></script>
</head>
<body>
    <script>
        VoiceRSS.speech({
            key: '<API key>',
            src: 'Hello, world!',
            hl: 'en-us',
            v: 'Linda',
            r: 0,
            c: 'mp3',
            f: '44khz_16bit_stereo',
            ssml: false
        });
    </script>
</body>
</html>

Above is this program and as you can see it is very simple. That is of course because it uses the voicerss-tts.min.js library.

To get this working you need to change <API key> to your own API key. Only the key, do not put in the brackets.

Save the html page and then double click on it. Your browser should open and say the words Hello world. Make sure to have the volume of your speakers up.

Now this is a very simple program that can be adapted for many different projects. You could embed this in your IOT dashboards or in any project that uses Javascript to monitor or control data.

No library needed

As you have seen above you can paste the API call directly in the browser's URL field. So obviously, using the right code, you do not need the Javascript library.

I wrote a program in Javascript (and admittedly had some help with the styling). This opens a webpage with a field in which you can type text.

The code is below. Just copy it from this page. Paste it into an editor and save it as voice-test.html or something like that. Then click on that file and it will open in your favorite browser.

Pressing the SEND button sends the text to voicerss and the program has an audio player that plays the received spoken words.
You can replay audio as often as you like by pressing the play button in the audio player.
You can of course also alter the text and resend it to voicerss.

But there is more

The received audio is also saved in your Downloads folder.
For those who wonder: my computer is a Raspberry Pi5 with 8GB and I am running Raspberry Pi OS Trixie with the KDE Plasma shell. That is why the downloads folder might look unfamiliar.

Here is the program:

<!DOCTYPE html>
<html lang="en">
<head>
  <meta charset="UTF-8">
  <meta name="viewport" content="width=device-width, initial-scale=1.0">
  <title>Text to Speech Demo</title>
  <style>
    body {
      font-family: Arial, sans-serif;
      padding: 20px;
      background: #f9f9f9;
    }

    textarea {
      width: 100%;
      height: 150px;
      padding: 10px;
      font-size: 16px;
      border: 1px solid #ccc;
      border-radius: 5px;
      resize: vertical;
    }

    button {
      margin-top: 10px;
      padding: 10px 20px;
      font-size: 16px;
      background-color: #0078d4;
      color: white;
      border: none;
      border-radius: 5px;
      cursor: pointer;
    }

    button:hover {
      background-color: #005fa3;
    }

    audio {
      margin-top: 20px;
      width: 100%;
    }
  </style>
</head>
<body>
  <h1>Text-to-Speech (VoiceRSS)</h1>

  <textarea id="textInput" placeholder="Type your text here..."></textarea><br>
  <button id="sendButton">Send</button>

  <audio id="audioPlayer" controls></audio>

  <script>
    const button = document.getElementById('sendButton');
    const textInput = document.getElementById('textInput');
    const audioPlayer = document.getElementById('audioPlayer');

    button.addEventListener('click', async () => {
      const text = textInput.value.trim();
      if (!text) {
        alert('Please enter some text first.');
        return;
      }

      const apiKey = 'replace-with-your-key'; // replace with your real VoiceRSS key
      const language = 'en-us';
      const voice = 'Amy';

      try {
        // VoiceRSS expects a POST with form data
        const formData = new FormData();
        formData.append('key',apiKey);
        formData.append('src', text);
        formData.append('hl', language);
        formData.append('v', voice);
        formData.append('c', 'WAV'); // codec
        formData.append('f', '8khz_16bit_mono');

        const response = await fetch('https://api.voicerss.org/', {
          method: 'POST',
          body: formData
        });

        const blob = await response.blob();

        // VoiceRSS sometimes returns text error messages instead of audio
        const contentType = blob.type || '';
        if (!contentType.startsWith('audio/')) {
          const textError = await blob.text();
          console.error('VoiceRSS error:', textError);
          alert('VoiceRSS error: ' + textError);
          return;
        }

        // Create object URL for playback and saving
        const audioUrl = URL.createObjectURL(blob);

        // Play in audio element
        audioPlayer.src = audioUrl;
        audioPlayer.play();

        // Trigger file save
        const downloadLink = document.createElement('a');
        downloadLink.href = audioUrl;
        const safeText = text.slice(0, 20).replace(/[^a-z0-9]/gi, '_'); // clean name
        // The codec requested above is WAV, so save with a matching .wav extension
        downloadLink.download = `tts_${safeText || 'output'}.wav`;
        document.body.appendChild(downloadLink);
        downloadLink.click();
        document.body.removeChild(downloadLink);
      } catch (err) {
        console.error(err);
        alert('Failed to fetch or play audio.');
      }
    });
  </script>
</body>
</html>

There are just two things in this program that I want to go into in detail.

      const apiKey = 'replace-with-your-key'; // replace with your real VoiceRSS key
      const language = 'en-us';
      const voice = 'Amy';

Before running this code replace 'replace-with-your-key' with your own API key.

      try {
        // VoiceRSS expects a POST with form data
        const formData = new FormData();
        formData.append('key',apiKey);
        formData.append('src', text);
        formData.append('hl', language);
        formData.append('v', voice);
        formData.append('c', 'WAV'); // codec
        formData.append('f', '8khz_16bit_mono');

        const response = await fetch('https://api.voicerss.org/', {
          method: 'POST',
          body: formData
        });

And here you can see that I did not use the Javascript library but made an ordinary fetch request.

And hey ? What's that ???
The codec is WAV and 8khz in 16 bit mono ???

Can we do something with that ??????
Well that is for another story. You'll be surprised.

The API

I urge you to look at the API page.

You can change:

- The language
- The voice: male/female, often multiple voices
- The audio codec: MP3/WAV/AAC/OGG/CAF
- The audio format: from 8 kHz, 8 bit, mono up to 48 kHz, 16 bit, stereo

Plenty of room to play around with.
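
Because the codec decides what kind of file you get back, it is handy to derive the name of the downloaded file from it. Here is a small sketch. The name downloadName and the table of extensions are my own assumptions, so check them against the API page.

```javascript
// Sketch: pick a file extension that matches the requested codec.
// EXTENSIONS and downloadName are hypothetical names for this example.
const EXTENSIONS = { MP3: "mp3", WAV: "wav", AAC: "aac", OGG: "ogg", CAF: "caf" };

function downloadName(text, codec) {
  // Same clean-up as in the program: first 20 characters, odd characters become _
  const safeText = text.slice(0, 20).replace(/[^a-z0-9]/gi, "_");
  const ext = EXTENSIONS[codec.toUpperCase()] || "bin"; // unknown codec: neutral extension
  return `tts_${safeText || "output"}.${ext}`;
}

console.log(downloadName("Hello, world!", "WAV"));
```

This way the file in your downloads folder always gets an extension that fits its contents.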

Concluding

voicerss is a cloud based service and I actually lately have a dislike for cloud based services. The dislike comes from many cloud based services that suddenly shut down (dweet, original blynk, iotTweet, logitech pop etc.) or suddenly start charging for their services (IFTTT webhooks).

Nevertheless voicerss is a fun service to play with and build some projects around. I have some ideas for this and will publish them as soon as they are mature.
The API is very easy to use in all kinds of programming languages.
And most important: they have a very generous free tier which makes it possible for us mere hobbyists to build some great projects for free.

Till next time
have fun

Luc Volders