18 comments

  • nceasy 950 days ago
    Hey nice one! I'm trying to innovate in the STT and TTS space as well (completely different field), feel free to contact me in my profile email if you want to exchange some knowledge :). I hear some bugs in portuguese, but I'm guessing the trained model has some issues. Congrats and Good luck with your tool! Ps: give users some preview of the speakers voice so we can test it before convert audio, it can save you some resources.
    • Paul_Grsl 950 days ago
      Hello, we can meet on twitter: grsl_en :) thanks for your comment, it's still a v1 in beta but I will continue to improve it. And yes as you can see in the ui there are buttons for preview but not yet functional :)
  • trabant00 950 days ago
    It does not handle apostrophes good. It says "we L L" instead of we'll or "Andrea S" instead of Andrea's. It also has some problem with pacing around dashes, three points and quotes. It speeds up a lot and connects the words before and after those. Overall would not use it in this state to turn articles into podcasts or something like that.
  • capableweb 950 days ago
    What about something that can do the opposite? Like converting video and/or audio to articles?

    Most of the content I consume fits best (for me) in article format, so I can read it at my own speed, but some really good information can only (annoyingly) be found in videos or podcasts.

    • roryisok 950 days ago
      I've been using whisper from openAI to transcribe stuff this month, and its incredibly accurate. would be a good base for something like this
    • Paul_Grsl 950 days ago
      This is a good idea, I just started the project :) So this is a feature that can be added in the future.

      Here is the roadmap for the future: - Audio sharing - Convert Text To Audio - Convert PDF To Audio - Convert Photo To Audio - Chrome extension - to convert while browsing - Mobile App - to manage audios everywhere, simply

      and adding the possibility to do the opposite is also a great idea!

    • cblavier 950 days ago
      Working on this very topic right now! But specific to Podcast audio content

      https://readable.fm/

    • veb 950 days ago
      oh mate I so agree and as a deaf person this would be a godsend. way too much shit is in videos or podcasts. please just let me read...
  • kretaceous 950 days ago
    I got super psyched to try this. Always wanted a good TTS extension or app.

    However I cannot get it to work. I've logged in, input an article but nothing happens after I click "Convert article to audio" or preview.

    Linux Mint/Firefox 107/Chrome

    Edit:

    I checked devtools and it shows a 500 error with the message "Something is broken. Please let us know what you did"

    The link I was trying to convert was:

    https://www.daemonology.net/blog/2020-09-20-On-the-use-of-a-...

  • akuji1993 950 days ago
    Hey, your tool is not working great with german articles. It can't pronounce Umlauts and also has trouble with some pretty standard simple words.

    I used this article as a test: https://www.saarbruecker-zeitung.de/saarland/landespolitik/s...

    • Paul_Grsl 950 days ago
      I will look at why the charactere that Umlauts are not well pronounced. Do you have examples of other words? I will investigate :)

      Danke für dein Feedback, das hilft mir wirklich weiter .

      • cauners 950 days ago
        On the same topic, in Latvian all the characters with diacritics are stripped out completely. For example, "iedzīšana" is pronounced as "iedzana", making the audio pretty funny, though hard to understand.
        • Paul_Grsl 950 days ago
          I'm working on solving these pronunciation problems for the special characters if I unlock one everything else will follow. Don't hesitate to sign up to be kept informed!
  • nkoren 950 days ago
    Huh, wow, really cool! For the most part it's excellent (in English), however I notice that quotation marks (both single and double) are handled strangely. The leading quotation mark is pronounced as a short A, and the trailing as a long A. This can be rather confusing! But otherwise, I'm incredibly impressed by the results.
    • Paul_Grsl 950 days ago
      Thank you very much for your comment!! It's still a beta version and I have to improve some pronunciations! Especially for the special characters... don't hesitate to signup to be kept informed :)
      • wazoox 950 days ago
        It's certainly that all UTF-8 characters (like UTF-8 fancy quotes and double-quotes) aren't properly interpreted, everything seems to go through as ISO8859-15.
    • malshe 950 days ago
      I also noticed the same issue with quotation marks. But other than that this is a really nice application
  • schreon 950 days ago
    Nice! The german version breaks Umlauts though. Apparently the preprocessing converts e.g. "ä" to "ae", "ö" to "oe" and so on and the text2voice model subsequently pronounces them as e.g. "a-e", instead of what "ä" would actually sound.
    • Paul_Grsl 950 days ago
      Indeed we have a problem with special characters in German (and also Polish… :( ) I am investigating why they are not pronounced correctly.

      I studied German at school but not enough.

      Vielen Dank für Ihren Kommentar, er hilft mir, das Tool zu verbessern :)

  • hutrdvnj 950 days ago
    Could you make a paid TTS engine App on Android and iOS?
    • Paul_Grsl 950 days ago
      Yes it's in the roadmap. and also a chrome extension :) would you be interested ?
      • hutrdvnj 950 days ago
        Yes, I currently use the Read Aloud Android App and it allows me to use any installed TTS. The Google free network TTS voices are quite okay, but I know that there are better premium voices unfortunately I didn't found any high quality human like TTS in the play store as of yet.
  • tiffanyh 950 days ago
    Dumb question: how is “AI” used for text-to-speech?
    • Paul_Grsl 950 days ago
      Text-to-Speech (TTS) technology uses artificial intelligence (AI) translate written information in a given language into a sound, voice or speech with a human accent.to learn the AI had to learn with many parameters, so that the pronunciation improves from version to version :)
  • aronatom 950 days ago
    Tested icelandic! sound really good except for ignoring all special icelandic character such as ð, ó, á ,ö and so on
    • Paul_Grsl 950 days ago
      Thanks for your comment! I'm actually working on this point to fix it as FAST as possible! sorry for that...
    • roland_szabo 950 days ago
      I had the same issue with hungarian language.
      • Paul_Grsl 950 days ago
        I'm working on solving these pronunciation problems for the special characters if I unlock one everything else will follow. Don't hesitate to sign up to be kept informed!
        • aronatom 950 days ago
          I will! Great stuff
  • judex 950 days ago
    Great work! I'm testing if I could use it in my project. It would be good to be able to just paste some text.
    • Paul_Grsl 950 days ago
      Thank you so much! What's your project?

      Here is the roadmap for the future: · Audio sharing ----> FOR YOU · Convert Text To Audio · Convert PDF To Audio · Convert Photo To Audio · Chrome extension - to convert while browsing · Mobile App - to manage audios everywhere, simply

  • jerpint 950 days ago
    I had this idea a few months ago, obviously I never got around to executing it. I’m glad someone else did
    • Paul_Grsl 950 days ago
      That famous moment when you have the idea of a new side project, you buy the domain name and then... you have a new idea (loop).

      This time I developed it, I'm happy with this 1st version (which must be improved).

      Anyway, thanks for your comment

      And you, why didn't you develop it in the end?

      • jerpint 950 days ago
        I can’t remember the list of thousands of excuses I came up with :)
        • Paul_Grsl 950 days ago
          Haha. destroy this list and go for it :D
  • 5amdotis 950 days ago
    I would like to have a tool that does it the other way around. Audio to a somewhat cohesive article.
  • wazoox 950 days ago
    I can only get "200 internal server error" entries in the dev console :)
    • Paul_Grsl 950 days ago
      Wow... :( Still now ?
      • wazoox 950 days ago
        It works on Chromium though. However it has trouble with UTF-8 obviously: it interprets "é" as "é" i.e. ISO8859-15.

        I see that for "French" it selects "Alain" as a voice in Chromium, but "Joe" in Firefox. However in Firefox I must first change the language, it then select another voice, and switching back to French, it switches properly to Alain then it works. It probably doesn't initialise properly the voice selector when loading the page in Firefox (if I don't change the language first, I can't select any voice, the selector isn't working).

        Same problem with accents as Chromium though :)

  • herinskd 950 days ago
    It works amazingly fast, are you using GPUs and QNNX to reach such performance?
    • Paul_Grsl 950 days ago
      Thanks for the great feedback. I think we can still improve the result, it's only a v1 in beta. I use nothing very advanced for rendering, a stack and tools rather simple. :)
    • zachthewf 950 days ago
      I believe it's using Microsoft TTS voices (at least for some of them)
      • Paul_Grsl 950 days ago
        That's right we use Azure TTS! :)
  • phas0ruk 950 days ago
    Nice, what’s the tech stack?
    • Paul_Grsl 950 days ago
      Simple: PHP (Symfony), JS (Vanilla), HTML, TailwindCSS :)
  • loriverkutya 950 days ago
    Not usable with Hungarian.
    • Paul_Grsl 950 days ago
      I'm looking into it, a Polish user is having the same problems. Don't hesitate to sign up so I can let you know when it's fixed :)

      Köszönöm a hozzászólásodat, ez segít nekem! :)

  • fieryskiff11 950 days ago