• Isadora
  • Get it
  • Forum
  • Help
  • ADD-ONS
  • Newsletter
  • Impressum
  • Dsgvo
  • Impressum
Forum

Navigation

    • Register
    • Login
    • Search
    • Categories
    • Recent
    • Popular
    • Tags

    Recognizing spoken words

    How To... ?
    8
    15
    3643
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Armando
      Armando Beta Gold last edited by

      Hello,

      any idea on how to make Isadora recognize spoken words? Talking in a microphone is izzy capable to recognize the word and trigger something?
      Thanks
       Armando

      Armando Menicacci
      www.armandomenicacci.net
      www.studiosit.ca

      Macbook pro 16 inches 2022 64 GB RAM, M1 MAX. OS 13 Ventura

      1 Reply Last reply Reply Quote 0
      • Skulpture
        Skulpture Izzy Guru last edited by

        I imagine you would have to use a third party somewhere and somehow write to a file and then import them... maybe a .txt file? Data array? Not sure really. That's a big task. Sounds like something @Fred would do ;)

        Graham Thorne | www.grahamthorne.co.uk
        RIG 1: Windows 11, AMD 7 Ryzen, RTX3070, 16gig RAM. 2 x M.2 SSD. HD. Lenovo Legion 5 gaming laptop.
        RIG 2: Windows 11, Intel i19 12th Gen. RTX3070ti, 16gig RAM (ddr5), 1x M.2 SSD. UHD DELL G15 Gaming laptop.
        RIG 3: Apple rMBP i7, 8gig RAM 256 SSD, HD, OS X 10.12.12

        1 Reply Last reply Reply Quote 0
        • Maximortal
          Maximortal last edited by

          This is a workaround: use some software like IMITONE (they transform your sing in midi notes)then use a different pitch tp pronunce the words, it recognize not the word but tonality than map isadora's triggers to different midi note.

          Iro Suraci | Win 10 - Ryzen 3600 - 32GB - nVidia gtx 960 4gb / Win 10 - i5 4210U - 8 GB - R5 M230 | Isadora 3.0.7| Located in Brescia, Italy

          1 Reply Last reply Reply Quote 0
          • crystalhorizon
            crystalhorizon Beta Platinum last edited by

            Thats clever

            Alexander Nantschev | http://www.crystalhorizon.at | located in Vienna Austria

            1 Reply Last reply Reply Quote 0
            • J
              jandraka last edited by

              Hello,

              There is a section  in System Preferences named "Dictation". You can turn it on and select the input. If you use Quartz Composer you can use a composition as attached and the "Text" field activate the Dictation function (Fn twice). Talk and takes seconds to recognize it. You press intro and say another phrase. If you use "Syphon to texture" you can handle the text as a layer.
              I hope it helps.

              095d55-text.zip

              MBP 2017, Quad-Core Intel Core i7 3,1GHz, 16GB, Radeon Pro 560 4GB. OSX 11.6.2 Isadora 3.2.5

              1 Reply Last reply Reply Quote 0
              • Skulpture
                Skulpture Izzy Guru last edited by

                @jandraka way is the kind of route i'd take; but i imagine there is a delay and it wont be 100% perfect.

                Graham Thorne | www.grahamthorne.co.uk
                RIG 1: Windows 11, AMD 7 Ryzen, RTX3070, 16gig RAM. 2 x M.2 SSD. HD. Lenovo Legion 5 gaming laptop.
                RIG 2: Windows 11, Intel i19 12th Gen. RTX3070ti, 16gig RAM (ddr5), 1x M.2 SSD. UHD DELL G15 Gaming laptop.
                RIG 3: Apple rMBP i7, 8gig RAM 256 SSD, HD, OS X 10.12.12

                1 Reply Last reply Reply Quote 0
                • Fred
                  Fred last edited by

                  Apart from this Kinect V2 (proper sdk for MS only) has pretty good voice recognition and it is not to hard to use- you can send OSC to Izzy to let it know that a voice command has been activated, or of course the old mac system. My advice would be to use a lapel mic to avoid some confustion. I had voice controlled Izzy several years ago (using apple script) it worked ok, was clunky as the voice recognition supplied was. Also google have a service you can upload files to (well kind of you need to do some work but I think you can get there without complex code) to do speach to text. The recognition is a lot better and contextual, I used it a few times, you can see a way to use it in this code

                  https://github.com/fx-lange/ofxGSTT
                  Pretty simple and you can do it without OF if you need. Or you could do it with OF and get the strings back- this engine understands more accents and does a better job.
                  Fred

                  http://www.fredrodrigues.net/
                  https://github.com/fred-dev
                  OSX 10.15.15 MBP 2019 16" 2.3 GHz 8-Core i9, Radeon Pro 5500M 8 GB, 32g RAM
                  Windows 10 7700K, GTX 1080ti, 32g RAM, 2tb raided SSD
                  Windows 10 Threadripper 3960x 64g ram, 1tb NVME, rtx 2080ti + rtx2070 super

                  1 Reply Last reply Reply Quote 0
                  • Maximortal
                    Maximortal last edited by

                    I found another solution using your smartphone:

                     the chain is: Tasker now - tasker - usb OTG cable - USB midi adapter
                    Tasker now is a plugin for tasker that intecept words in google now
                    Tasker is...all that you want :-)
                    for more infos look at http://tasker.dinglisch.net/userguide/en/midi.html
                    EDIT: you can do it without a midi interface..
                    you record two different sine waves (e.i. 200 Hz and 15 kHz) and store into phone
                    then you set in tasker to play the first when you say "first" and second to "second"
                    then you connect you smartphone in the audio input of isadora and use sound frequences analisys actor
                    next steps are obiouvs
                    Of course you need an android phone with tasker and tasker now

                    Iro Suraci | Win 10 - Ryzen 3600 - 32GB - nVidia gtx 960 4gb / Win 10 - i5 4210U - 8 GB - R5 M230 | Isadora 3.0.7| Located in Brescia, Italy

                    1 Reply Last reply Reply Quote 0
                    • Armando
                      Armando Beta Gold last edited by

                      Thank you all.

                      Just to give a feedback to so many generous suggestions I've tried MAXIMORTAL  suggestion. But it doesn't woks for me because peuple should really sing a specific note associate with a word. And I Don't want to teach music and text, And I'l like people to express more freely.

                      Second maximortsl option is very interesting, But android only.....
                      Fred as always has solutions that make me dream (and sewat) but it seems like a think that could work although out of my reach. (I'll hire people to develop and report to the community in case I go this way).
                      I"d go: text to speach, Data array text comparator.
                      I'll tell when I try

                      Armando Menicacci
                      www.armandomenicacci.net
                      www.studiosit.ca

                      Macbook pro 16 inches 2022 64 GB RAM, M1 MAX. OS 13 Ventura

                      1 Reply Last reply Reply Quote 0
                      • DusX
                        DusX Tech Staff last edited by

                        Heres a python module that offers some Windows options: https://github.com/t4ngo/dragonfly

                        Adding an OSC module for communication with Isadora would make this a workable option.

                        Troikatronix Technical Support

                        • New Support Ticket Link: https://support.troikatronix.com/support/tickets/new
                        • Isadora Add-ons: https://troikatronix.com/add-ons/
                        • My Add-ons: https://troikatronix.com/add-ons/?u=dusx

                        Running: Win 10 64bit, i7-8750H, M.2 PCIe SSD's, 16gb DDR4-2666, nVidia GTX 1070 | located in Ontario Canada.

                        1 Reply Last reply Reply Quote 0
                        • Armando
                          Armando Beta Gold last edited by

                          Thanks DusX,

                          but I'm on a mac plus I don't lump how to use this But thanks anyway. This can help windows people after all.
                          Ciao 

                          Armando Menicacci
                          www.armandomenicacci.net
                          www.studiosit.ca

                          Macbook pro 16 inches 2022 64 GB RAM, M1 MAX. OS 13 Ventura

                          1 Reply Last reply Reply Quote 0
                          • J
                            jandraka last edited by

                            Hi Armando,

                            I do not know if this can help you. It is a video in which I use Dictation, Quartz Composer and Isadora. I use the words "dos", "tres" and "cuatro" in Spanish to toggle the video sources. You must enter a sound to insert text in Quartz Composer, but the sound can be more discreet. In Dictation you can choose the audio input source: Internal microphone, line input ... Dictation is quite fast recognizing the words but not infallible.
                            If it helps, I can share the Quartz Composer patch.
                            Sorry for my English.
                            Best regards.

                            e523a7-20150701_002731.mp4.zip

                            MBP 2017, Quad-Core Intel Core i7 3,1GHz, 16GB, Radeon Pro 560 4GB. OSX 11.6.2 Isadora 3.2.5

                            1 Reply Last reply Reply Quote 0
                            • Armando
                              Armando Beta Gold last edited by

                              Wow. video is impresing. I'll try

                              Thanks

                              Armando Menicacci
                              www.armandomenicacci.net
                              www.studiosit.ca

                              Macbook pro 16 inches 2022 64 GB RAM, M1 MAX. OS 13 Ventura

                              1 Reply Last reply Reply Quote 0
                              • gapworks
                                gapworks last edited by

                                any more details or maybe an example about you quartz patch! your concept sounds really interessting!

                                best

                                Running MBP2017 / Osx 10.14.5 / 16 GB 2133 MHz LPDDR3 / Intel HD Graphics 630 1536 MB / Latest Isadora Version / www.gapworks.at / located in Vienna Austria

                                1 Reply Last reply Reply Quote 0
                                • J
                                  jandraka last edited by

                                  Hi,

                                  I use several chained applications. The result would be better if I could automatically insert text in Text Edit.
                                  I use Dictation, Text Edit, Quartz Compser and Isadora.
                                  First, you have to make some adjustments in Dictation, in System Preferences. You have to activate Dictation (I have selected the option to activate by pressing "fn" twice).
                                  Then, also in the System Preferences, Accessibility, go to Dictation and then to Dictation Commands, "+" press and create a new user command. This command allows you to save changes in Text edit by slight murmur (see sceenshoots 1, 2, and 3).
                                  When these adjustments are made, we can open Text files recogn.txt, .qtz, .izz. Now with Dictation function (appears a small square with a microphone), and from the Text Edit file, we have to go inserting any text. By slight breathing after the text, we will saving Text Edit document and therefore entering text in Quartz Composer. Here, in Quartz Composer, comparison with our keywords occurs and if the result is true, a MIDI event is triggered.
                                  It is important that Kineme String With File patch matches the file path to the .txt document.
                                  The Kineme plugin, you must install it in user / Library / Graphics / Quartz Composer Patches
                                  I understand it's not a very clear settings but it works. The biggest problem is that Dictation sometimes understand anything.
                                  I hope this can be of some use to you.

                                  2aaa44-text-recognition.zip

                                  MBP 2017, Quad-Core Intel Core i7 3,1GHz, 16GB, Radeon Pro 560 4GB. OSX 11.6.2 Isadora 3.2.5

                                  1 Reply Last reply Reply Quote 0
                                  • First post
                                    Last post