Added ttssr-human-only

master
ange 6 years ago
parent 26318959cc
commit 0712d4e6b0

@ -86,7 +86,6 @@ tts: output/chatbot.txt ocr/output.txt ## text to speech. Dependencies: espea
	@echo $(color_w)
	cat $? | espeak
ttstt: ocr/output.txt ## text to speech, speech to text
	cat $< | espeak -s 140 -v f2 --stdout > output/sound.wav
ttssr-human-only: ocr/output.txt ## Loop: text to speech-speech recognition. Dependencies: espeak, pocketsphinx
	bash src/ttssr-loop-human-only.sh ocr/output.txt
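The new `ttssr-human-only` target drives a feedback loop: espeak reads a sentence aloud, pocketsphinx transcribes what it hears, and the transcript becomes the next sentence to read. A minimal pure-Python sketch of that loop structure, with hypothetical toy `speak`/`listen` stand-ins in place of the real espeak and pocketsphinx calls:

```python
def ttssr_loop(sentence, speak, listen, rounds=3):
    """Feed each round's transcript back in as the next utterance.

    speak(text) -> audio and listen(audio) -> text are stand-ins for
    espeak synthesis and pocketsphinx recognition.
    """
    history = [sentence]
    for _ in range(rounds):
        audio = speak(history[-1])     # text to speech
        history.append(listen(audio))  # speech back to text
    return history

# Toy stand-ins: "synthesis" uppercases, "recognition" lowercases and
# drops punctuation, mimicking the lossiness of a real round trip.
speak = str.upper
listen = lambda a: "".join(c for c in a.lower() if c.isalpha() or c == " ")

print(ttssr_loop("Hello, world!", speak, listen, rounds=2))
# → ['Hello, world!', 'hello world', 'hello world']
```

With real audio in the loop the transcript keeps drifting each round, which is exactly what the shell script below records into `output/input0.txt`, `output/input1.txt`, and so on.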

@ -1,4 +0,0 @@
images/0004.jpg
images/0006.jpg

@ -1,199 +0,0 @@
Masthead
Editorial Team
Ana Carvalho
ginger coons
Ricardo Lafuente
Copy editor
Margaret Burnett
Publisher
ginger coons
Community Board
Dave Crossland
Louis Desjardins
Aymeric Mansoux
Alexandre Prokoudine
Femke Snelting
Contributors
Dave Crossland, Maria Figueiredo, Nelson Gonçalves, Eric de
Haas, Richard Hughes, Jonathan Puckey, Eric Schrijver
Printed in Porto by Cromotema (http://cromotema.pt) and in
Toronto by Green Printer (http://www.greenprinteronline.com)
on recycled paper
Licensed under a Creative Commons Attribution-Share Alike
licence. All content should be attributed to its
individual author. All content without a stated author can be
credited to Libre Graphics Magazine.
Contacts
Write to us at enquiries@libregraphicsmag.com
HTTP://LIBREGRAPHICSMAG.COM
Images under a CC Attribution-Share Alike licence
[individual image credits illegible in OCR output]
A Readers Guide to Libre Graphics Magazine
In this magazine, you may find concepts, words, ideas and
things which are new to you. Good. That means your
horizons are expanding. The problem with that, of course, is
that sometimes things with steep learning curves are less fun
than those without.
That's why we're trying to flatten the learning curve. If, while
reading Libre Graphics magazine, you encounter an unfamiliar
word, project name, whatever it may be, chances are good
there's an explanation.
At the back of this magazine, you'll find a glossary and
resource list. The glossary aims to define words that are
unique to the world of Libre Graphics. The resource list
provides valuable information about tools, licenses, whatever
items we may be mentioning.
Practically, this means that if, for example, you're reading an
article about Paper.js (see pages 30 and 31), you can always
flip to the back of the magazine, look up Paper.js in the
resource list and become quickly informed about it. This
provides some instant gratification, giving you the resources
you need to understand, in a moment, just what we're talking
about.
We hope you like our system.
Images under other licences
[individual image credits illegible in OCR output]
Advertisements, with the exception of those promoting Libre
Graphics Magazine, are not necessarily covered by the blanket
CC BY-SA licence. It's best to check with the makers before
reusing them.
Editor's letter
ginger coons
We so often draw a strong distinction between the physical
and the digital, acting as if the one is solid, staid and reliable,
while the other is born in ether, unreal and untouchable. This
is far from the case. Not only on occasion, but always, the
physical and the digital are intimately intertwined.
The digital is merely a subset of the physical. It is a land we've
come to view as different and distinct, despite its reliance on
the physical. Regardless of our perceptions, the two tick along,
happily co-operating and relying on one another. As the
digital fails to escape the bounds of the physical, the physical
comes to embrace its part in the success of the digital.
Graphic design and media arts are fields well acquainted with
the obvious areas of overlap between the physical and the
digital. From the days of airbrushing and drafting by hand,
to the bringing of those same metaphors into the realm of
digital production, designers and media artists are at the
forefront of both the conflicts and the embraces of the digital
and the physical.
Whether it manifests itself in a workflow incorporating both
digital and physical methods, to different ends, or whether it is
a transformation which takes place in the space between the
two realms, the point of interaction between the digital and
the physical is a special place. And it bears exploring.
FLOSS graphics and art fields are taking up that exploration in
magnificent ways. In the world of FLOSS, the space where the
digital and the physical collide is becoming beautifully
populated.
This issue of Libre Graphics magazine is about border cases,
the role of intentionality and happy accident in the mingling
of physical and digital, and any and all points of intersection.
We're exploring the translation from the digital to the
physical, the physical to the digital and the careful mapping of
the two. We're looking at the place of history and the promise
of the future in harmoniously melding our media.
We're looking at folds, sprays, cuts and prints. We're looking
at mapping worlds onto each other. In short, in this issue,
we're looking at collisions.

@ -0,0 +1,92 @@
#!/usr/bin/env python3
import speech_recognition as sr
import sys
from termcolor import cprint, colored
# obtain path to "english.wav" in the same folder as this script
from os import path
import random
a1 = sys.argv[1] #same as $1 so when you run python3 audio_transcribe.py FOO ... argv[1] is FOO
# print ("transcribing", a1, file=sys.stderr)
AUDIO_FILE = path.join(path.dirname(path.realpath(__file__)), a1) # before it was english.wav
# AUDIO_FILE = path.join(path.dirname(path.realpath(__file__)), "french.aiff")
# AUDIO_FILE = path.join(path.dirname(path.realpath(__file__)), "chinese.flac")
# print (AUDIO_FILE)
# use the audio file as the audio source
r = sr.Recognizer()
with sr.AudioFile(AUDIO_FILE) as source:
    audio = r.record(source)  # read the entire audio file
color = ["white", "yellow"]
on_color = ["on_red", "on_magenta", "on_blue", "on_grey"]
# recognize speech using Sphinx
try:
    cprint(r.recognize_sphinx(audio), random.choice(color), random.choice(on_color))
    # print(r.recognize_sphinx(audio))
except sr.UnknownValueError:
    print("unknown")
except sr.RequestError as e:
    print("Sphinx error; {0}".format(e))
# sleep (1)
# # recognize speech using Google Speech Recognition
# try:
# # for testing purposes, we're just using the default API key
# # to use another API key, use `r.recognize_google(audio, key="GOOGLE_SPEECH_RECOGNITION_API_KEY")`
# # instead of `r.recognize_google(audio)`
# print("Google Speech Recognition thinks you said " + r.recognize_google(audio))
# except sr.UnknownValueError:
# print("Google Speech Recognition could not understand audio")
# except sr.RequestError as e:
# print("Could not request results from Google Speech Recognition service; {0}".format(e))
# # recognize speech using Google Cloud Speech
# GOOGLE_CLOUD_SPEECH_CREDENTIALS = r"""INSERT THE CONTENTS OF THE GOOGLE CLOUD SPEECH JSON CREDENTIALS FILE HERE"""
# try:
# print("Google Cloud Speech thinks you said " + r.recognize_google_cloud(audio, credentials_json=GOOGLE_CLOUD_SPEECH_CREDENTIALS))
# except sr.UnknownValueError:
# print("Google Cloud Speech could not understand audio")
# except sr.RequestError as e:
# print("Could not request results from Google Cloud Speech service; {0}".format(e))
# # recognize speech using Wit.ai
# WIT_AI_KEY = "INSERT WIT.AI API KEY HERE" # Wit.ai keys are 32-character uppercase alphanumeric strings
# try:
# print("Wit.ai thinks you said " + r.recognize_wit(audio, key=WIT_AI_KEY))
# except sr.UnknownValueError:
# print("Wit.ai could not understand audio")
# except sr.RequestError as e:
# print("Could not request results from Wit.ai service; {0}".format(e))
# # recognize speech using Microsoft Bing Voice Recognition
# BING_KEY = "INSERT BING API KEY HERE" # Microsoft Bing Voice Recognition API keys 32-character lowercase hexadecimal strings
# try:
# print("Microsoft Bing Voice Recognition thinks you said " + r.recognize_bing(audio, key=BING_KEY))
# except sr.UnknownValueError:
# print("Microsoft Bing Voice Recognition could not understand audio")
# except sr.RequestError as e:
# print("Could not request results from Microsoft Bing Voice Recognition service; {0}".format(e))
# # recognize speech using Houndify
# HOUNDIFY_CLIENT_ID = "INSERT HOUNDIFY CLIENT ID HERE" # Houndify client IDs are Base64-encoded strings
# HOUNDIFY_CLIENT_KEY = "INSERT HOUNDIFY CLIENT KEY HERE" # Houndify client keys are Base64-encoded strings
# try:
# print("Houndify thinks you said " + r.recognize_houndify(audio, client_id=HOUNDIFY_CLIENT_ID, client_key=HOUNDIFY_CLIENT_KEY))
# except sr.UnknownValueError:
# print("Houndify could not understand audio")
# except sr.RequestError as e:
# print("Could not request results from Houndify service; {0}".format(e))
# # recognize speech using IBM Speech to Text
# IBM_USERNAME = "INSERT IBM SPEECH TO TEXT USERNAME HERE" # IBM Speech to Text usernames are strings of the form XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX
# IBM_PASSWORD = "INSERT IBM SPEECH TO TEXT PASSWORD HERE" # IBM Speech to Text passwords are mixed-case alphanumeric strings
# try:
# print("IBM Speech to Text thinks you said " + r.recognize_ibm(audio, username=IBM_USERNAME, password=IBM_PASSWORD))
# except sr.UnknownValueError:
# print("IBM Speech to Text could not understand audio")
# except sr.RequestError as e:
# print("Could not request results from IBM Speech to Text service; {0}".format(e))
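Every commented-out backend above repeats the same try/except shape around a `recognize_*` call. As a sketch of how that pattern could be factored once (pure Python, with a hypothetical `StubRecognizer` standing in for `speech_recognition`, whose real failure modes are `sr.UnknownValueError` and `sr.RequestError`):

```python
def try_recognize(recognizer, audio, backend, errors=(Exception,), **kwargs):
    """Call recognizer.recognize_<backend>(audio, **kwargs),
    e.g. recognize_sphinx or recognize_google; return None on failure."""
    fn = getattr(recognizer, "recognize_" + backend)
    try:
        return fn(audio, **kwargs)
    except errors:
        return None

# Hypothetical stub standing in for sr.Recognizer, for illustration only.
class StubRecognizer:
    def recognize_sphinx(self, audio):
        if not audio:
            raise ValueError("unintelligible")
        return "hello world"

r = StubRecognizer()
print(try_recognize(r, b"speech", "sphinx"))  # hello world
print(try_recognize(r, b"", "sphinx"))        # None
```

With the real library, `errors` would be `(sr.UnknownValueError, sr.RequestError)` so that unrelated exceptions still surface.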

@ -1,2 +0,0 @@
o=open('output/sound.wav', 'r')
original = o.read()

@ -0,0 +1,18 @@
#!/bin/bash
i=0;
#cp $1 output/input0.txt
echo "Read every new sentence out loud!"
head -n 1 $1 > output/input0.txt
while [[ $i -le 10 ]]
do
    echo $i
    cat output/input$i.txt
    python3 src/write_audio.py src/sound$i.wav 2> /dev/null
    play src/sound$i.wav repeat 5 2> /dev/null & # play in the background; without & the sounds would play one by one (2 is stderr)
    python3 src/audio_transcribe.py sound$i.wav > output/input$((i+1)).txt 2> /dev/null
    sleep 1
    (( i++ ))
done
today=$(date +%Y-%m-%d);
mkdir -p "output/ttssr.$today"
mv -v output/input* output/ttssr.$today;
mv -v src/sound* output/ttssr.$today;

@ -0,0 +1,25 @@
#!/usr/bin/env python3
# https://github.com/Uberi/speech_recognition/blob/master/examples/write_audio.py
# NOTE: this example requires PyAudio because it uses the Microphone class
import speech_recognition as sr
import sys
from time import sleep
a1 = sys.argv[1]
# obtain audio from the microphone
r = sr.Recognizer()
with sr.Microphone() as source:
    # print("Read every new sentence out loud!")
    audio = r.listen(source)
# sleep (1)
#
# # write audio to a RAW file
# with open("microphone-results.raw", "wb") as f:
# f.write(audio.get_raw_data())
# write audio to a WAV file
with open(a1, "wb") as f:
    f.write(audio.get_wav_data())