Unoffical empeg BBS

Quick Links: Empeg FAQ | RioCar.Org | Hijack | BigDisk Builder | jEmplode | emphatic
Repairs: Repairs

Page 2 of 2 < 1 2
Topic Options
#47915 - 27/11/2001 16:52 Re: EmpTriv [Re: smu]
tonyc
carpal tunnel

Registered: 27/06/1999
Posts: 7058
Loc: Pittsburgh, PA
Yeah... The Empeg isn't quite ready for primetime as an easily-extensible computing platform. Mlord's kernel was a huge step. The alternate root partition is another biggie. It will let us keep user programs intact between software upgrades, and as I understand it, will provide the user space portion of the seamless launch procedure we're looking for where you just add programs to config.ini. Mlord has done most of what needs to be done in terms of allowing for text from config.ini to be available to the user apps. I'm sure there's other work to be done there.

If people are looking for constructive things to do, I think that the following efforts can go on in parallel with these efforts. Some of these suggestions might already be in the works, if so, please let us know how you're doing!

- an efficient graphics API
possibly written in ARM assembly
- a "storage API"
take smallish chunks of data and save it to / load it from raw sectors on an unused partition such as hda2 or hda6
- a sound API
take PCM data and mix it in with the player output (it appears Kim has a good start on this)
maybe even a routine to mix in an MP3 file (a nice bell/whistle feature)
- a nice installer package for user programs
along the lines of Frank's but one which circumvents Emplode and transfers the package directly

_________________________
- Tony C
my empeg stuff

Top
#47916 - 27/11/2001 17:01 Re: EmpTriv [Re: tfabris]
bobo
member

Registered: 13/08/1999
Posts: 116
I had this one running with ARM linux on my ipaq:
http://www.speech.cs.cmu.edu/flite/

bobo

Top
#47917 - 27/11/2001 17:03 Re: EmpTriv [Re: tfabris]
tonyc
carpal tunnel

Registered: 27/06/1999
Posts: 7058
Loc: Pittsburgh, PA
Well... Here's the deal. There are basically two options for TTS on Linux. Rsynth and Festival. Rsynth is much lower quality and doesn't do as good of a job figuring out sentence structure or the proper sound for a given letter. Festival is what is used in the demo MP3. Festival unfortunately has a memory footprint of 10 megabytes. Even if we had the RAM I don't think we'd have the power to do real time text-to-speech. I did cross-compile Festival and the binary runs, butt immediately crashes on my Empeg.

Rsynth would be an option except that I can't get it to compile and run properly on my Empeg. I need to hack it up to write exactly 4608 bytes and I haven't had any luck with it. Anyone smarter than me who wants to give it a try is more than welcome to do so. If someone can get rsynth working then we have at least rudimentary TTS.

If it used Rsynth then I'd have to do a lot more manual fiddling with the questions (replacing some E's with the "schwa" phoneme, for instance) because rsynth isn't as intelligent about choosing the proper sound. Me parsing through 20,000 questions is not likely.

The authors of Festival have decreasing the memory footprint listed as one of their goals, but I'm not holding my breath.
_________________________
- Tony C
my empeg stuff

Top
#47918 - 27/11/2001 17:04 Re: EmpTriv [Re: bobo]
tonyc
carpal tunnel

Registered: 27/06/1999
Posts: 7058
Loc: Pittsburgh, PA
Ahhhh... I could NOT find that one in any of my searches. I was even looking at CMU's speech site! Thanks for the URL. Downloading now...
_________________________
- Tony C
my empeg stuff

Top
#47919 - 27/11/2001 18:08 Re: EmpTriv [Re: tonyc]
tanstaafl.
carpal tunnel

Registered: 08/07/1999
Posts: 5543
Loc: Ajijic, Mexico
20,000 questions * 10k = 200 megs. Not sure anyone would want to download that much.

That's only 1/3 of one percent of the capacity of a 60 GB empeg. I think there are a lot of people who would give up that much space! Or, another way of looking at it, it's about three CDs worth of space.

tanstaafl.
_________________________
"There Ain't No Such Thing As A Free Lunch"

Top
#47920 - 27/11/2001 18:16 Re: EmpTriv [Re: bobo]
tonyc
carpal tunnel

Registered: 27/06/1999
Posts: 7058
Loc: Pittsburgh, PA
Okay this is another victim of the 4608 byte requirement on /dev/audio. Running Flite gives me this:


empeg:/usr/local/flite# ./flite "hello, world"
on-4608 byte write (256)
failed to write 128 samples


So I'm not great at all this low level IO stuff, but I found the #define for the audio buffer size was set to 128. So if 128 samples = 256 bytes (16-bit sound) I figured increasing the #define to 2304 (4608 / 2) would do the trick. Naaah. For some reason it only writes 1584 samples at a time, and I get this:


non-4608 byte write (3168)


I'm not smart enough to figure out the rest. So flite has the ability to output to a WAV file. I did that and copied the WAV file over to my PC and the quality of the voice synthesis definitely leaves little to be desired compared to the full version of Festival. I guess that's what makes it "Festival Lite." For instance in the example I posted, it says the verb form of "contest" instead of the noun form (accent is on the second syllable when it should be on the first.) The real Festival handles it without problems.

So if we want anything good sounding, we need to use sound files rather than realtime synthesis. At least for now...
_________________________
- Tony C
my empeg stuff

Top
#47921 - 27/11/2001 18:18 Re: EmpTriv [Re: tanstaafl.]
tonyc
carpal tunnel

Registered: 27/06/1999
Posts: 7058
Loc: Pittsburgh, PA
Sure I realize that. I've got a 40GB Empeg so it's no big deal. But downloading that 200 megs is a little onerous... I dunno. That example question isn't necessarily the longest question in the database either, I'm not sure what the mean is, but the average question could end up being 15 or 20k instead of 10k. Then if you wanted it to say the answers aloud, that'd be even more.

Then there's the fact that I don't know how to play an MP3 file from my software, much less mix it in with the Empeg's output...

But it's something to think about. A neat feature, certainly not what I'm going to focus on, though.
_________________________
- Tony C
my empeg stuff

Top
#47922 - 28/11/2001 12:25 Re: EmpTriv [Re: tonyc]
eternalsun
Pooh-Bah

Registered: 09/09/1999
Posts: 1721
Loc: San Jose, CA
I have some mp3s that are 200 megs.

Calvin

Top
#47923 - 28/11/2001 13:14 Re: EmpTriv [Re: eternalsun]
tonyc
carpal tunnel

Registered: 27/06/1999
Posts: 7058
Loc: Pittsburgh, PA
Yah. 200 megs isn't that bad, which is why I am considering the pre-recorded soundfile route for this after seeing that realtime text-to-speech directly on the Empeg might be somewhat far off. I am going to experiment some more with Flite and at least try to get it correctly outputting sound to the audio device before I totally give up, though.
_________________________
- Tony C
my empeg stuff

Top
#47924 - 03/01/2002 20:37 Re: EmpTriv [Re: tonyc]
andygjones
journeyman

Registered: 30/07/2001
Posts: 63
Im trying to install emptriv and having problems. I have uploaded the file to my empeg and extracted it to /drive1/emptriv
It seems to have all extracted ok with a questions directory, but when i execute emptriv it doesnt do anything. I have tried leaving it for a few minutes and breaking out of it, but then nothing appears in the hijack menu.

Any help appreciated.
_________________________
RioCar 40GB RGB - can never decide

Top
#47925 - 03/01/2002 20:51 Re: EmpTriv [Re: andygjones]
tonyc
carpal tunnel

Registered: 27/06/1999
Posts: 7058
Loc: Pittsburgh, PA
Are you running it in the background? You can't break out of it and still have it bound to the menu (the process ends!)

You need to type emptriv& to run it as a background task. Then fire up the player (exit back to the shell-player loop will also work.)

It should then appear in your menu.
_________________________
- Tony C
my empeg stuff

Top
Page 2 of 2 < 1 2