Audio / Video Tools
Welcome to the Skunkware Audio/Video Tools section. Here you will find sound card drivers, MPEG players and encoders, MIDI players, audio CD players and mixers, and other tools used for developing and enjoying multimedia presentations.
Package List
http://www.sco.com/skunkware/uw7/audio/mpg123/
maplay version 1.2 is the second release of my MPEG audio player/decoder. It decodes layer I and layer II MPEG audio streams and plays them using a CD-quality audio device. Currently supported devices are the dbri device of SPARC 10 computers and the audio ports of Silicon Graphics Indigo machines. Thanks to Louis P. Kruger (lpkruger@phoenix.Princeton.EDU), maplay 1.2 can also use the /dev/dsp device under Linux. Louis has tested it with the Pro Audio Spectrum 16 soundcard. Sound Blaster 16 and Gravis Ultrasound cards should also work, but a bug in the dsp driver prevents stereo playback on Gravis Ultrasound cards. An amd device of a SPARC 2/IPX/... machine can be used, too, but this device is only capable of producing audio output at 8 kHz in u-law format, which sounds as if it were transmitted through a telephone. Other audio devices are not supported directly, but can be used with the "decode to stdout" option and an audio format converter. Besides, it should not be a problem to adapt the program to other audio devices.
The player supports all modes, which are single channel, stereo, joint stereo and dual channel, and all bitrates except free mode. The missing free mode support should not be a problem for now, because I haven't seen such a stream yet.
maplay needs approximately 46% CPU time on SPARC 10/40 machines and 50% on Indigos for realtime stereo playback of a 44.1 kHz 128 kbit/s stream. Single channel playback needs about half the CPU time. On a SPARCstation IPX, maplay needs about 43% CPU time for realtime mono playback. Stereo playback is not possible via an amd device.
Besides realtime playing of audio streams, maplay can decode streams to stdout for further conversions. The output consists of 16 bit signed PCM values. For stereo streams, the values are interleaved, which means that a value for the left channel is followed by a value for the right channel and so on. If maplay has been compiled for u-law output, the output consists of 8 bit u-law samples at a rate of 8 kHz, no matter what frequency the stream uses.
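The interleaved layout described above can be illustrated with a minimal Python sketch that splits a buffer of 16 bit signed stereo samples into separate left and right channels. The function name deinterleave_pcm16 is hypothetical, and little-endian byte order is an assumption, since the description does not state which byte order maplay writes.

```python
import struct

def deinterleave_pcm16(raw: bytes, byteorder: str = "<"):
    """Split interleaved 16 bit signed stereo PCM into (left, right) lists.

    byteorder is a struct prefix: "<" little-endian (assumed here), ">" big-endian.
    """
    frame_count = len(raw) // 4              # one frame = 2 channels x 2 bytes
    usable = raw[: frame_count * 4]          # drop any trailing partial frame
    samples = struct.unpack(f"{byteorder}{frame_count * 2}h", usable)
    # Even indices are left-channel values, odd indices are right-channel values.
    return list(samples[0::2]), list(samples[1::2])

# Tiny hand-built buffer in the order L0, R0, L1, R1, L2, R2.
demo = struct.pack("<6h", 100, -100, 200, -200, 300, -300)
left, right = deinterleave_pcm16(demo)
```

For a single channel stream there is nothing to deinterleave; the values can be unpacked directly with the same struct format.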
http://www.sco.com/skunkware/uw7/audio/
Festival offers a general framework for building speech synthesis systems and includes examples of various modules. As a whole it offers full text to speech through a number of APIs: from shell level, through a Scheme command interpreter, as a C++ library, and through an Emacs interface. Festival is multi-lingual (currently English, Welsh and Spanish), though English is the most advanced.
The system is written in C++, uses the Edinburgh Speech Tools for its low level architecture, and has a Scheme (SIOD) based command interpreter for control. Documentation is given in the FSF texinfo format, which can generate a printed manual, info files and HTML.
http://www.sco.com/skunkware/uw7/audio/festival/
http://www.sco.com/skunkware/osr5/audio/festival/
MBROLA v3.00 is a speech synthesizer based on the concatenation of diphones. It takes a list of phonemes as input, together with prosodic information (duration of phonemes and a piecewise linear description of pitch), and produces speech samples on 16 bits (linear), at the sampling frequency of the diphone database.
It is therefore NOT a Text-To-Speech (TTS) synthesizer, since it does not accept raw text as input. In order to obtain a full TTS system, you need to use this synthesizer in combination with a text processing system that produces phonetic and prosodic commands. The Skunkware MBROLA distribution is pre-configured for use in conjunction with the Festival Speech Synthesis system as well as the txt2pho and emofilt utilities. These tools provide support for TTS synthesis, Text-to-Phoneme conversion, and manipulation of prosody of text-to-speech output.
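As a rough illustration of the kind of input described above (a phoneme, its duration, and a piecewise linear pitch description), the following Python sketch interpolates pitch between breakpoints and prints one phoneme per line. The function names, the use of percentages for breakpoint positions, and the one-line-per-phoneme layout are assumptions made for this example; consult the MBROLA documentation for the actual input format.

```python
from bisect import bisect_right

def pitch_at(points, position):
    """Linearly interpolate pitch (Hz) at `position` within one phoneme.

    points: (position_percent, pitch_hz) pairs sorted by position (assumed units).
    Outside the first/last breakpoint the nearest pitch value is held.
    """
    if not points:
        raise ValueError("at least one pitch point is required")
    xs = [pos for pos, _ in points]
    if position <= xs[0]:
        return points[0][1]
    if position >= xs[-1]:
        return points[-1][1]
    i = bisect_right(xs, position)           # first breakpoint strictly after position
    (x0, y0), (x1, y1) = points[i - 1], points[i]
    return y0 + (y1 - y0) * (position - x0) / (x1 - x0)

def format_phoneme(name, duration_ms, points=()):
    """Render one phoneme as: name, duration, then position/pitch pairs (assumed layout)."""
    fields = [name, str(duration_ms)]
    for pos, hz in points:
        fields += [str(pos), str(hz)]
    return " ".join(fields)

contour = [(0, 120), (100, 100)]             # pitch falls from 120 Hz to 100 Hz
midpoint_pitch = pitch_at(contour, 50)
line = format_phoneme("a", 80, contour)
```

A text processing front end such as Festival or txt2pho is what would normally produce this phonetic and prosodic information from raw text.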
There is currently only an SCO OpenServer 5 binary which works on both OpenServer and UnixWare 7.
http://www.sco.com/skunkware/uw7/audio/mbrola/
The Edinburgh Speech Tools Library is a collection of C++ classes, functions and related programs for manipulating the sorts of objects used in speech processing. It includes support for reading and writing waveforms and parameter files (LPC, Cepstra, F0) in various formats and for converting between them. It also includes support for linguistic type objects, various label files, and ngrams (with smoothing).
In addition to the library, a number of programs are included: an intonation library, which includes a pitch tracker, smoother and labelling system (using the Tilt Labelling system), and a classification and regression tree (CART) building program called wagon. There is also growing support for various speech recognition classes such as decoders and HMMs.
The Edinburgh Speech Tools Library is not an end in itself but is designed to make the construction of other speech systems easy. It provides, for example, the underlying classes in the Festival Speech Synthesis System.
The speech tools are currently distributed in full source form free for unrestricted use.
The following C++ programs are available:
- na_play: generic playback program for use with net_audio and CSTR ao.
- ch_wave: Waveform file conversion program.
- ch_lab: label file conversion program.
- ch_track: Track file conversion program.
- wagon: a CART tree build and test program
- And others
The following C++ sub-libraries are available:
- audio: C++ audio functions for Network Audio System, Suns, OpenServer, UnixWare, Linux and FreeBSD
- speech_class: C++ speech classes, including waveform and track.
- ling_class: C++ linguistic classes.
- sigpr: Signal processing
- utils: Various utilities.
http://www.sco.com/skunkware/uw7/audio/speech_tools/
http://www.sco.com/skunkware/osr5/audio/speech_tools/
Txt2pho is a German TTS front end for the MBROLA synthesizer. This program is derived from the speech synthesis system Hadifix.
Currently there are no UnixWare or OpenServer binaries available, but the freely available Linux binary works when used in conjunction with lxrun, the Linux Emulation System developed by SCO.
http://www.sco.com/skunkware/uw7/audio/txt2pho/
Open Sound System for SCO OpenServer and SCO UnixWare provides device drivers for popular soundcards under SCO OpenServer 5, SCO UnixWare 2.x, UnixWare 7 and Free/SCO. OSS/SCO comes with a configuration tool and complies with the Open Sound System API.
Features include:
- Easy menu based installation and configuration program
- Support for Sound Blaster AWE32/AWE64 Emu8000 synth
- Supports PnP sound cards.
- Autodetection of Sound Blaster, ESS and GUS sound cards.
- Drivers for over 150 brand name soundcards and onboard audio devices.
- Support for a wide variety of audio applications
- Support for select()
- Support for "Virtual Mixer" - play 8 simultaneous audio apps!
- Support for OPL3-SAx, AD1816, CMI8330 and Sound Blaster AWE64 PCI
- Support for S3 Sonic Vibes/Turtle Beach Daytona
- Full Duplex support for Sound Blaster 16/AWE-32/AWE-64 and Vibra16
- Support for Ensoniq AudioPCI and AudioPCI97
OSS/SCO version 980728 is now available (August 14, 1998) for OpenServer, and OSS/UnixWare version 3.9 BETA has been announced (April 17, 1998) for UnixWare.
SCO Skunkware was the initial SCO distribution mechanism for these drivers. The OSS audio drivers are being incorporated into the standard product line (beginning with UnixWare 7 and soon with OpenServer). As this transition takes place, the Skunkware audio pages will attempt to direct you to the best place to download the current driver for your platform(s). Currently, the best place to get the OSS audio driver(s) is 4Front Technologies, as they provide the latest bug-fixed release sooner than SCO is able to integrate it into its product line. Unfortunately, the 4Front drivers are not free. SCO will continue to provide free, fully functional SoundBlaster compatible OSS drivers on-line and in the product. Whew.
The download page at 4Front Technologies is http://www.4front-tech.com/download.cgi. The 4Front Technologies OpenServer page is at http://www.4front-tech.com/sco.html and the 4Front UnixWare page is http://www.4front-tech.com/unixware.html.
In addition, 4Front maintains a pretty good set of links to free audio software at http://www.4front-tech.com/ossapps.html.
The links below will attempt to take you to the latest (free, fully functional) SCO pre-licensed drivers. Hopefully these locations will stabilize over time.
Snd is a freeware sound editor modelled loosely after Emacs and an old, sorely-missed PDP-10 sound editor named Dpysnd. It is an X/Motif application written by Bill Schottstaedt (bil@ccrma.stanford.edu).
It can accommodate any number of sounds at once, each with any number of channels. Each channel is normally displayed in its own window, with its own cursor, edit history, and marks; each sound has a 'control panel' to try out various changes quickly, and an expression parser, used mainly during searches; there is an overall stack of 'regions' that can be browsed and edited; channels and sounds can be grouped together during editing; edits can be undone and redone without restriction ('unlimited undo').
http://www.sco.com/skunkware/uw7/audio/snd/
SoX is a sound file format converter for Unix and DOS PCs written by Lance Norskog and other invaluable contributors. It also does sample rate conversion and some sound effects. It's the Swiss Army knife of sound tools: the interface isn't great, but it does almost everything.
SoX uses file suffixes to determine the nature of a sound sample file. If it finds the suffix in its list, it uses the appropriate read or write handler to deal with that file. SoX has an auto-detect feature that attempts to figure out the nature of an unmarked sound sample.
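The suffix lookup with an auto-detect fallback can be sketched in a few lines of Python. This is an illustration of the general technique only, not SoX's own code: the table contents and the guess_format name are hypothetical, and the fallback checks just the first four bytes of the file, which a real detector would not rely on alone.

```python
from pathlib import Path

# Hypothetical suffix table; a real converter knows many more formats.
SUFFIX_TO_FORMAT = {".wav": "wav", ".au": "au", ".aiff": "aiff", ".aif": "aiff"}

# Leading bytes commonly found at the start of these file types.
MAGIC_TO_FORMAT = {b"RIFF": "wav", b".snd": "au", b"FORM": "aiff"}

def guess_format(filename, header=b""):
    """Return a format name from the suffix, else from the header bytes, else None."""
    suffix = Path(filename).suffix.lower()
    if suffix in SUFFIX_TO_FORMAT:
        return SUFFIX_TO_FORMAT[suffix]
    # No recognised suffix: fall back to sniffing the first four bytes.
    return MAGIC_TO_FORMAT.get(bytes(header[:4]))

by_suffix = guess_format("song.WAV")
by_header = guess_format("unmarked", b".snd\x00\x00\x00\x18")
unknown = guess_format("notes.xyz", b"abcd")
```

The format name returned here would then be used to select the matching read or write handler.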
http://www.sco.com/skunkware/osr5/audio/sox/
TiMidity is a MIDI to WAVE converter using Gravis Ultrasound-compatible patch files to generate digital audio data from General MIDI files. The data can be stored in a file for processing, or played in real time through an audio device.
http://www.sco.com/skunkware/osr5/audio/timidity/
http://www.sco.com/skunkware/uw7/audio/
Xmcd is a CD player utility package including xmcd, a CD player for the X window system using the Motif graphical user interface, and cda, a command-line driven, text mode CD player which also features a curses-based, screen-oriented mode. Both utilities transform your CD-ROM or CD-R drive into a stereo CD player, allowing you to play music CDs on your computer.
These CD player utilities are designed to be attractive and feature-rich yet intuitive to use, and they take advantage of many CD-ROM drive capabilities that are not accessible via other players. Moreover, a CD database feature is supported, maintaining the disc artist/title, track titles, and arbitrary text (such as band information and song lyrics).
Xmcd and cda have emerged as the most ported CD player package, supporting a substantial list of UNIX operating system variants (as well as a non-UNIX OS) and hardware platforms. Moreover, these utilities also support a vast spectrum of CD-ROM and CD-R drives, including many older SCSI-1 units.
http://www.sco.com/skunkware/uw7/audio/
http://www.sco.com/skunkware/osr5/audio/xmcd
Xmmix is an audio mixer utility for the X window system using the Motif graphical user interface. It operates the input and output mixer section on many PC sound cards.
http://www.sco.com/skunkware/uw7/audio
http://www.sco.com/skunkware/osr5/audio/xmmix/
Xmixer is a Visual TCL (vtcl) script written by John Gray (johng@sco.com) which acts as a graphical front-end to the mixer program, providing a simple way to control the mixing of your sound card.
/usr/local/bin/Xmixer [ linear ] [ gang ] [ notitle ] [ help ]
    linear   slider control, default is log
    gang     slider control, default is separate
    notitle  No L/R labels on sliders, default is labels
    help     This message, default is no message
http://www.sco.com/skunkware/uw7/audio/
Xwave supports editing of large files; cut, copy, paste and merge operations; and some effects (echo, reverse, swap channels, resample, volume). Supported file formats: RIFF, AIFF, AIFC, AU. Supported platforms: SCO, Linux, SGI, SUN, FreeBSD.
http://www.sco.com/skunkware/uw7/audio/
mpeg_encode produces an MPEG-1 video stream. param_file is a parameter file which includes a list of input files and other parameters.
mpeg_play decodes and displays MPEG-1 encoded bitstreams on systems running X11. The player will create a new window, display the bitstream, and exit.
osr5/video/mpeg_play/mpeg_play-2.3.tar.gz
Last Updated: Monday Jul 19, 1999 at 10:15:45 PDT