Internet Engineering Task Force                          David Cromwell
INTERNET DRAFT                                           Nortel Networks
File: draft-cromwell-megaco-advanced-audio-pkg-00.txt    Date: December 1, 1999


             Proposal for an MEGACO Advanced Audio Package
           <draft-cromwell-megaco-advanced-audio-pkg-00.txt>


Status of this Document

   This document is an Internet-Draft and is in full conformance with
   all provisions of Section 10 of RFC2026

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF), its areas, and its working groups.  Note that
   other groups may also distribute working documents as Internet-
   Drafts.

   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet- Drafts as reference
   material or to cite them other than as work in progress."

   The list of current Internet-Drafts can be accessed at
   http://www.ietf.org/ietf/1id-abstracts.txt

   The list of Internet-Draft Shadow Directories can be accessed at
   http://www.ietf.org/shadow.html.


Abstract

   This document is a proposal to add a new event/signal package to the
   Megaco protocol to control an ARF (Audio Resource Function) which may
   reside on a Media Gateway or specialized Audio Server.

   This event package provides support for the standard IVR operations
   of PlayAnnouncement, PlayCollect, and PlayRecord.  It supports direct
   references to simple audio as well as indirect references to simple
   and complex audio. It provides audio variables, control of audio
   interruptibility, digit buffer control, special key sequences, and
   support for reprompting during data collection.   It also provides an
   arbitrary number of user defined qualifiers to be used in resolving
   complex audio structures.  For example, the user could define qualif-
   iers for any or all of the following: language, accent, audio file
   format, gender, speaker, or customer.


Cromwell                  expires June 1, 2000                  [Page 1]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


Table of Contents


        1. Introduction ...................................................    3

        1.1. Sequences And Sets ...........................................    3

        1.2. Segment Types ................................................    4

        1.3. Variables ....................................................    6

        2. Advanced Audio Package Definition ..............................    6

        2.1. Properties ...................................................    7

        2.2. Events .......................................................    7

        2.2.1. Audio Operation Complete ...................................    7

        2.2.2. Audio Operation Failure ....................................    9

        2.3. Signals ......................................................    9

        2.3.1. Play .......................................................    9

        2.3.2. PlayCollect ................................................   19

        2.3.3. PlayRecord .................................................   26

        2.3.4. EndAudioSignal .............................................   32

        2.4. Statistics ...................................................   32

        2.5. Procedures ...................................................   32

        3. Syntax Rules ...................................................   32

        4. Formal Syntax Description ......................................   33

        5. Examples .......................................................   33

        6. References .....................................................   37

        7. Author's Address ...............................................   37


Cromwell                  expires June 1, 2000                  [Page 2]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


   1.  Introduction

   The following syntax supports both simple and complex audio struc-
   tures.  A simple audio structure might be a single announcement such
   as "Welcome to Bell South's Automated Directory Assistance Service."
   A more complex audio structure might consist of an announcement fol-
   lowed by voice variable followed by another announcement, for example
   "There are thirty seven minutes remaining on your prepaid calling
   card," where "There are" is a prompt, the number of minutes is a
   voice variable, and "minutes remaining on your prepaid calling card"
   is another prompt.

   It is also possible to define complex audio structures that are qual-
   ified by user defined selectors such as language, accent, audio file
   format, gender, speaker, or customer.  For instance, if the above
   example were qualified by language and accent selectors, it would be
   possible to play "There are thirty seven minutes remaining on your
   prepaid calling card" in English spoken with a southern accent or in
   English spoken with a mid-western accent, providing that the audio to
   support this had been provisioned.

   There are two methods of specifying complex audio.  The first is to
   directly reference the individual components.  This requires a com-
   plete description of each component to be specified via the protocol.
   The second method is to provision the components on the Audio Server
   as a single entity and to export a reference to that entity to the
   call agent.  In this case, only the reference (plus any dynamic data
   required, such as a variable data) is passed via the protocol, and no
   specification of individual components is necessary.

   The Advanced Audio Package provides significant functionality con-
   trolled via protocol parameters.  Most parameters are optional, and
   where ever possible default to reasonable values.  An audio applica-
   tion that references to provisioned, complex audio structures, and
   which takes advantage of parameter optionality and defaults, can
   specify audio events using a minimum of syntax.

   The next three sections contain background information which may be
   helpful in understanding the syntax.


   1.1.  Sequence And Sets

   The syntax supports abstractions of set and sequence for storing and
   referencing audio data.

   A sequence is a provisioned sequence of one or more audio segments.
   Component segments are not necessarily all of the same type.  Every


Cromwell                  expires June 1, 2000                  [Page 3]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


   sequence is assigned a unique segment id.  On playback, a sequence id
   reference is deconstructed into its individual parts, each of which
   is played in order.

   A set is a provisioned collection of audio segments with an associ-
   ated selector.  On playback, the selector value is resolved to a par-
   ticular set element.  Selector types are supported by the syntax, but
   individual selector types are not defined in the syntax; they are
   defined by the user (i.e.  provisioner).  For example, a user could
   define one or more of the following selector types: language, accent,
   audio file format, gender, accent, customer, or day of the week.  For
   each selector type, the user must define a range of valid values.
   The user may also choose to define a default value.  Then if a selec-
   tor value is not supplied at runtime the default value is used.

   For example, suppose that a user has defined a selector of type
   language as well as a set of legal values for that selector consist-
   ing of English, French, and Russian, and that the user has set
   English to be the default value.  Let us also assume that the user
   has provisioned English, French, and Russian versions of a particular
   prompt, "Please enter your id" for instance. At runtime a reference
   to the set with the selector set to Russian would result in the Rus-
   sian version of the prompt being played.  A reference to the set with
   no selector would result in the English version of the prompt being
   played since English has been set as the default.

   Recursive definition of both sets and sequences is allowed, i.e. it
   legal to define a set of sets or a sequence of sequences.  In addi-
   tion, audio structures may also be specified by intermixing sets and
   sequences, and it is possible to specify a set of sequences or a
   sequence containing one or more set elements.  Direct or transitive
   definition of a set or segment in terms of itself is not allowed.


   1.2.  Segment Types

   The syntax supports the following segment types:

        RECORDING:  A reference by unique id to a single piece of
        recorded audio.

        RECORDINGs may be provisioned or they may be made during the
        course of a call.  A RECORDING made during the course of a call
        can be temporary or persistant.  A temporary RECORDING lasts
        only for the life of the call during which it was recorded.  A
        persistant RECORDING lasts beyond the live of the call during
        which it was recorded.


Cromwell                  expires June 1, 2000                  [Page 4]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


        A provisioned RECORDING may be replaced (or overriden) by a per-
        sistant RECORDING.  A reference to the id of the provisioned
        RECORDING will then resolve to the persistant RECORDING.  The
        overriding persistant audio can subsequently be deleted and the
        original provisioned audio can be restored.

        A provisioned RECORDING may be overrided more than once.  In
        this case, the id of the provisioned RECORDING refers to the
        latest overriding RECORDING.  When the overriding RECORDING is
        deleted, the original provisioned RECORDING is restored, even if
        the segment has been overriden multiple times.

        TEXT:  A reference to a block of text to be converted to speech
        or to be displayed on a device. Reference may be by unique id
        to a block of provisioned text or by direct specification of
        text in a parameter.

        SILENCE:  A specification of a length of silence to be played in
        units of 100 milliseconds.

        TONE: The specification of a tone to be played by algorithmic
        generation.  Most tones however will probably be recorded, not
        generated. Exact specification of this segment type is tbd.

        VARIABLE:  The specification of a voice variable by the parame-
        ters of type, subtype, and value.  Specification of variables is
        considered in more detail in a subsequent section of this docu-
        ment.

        SEQUENCE: A reference by unique id to a provisioned sequence of
        mixed RECORDING, TEXT, SILENCE, TONE, VARIABLE, SET, or SEQUENCE
        segments. Recursive definition of SEQUENCE segments is allowed.
        Direct or transitive definition of a SEQUENCE segment in terms
        of itself is not allowed.

        SET:  A  reference by unique id to a provisioned set of seg-
        ments.  The intended and recommended use of the SET type is that
        all segments in the set should be semantically equivalent, how-
        ever there is no real way of enforcing this restriction either
        in the protocol or in provisioning.  Every set has an associated
        selector which is used at runtime to resolve the set reference
        to a specific element of the set.  The elements of a set may one
        of the following segment types:  RECORDING, TEXT, TONE, SILENCE,
        SEQUENCE, or SET.  Specific selector types are not specified by
        the protocol and must be defined by the user.  Recursive defini-
        tion of SET segments is allowed. Direct or transitive definition
        of a SET segment in terms of itself is not allowed.


Cromwell                  expires June 1, 2000                  [Page 5]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


   1.3.  Variables

   The Advanced Audio Package provides  two kinds of voice variables.
   Embedded embedded variables are variables that have been provisioned
   as part of a segment. Direct variables are completely specified in
   the protocol message.

   Variables are specified by the following parameters: type, subtype,
   and value.  Variable types include Date, Money, Number, Time, etc.
   Subtype is a refinement of type.  For example the variable type Money
   might have an associated range of subtypes such as Dollar, Rupee,
   Dinar, etc.  Not all variables require a subtype, and for these vari-
   ables the subtype parameter should be set to null.

   For embedded variables, only the value is specified since the type
   and subtype have been provisioned.  In a list of segments, an embed-
   ded variable specification applies only to the segment that directly
   preceeds it.  If a segment has multiple embedded variables, the
   values must be given in the order in which the variables are encoun-
   tered when the segment is played.

   Typically embedded variables are provisioned along with recorded
   speech, e.g. "A representative will be with you in approximately 5
   minutes.  If you would prefer to leave a voice message, press 1 now."
   where the variable is the number of minutes.  To play the preceeding
   announcement the MGC would specify the id of the audio segment and a
   variable value, in this case 5.

   In some cases it may be desirable to play an announcement that con-
   tains an embedded variable without playing the variable itself.  To
   do this set the value of the embedded variable to "null".


   2.  Advanced Audio Package

   PackageID:  au

   Version:  1

   Extends:  None

   This package defines events and signals for an ARF package for an
   Audio Server Media Gateway.


Cromwell                  expires June 1, 2000                  [Page 6]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


   2.1.  Properties

        None


   2.2.  Events


   2.2.1.  Audio Operation Complete

        EventID: audcomp

        Signifies the successful completion of a play, playcol, or
        playrec signal.

        EventDescriptor parameters

             None

        ObservedEventDescriptor parameters


             Voice interrupted

                  ParameterID: vi

                  Whether or not the initial prompt of a playrec signal
                  was interrupted by voice.

                  Type: string

                  Possible values:  true or false


             Interrupting key sequence

                  ParameterID: ik

                  The key or key sequence that interrupted the initial
                  prompt of a playrec signal, if any.

                  Type: string

                  Possible values:  any sequence of valid DTMF digits.


             Amount played


Cromwell                  expires June 1, 2000                  [Page 7]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


                  ParameterID: ap

                  The length played of an interrupted prompt, in 100 ms
                  units.  Type: integer

                  Possible values:


             Number of attempts

                  ParameterID: na

                  The number of attempts the user needed to enter a
                  valid digit pattern or to make a recording.

                  Type: integer

                  Possible Values:


             Digits collected

                  ParameterID: dc

                  The DTMF digits that were collected during a play sig-
                  nal.

                  Type: string

                  Possible Values: any sequence of valid DTMF digits.


             Recording id

                  ParameterID: ri

                  A 32 bit binary integer assigned to audio recording
                  made during a playrec signal.

                  Type: integer

                  Possible Values:


             Return code

                  ParameterID: rc


Cromwell                  expires June 1, 2000                  [Page 8]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


                  A return code indicating that the signal was success-
                  ful and possibly carrying other information as well.
                  Currently the only defined success return code is 100
                  (unspecified success.

                  Type: integer

                  Possible Values:  success return codes range from
                  100-1999.


        2.2.2.  Audio Operation Failure

             EventID: audfail

             Signifies the failure of a play, playcol, or playrec sig-
             nal.

             EventDescriptor parameters

                  None

             ObservedEventDescriptor parameters

                  ParameterID: rc

                  A return code indicating why the signal failed.
                  Currently the only defined failure return code is 300
                  (unspecified failure).

                  Type: integer

                  Possible Values:  failure return codes range from
                  300-399.


2.3.  Signals


2.3.1.  Play

     SignalID: play

     Plays one or more audio segments.


Cromwell                  expires June 1, 2000                  [Page 9]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


     Required Parameters: Announcement.

     Parameters:

          Announcement

               ParameterID:  an

               An announcement to be played.  Consist of one or more
               audio segments.

               Type: string

               Possible values: integer,ts,dt,si,tn,vb,ev,alias,lsl

                    TextToSpeech

                         ValueTag:  ts

                         Specifies a text string to be converted to
                         speech.

                         Syntax: ts/<string>


                    DisplayText

                         ValueTag:  dt

                         Specifies a text string to be displayed on a
                         device.

                         Syntax: dt/<string>


                    Silence

                         ValueTag:  si

                         Specifies a length of silence to be played in
                         units of 100 milliseconds.

                         Syntax: si/<integer>


                    Tone

                         ValueTag:  tn


Cromwell                  expires June 1, 2000                 [Page 10]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


                         Specifies a tone to be played by algorithmic
                         generation.  Exact specification of this param-
                         eter is tbd.  Most tones will likely be
                         recorded, not generated.

                         Syntax: tbd


                    Variable

                         ValueTag:  vb

                         Specifies a voice variable by type, subtype,
                         and value.

                         Syntax:  vb/<type>/<subtype>/<value>

                         Variable Types:

                              dat (date)

                                   Speaks a date specified as YYYYMMDD
                                   (per ISO 8601, International Date and
                                   Time Notation [8]).  For example
                                   "19981015" is spoken as "October
                                   fifteenth nineteen ninety eight."

                              dig (digits)

                                   Speaks a string of digits one at a
                                   time.  If the subtype is North Ameri-
                                   can DN, the format of which is NPA-
                                   NXX-XXXX, the digits are spoken with
                                   appropriate pauses between the NPA
                                   and NXX and between the NXX and XXXX.
                                   If the subtype is generic, the digits
                                   are spoken no pauses.

                              dur (duration)

                                   Duration is specified in seconds and
                                   is spoken in one or more units of
                                   time as appropriate, e.g. "3661" is
                                   spoken as "One hour, one minute, and
                                   one second."

                              mny (money)


Cromwell                  expires June 1, 2000                 [Page 11]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


                                   Money is specified in the smallest
                                   units of a given currency and is spo-
                                   ken in one or more units of currency
                                   as appropriate, e.g. "110" in U.S.
                                   Dollars would be spoken "one dollar
                                   and ten cents."


                              mth (month)

                                   Speaks the specified month, e.g. "10"
                                   is spoken as "October."  Specifica-
                                   tion is in MM format with "01" denot-
                                   ing January, "02" denoting February,
                                   etc.

                              num (number)

                                   Speaks a number in cardinal form or
                                   in ordinal form.  For example, "100"
                                   is spoken as "one hundred" in cardi-
                                   nal form and "one hundredth" in ordi-
                                   nal form.

                              sil (silence)

                                   Plays a specified period of silence.
                                   Specification is in 100 millisecond
                                   units.

                              str (string)

                                   Speaks each character of a string,
                                   e.g. "a34bc" is spoken "A, three,
                                   four, b, c."  Valid characters are
                                   a-z, A-Z, 0-9, #, and *.

                              txt (text) Text:

                                   Produces the specified text as speech
                                   or displays it on a device.

                              tme (time)

                                   Speaks a time specified as HHMM (per
                                   ISO 8601, International Data and Time
                                   Notation [8]) in twenty four hour
                                   format in either twelve hour format


Cromwell                  expires June 1, 2000                 [Page 12]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


                                   or twenty four hour format. For exam-
                                   ple "1700" is spoken as "Five pm" in
                                   twelve hour format or as "Seventeen
                                   hundred hours" in twenty four hour
                                   format.

                              ton (tone)

                                   Plays an algorithmically generated
                                   tone, specification of which is tbd.
                                   Probably most applications will use
                                   prerecorded tones.

                              wkd  (weekday)

                                   Speaks the day of the week, e.g.
                                   "Monday."  Weekdays are specified as
                                   single digits, with "1" denoting Sun-
                                   day, "2" denoting Monday, etc.


                         Variable Subtypes:

                              null

                                   If a particular variable type does
                                   not have an associated subtype the
                                   subtype must be set to "null".

                              gen

                                   Generic digits; only valid when used
                                   used with the digits variable type.
                                   Causes a string of digits supplied by
                                   the variable value parameter to be
                                   played one at a time with no pauses
                                   between digits.

                              ndn
                                   North American DN; only valid when
                                   used used with the digits variable
                                   type.  Causes a string of nine digits
                                   supplied by the variable value param-
                                   eter to be spoken in NPA-NXX-XXXX
                                   format, i.e. the digits are spoken
                                   with appropriate pauses between the
                                   NPA and NXX and between the NXX and
                                   XXXX.


Cromwell                  expires June 1, 2000                 [Page 13]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


                              crd
                                   Cardinal Number; only valid when used
                                   used with the digits variable type.
                                   Causes the number supplied by the
                                   variable value parameter to be spoken
                                   in cardinal form, e.g. "100" is spo-
                                   ken as "one hundred." form.

                              ord
                                   Ordinal Number; only valid when used
                                   used with the digits variable type.
                                   Causes the number supplied by the
                                   variable value parameter to be spoken
                                   in cardinal for, e.g. "100" is spoken
                                   as "one hundredth."

                              dsp
                                   Display text; only valid when used
                                   with the text variable type.  Causes
                                   the text supplied by the variable
                                   value parameter to be displayed on a
                                   device.


                              spk
                                   Spoken text; only valid when used
                                   with the text variable type.  Causes
                                   the text supplied by the variable
                                   value parameter to be spoken (i.e.
                                   text to speech).


                              t12
                                   Twelve hour format; only valid when
                                   used with the time variable type.
                                   Causes the time supplied by the vari-
                                   able value parameter to be spoken in
                                   twelve hour format.  For example
                                   "1700" would be spoken as "Five pm."


                              t24
                                   Twenty four hour format; only valid
                                   when used with the time variable
                                   type. Causes the time supplied by the
                                   variable value parameter to be spoken
                                   in twenty four hour format.  For
                                   example "1700" would be spoken as


Cromwell                  expires June 1, 2000                 [Page 14]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


                                   "Seventeen hundred hours."


                              <various language codes>

                                   The three letter codes defined in ISO
                                   4217, Currency And Funds Code List
                                   [7] are used to specify the currency
                                   subtype. A small excerpt from ISO
                                   4217 follows:


                 __________________________________________________________
                 |Alpha-code | Numeric-code | Currency |      Entity       |
                 |___________|______________|__________|___________________|
                 |GQE        | 226          | Ekwele   | Equatorial Guinea |
                 |GRD        | 300          | Drachma  | Greece            |
                 |GTQ        | 320          | Quetzal  | Guatemala         |
                 |___________|______________|__________|___________________|


                    Embedded Variable

                         ValueTag:  ev

                         Specifies the value for a previously pro-
                         visioned voice variable.

                         Syntax: ev/<string>

                         Possible values: In some cases it may be desir-
                         able to play an announcement that contains an
                         embedded variable without playing the variable
                         itself.  To do this set the value of the embed-
                         ded variable to "null".


                    Alias

                         ValueTag:  alias

                         Indicates a reference to an audio segments is
                         by alias rather than segment id.  The ARF
                         software supports alias to segment id mapping.
                         The mapping is set up via provisioning.

                         Syntax:  alias/<string>


Cromwell                  expires June 1, 2000                 [Page 15]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


                         Possible values:


                    LocalSelector

                         ValueTag:  lsl

                         See the definition of the GlobalSelector param-
                         eter (following) for a discussion of selectors,
                         tags, and values.  A LocalSelector works
                         exactly like a GlobalSelector except that it
                         applies to only a single audio segment within
                         an announcement.

                         Syntax:  lsl/<tag>/<value>


          GlobalSelector

               ParameterID:  gsl

               A selector is a user defined tag and an associated value
               which is used to resolve a reference to a set of semanti-
               cally related segment ids to a particular id.  There is
               one pre-defined selector name for language, "lang". The
               values associated with this selector are the three letter
               codes defined in ISO standard 639-2, Code For The
               Representation Of Names Of Languages [6]. A small excerpt
               from ISO 639-2 follows:


                                   _________________
                                   |Code | Language |
                                   |_____|__________|
                                   |cze  | Czech    |
                                   |cym  | Welsh    |
                                   |dan  | Danish   |
                                   |_____|__________|


               For example, an ARF provisioner could define the selector
               tag "accent", a set of possible values:  "southernUS",
               "northernUS", and "midwesternUS", and a default value,
               "midwesternUS" for instance.  The provisioner would then
               provision the audio required to support each of these
               accents.  During a call if the MGC asked the ARF to play
               a prompt it could specify the "accent" selector tag and
               one of the three defined values, and the prompt would be
               played in the accent selected.  If the MGC did not


Cromwell                  expires June 1, 2000                 [Page 16]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


               specify a selector the prompt would be played in the
               default accent.

               Global selectors apply to all segments within an
               announcement (e.g. Announcement, InitialPrompt, Reprompt,
               etc.).

               Type: string

               Syntax:  gsl=<tag>/<value>

               Possible values:  all tags and values are defined by the
               ARF provisioner with the exception of the "lang" selector
               tag and associated values which are defined in ISO stan-
               dard 639-2.


          NonInterruptiblePlay

               ParameterID:  ni

               Specifies whether or not an initial prompt is interrupti-
               ble by either voice or digits.  Defaults to false.

               Type: string

               Possible values:  true or false.  Defaults to false.


          Iterations

               ParameterID:  it

               The maximum number of times an announcement is to be
               played.

               Type: string

               Possible values:  Defaults to one.  A value of minus one
               indicates that the announcement is to be repeated for-
               ever.


          Interval

               ParameterID:  iv

               The interval of silence to be inserted between iterative


Cromwell                  expires June 1, 2000                 [Page 17]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


               plays.  Specified in units of 100 milliseconds.

               Type: integer

               Possible values:  Defaults to one second.


          Duration

               ParameterID:  du

               The maximum amount of time to play and possibly replay an
               announcement. Takes precedence over iteration and inter-
               val.  Specified in units of 100 milliseconds.  No
               default.

               Type: integer

               Possible values:


          Speed

               ParameterID:  sp

               The relative playback speed of announcement specifiable
               as a positive or negative percentage variation from the
               normal playback speed.

               Type: signed integer

               Possible values:


          Volume

               ParameterID:  vl

               The relative playback volume of announcement specifiable
               as a positive or negative percentage variation from the
               normal playback volume.

               Type: signed integer

               Possible values:


Cromwell                  expires June 1, 2000                 [Page 18]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


2.3.2.  PlayCollect

     SignalID: playcol

     Plays an announcement (optionally) and collects dtmf digits input
     by the user.

     Required Parameters: none.

     Parameters:

          InitialPrompt

               ParameterID:  ip

               The initial announcement prompting the user to either
               enter DTMF digits or to speak.  Consists of one or more
               audio segments.  If not specified, digit collection or
               recording begins immediately.

               Type: string

               Possible values: See Play Signal Announcement Parameter
               for details.


          Reprompt

               ParameterID:  rp

               Played after the user has made an error such as entering
               an invalid digit pattern or not speaking.  Consists of
               one or more audio segments.  Defaults to Initial Prompt.

               Type: string

               Possible values: see Play Signal Annoumcement Parameter
               for details.


          NoDigitsPrompt

               ParameterID:  nd

               Played after the user has failed to enter a valid digit
               pattern during a Play Collect event.  Consists of one or
               more audio segments.  Defaults to Reprompt.


Cromwell                  expires June 1, 2000                 [Page 19]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


               Type: string

               Possible values: see Play Signal Annoumcement Parameter
               for details.


          SuccessAnnouncement

               ParameterID:  sa

               Played when data collection has succeeded.  Consists of
               one or more audio segments.  No default.

               Type: string

               Possible values: see Play Signal Annoumcement Parameter
               for details.


          FailureAnnouncement

               ParameterID:  fa

               Played when all data entry attempts have failed.  Con-
               sists of one or more audio segments.  No default.

               Type: string

               Possible values: see Play Signal Annoumcement Parameter
               for details.


          GlobalSelector

               ParameterID:  gsl

               See definition of this parameter in the play Signal sec-
               tion.


          NonInterruptiblePlay

               ParameterID:  ni

               Specifies whether or not an initial prompt is interrupti-
               ble by either voice or digits.  Defaults to false.

               Type: string


Cromwell                  expires June 1, 2000                 [Page 20]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


               Possible values:  true or false.  Defaults to false.

          Speed

               ParameterID:  sp

               See definition of this parameter in the play Signal sec-
               tion.


          Volume

               ParameterID:  vl

               See definition of this parameter in the play Signal sec-
               tion.


          ClearDigitBuffer

               ParameterID:  cb

               If set to true, clears the digit buffer before playing
               the initial prompt.

               Type: string

               Possible values:  true and false.  Defaults to false.


          MaxDigits

               ParameterID: mx

               The maximum number of digits to collect.  Defaults to
               one.

               Type: integer

               Possible values:


          MinDigits

               ParameterID: mn

               The minimum number of digits to collect.  Defaults to
               one.


Cromwell                  expires June 1, 2000                 [Page 21]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


               Type: integer

               Possible values:


          DigitPattern

               ParameterID: dp

               Type: string

               An extended regular expression specifying a digit collec-
               tion pattern.  Uses extended regular expressions as sup-
               ported by the Rogue Wave Class Library [9], which sup-
               ports a subset of the POSIX.2 standard [10] for regular
               expressions.  If a pattern is not specified, pattern
               matching is not attempted.

               OPEN ISSUE:  Currently the H.248 specification does not
               allow the left square bracket, right square bracket, or
               vertical line characters in parameter values.  These
               characters are necessary to implement extended regular
               expressions.

               Possible values:


          FirstDigitTimer

               ParameterID: fdt

               The amount of time allowed for the user to enter the
               first digit. Specified in units of 100 milliseconds.
               Defaults to five seconds.

               Type: integer

               Possible values:


          InterDigitTimer

               ParameterID: idt

               The amount of time allowed for the user to enter each
               subsequent digit. Specified units of 100 milliseconds
               seconds.  Defaults to three seconds.


Cromwell                  expires June 1, 2000                 [Page 22]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


               Type: integer

               Possible values:


          ExtraDigitTimer

               ParameterID: edt

               The amount of time to wait for a user to enter a final
               digit once the maximum expected amount of digits have
               been entered.  Typically this timer is used to wait for a
               terminating key in applications which have defined a
               specific key to terminate input.  Specified in units of
               100 milliseconds. If not specified, this timer is not
               activated.

               Type: integer

               Possible values:


          RestartKey

               ParameterID: rsk

               Defines a key sequence consisting of a command key
               optionally followed by zero or more keys.  This key
               sequence has the following action:  discard any digits
               collected or recording in progress, replay the prompt,
               and resume digit collection or recording.  No default.
               An application that defines more than one command key
               sequence, will typically use the same command key for all
               command key sequences.  If more than one command key
               sequence is defined, then all key sequences must consist
               of a command key plus at least one other key. Restart ,
               reinput, and return keys take precedence over all other
               keys.

               Type: string

               Possible values:  0-9,*,#

          Reinput Key:

               ParameterID: rik

               Defines a key sequence consisting of a command key


Cromwell                  expires June 1, 2000                 [Page 23]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


               optionally followed by zero or more keys.  This key
               sequence has the following action: discard any digits
               collected or recordings in progress and resume digit col-
               lection or recording. No default.  An application that
               defines more than one command key sequence, will typi-
               cally use the same command key for all command key
               sequences.  If more than one command key sequence is
               defined, then all key sequences must consist of a command
               key plus at least one other key.  Restart , reinput, and
               return keys take precedence over all other keys.

               Type: string

               Possible values:  0-9,*,#

          Return Key:

               ParameterID: rtk

               Defines a key sequence consisting of a command key
               optionally followed by zero or more keys.  This key
               sequence has the following action:  terminate the current
               event and any queued event and return the terminating key
               sequence to the call processing agent.  No default.   An
               application that defines more than one command key
               sequence, will typically use the same command key for all
               command key sequences.  If more than one command key
               sequence is defined, then all key sequences must consist
               of a command key plus at least one other key. Restart ,
               reinput, and return keys take precedence over all other
               keys.

               Type: string

               Possible values:  0-9,*,#

          Position Key:

               ParameterID: psk

               Defines a key with the following action.  Stop playing
               the current announcement and resume playing at the begin-
               ning of the first, last, previous, next, or the current
               segment of the announcement.  The position key is valid
               only during the play phase of a signal and takes pre-
               cedence over the start input key and the end input key.
               No default.


Cromwell                  expires June 1, 2000                 [Page 24]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


               Type: string

               Syntax: psk/<key>/<action>

               Possible values:

                    <key> :   0-9,*, or #

                    <action> :  fst, lst, prv, nxt, or cur


          StopKey

               ParameterID: stk

               Defines a key the striking of which terminates playback
               of the announcement. The position key is valid only dur-
               ing the play phase of a signal and takes precedence over
               the start input key and the end input key.  No default.

               Type: string

               Possible values:  0-9,*, or #


          StartInputKey

               ParameterID: sik

               Defines a set of keys that are acceptable as the first
               digit collected. This set of keys can be specified to
               interrupt a playing announcement or to not interrupt a
               playing announcement.  The default key set is 0-9. The
               default behavior is to interrupt a playing announcement
               when a Start Input Key is pressed.  This behavior can be
               overidden by using the ni (Non-Interruptible Play) param-
               eter.  Specification is a list of keys with no sepera-
               tors, e.g.  123456789#.

               Type: string

               Possible values:  0-9,*,#


          EndInputKey

               ParameterID: eik


Cromwell                  expires June 1, 2000                 [Page 25]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


               Specifies a key that signals the end of digit collection
               or voice recording. The default end input key is the #
               key. To specify that no End Input Key be used the parame-
               ter is set to the string "null".  The default behavior
               not to return the End Input Key in the digits returned to
               the call agent.  This behavior can be overidden by the
               Include End Input Key (eik)

               Type: string

               Possible values:  0-9,*, or #


          IncludeEndInputKey

               ParameterID: iek

               By  default the End Input Key is not included in the col-
               lected digits returned to the call agent.  If this param-
               eter is set to true then the End Input Key will be
               returned with the collected digits returned to the call
               agent.

               Type: string

               Possible values:  true or false.  Defaults to false.


          NumberOfAttempts

               ParameterID: na

               The number of attempts the user is given to enter a valid
               digit pattern or to make a recording.  Defaults to one.

               Type: integer

               Possible values:


2.3.3.  PlayRecord

     SignalID: playrec

     Plays an announcement (optionally) and records user speech.

     Required Parameters: RecordLengthTimer


Cromwell                  expires June 1, 2000                 [Page 26]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


     Parameters:

          InitialPrompt

               ParameterID:  ip

               See definition of this parameter in the playcol Signal
               section.


          Reprompt

               ParameterID:  rp

               See definition of this parameter in the playcol Signal
               section.


          NoSpeechPrompt

               ParameterID:  ns

               Played after the user has failed to speak during a Play
               Record event.  Consists of one or more audio segments.
               Defaults to Reprompt.

               Type: string

               Possible values: see play Signal Annoumcement Parameter
               for details.


          SuccessAnnouncement

               ParameterID:  sa

               See definition of this parameter in the playcol Signal
               section.


          FailureAnnouncement

               ParameterID:  fa

               See definition of this parameter in the playcol Signal
               section.


Cromwell                  expires June 1, 2000                 [Page 27]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


          GlobalSelector

               ParameterID:  gsl

               See definition of this parameter in the play Signal sec-
               tion.


          NonInterruptiblePlay

               ParameterID:  ni

               See definition of this parameter in the playcol Signal
               section.


          Speed

               ParameterID:  sp

               See definition of this parameter in the play Signal sec-
               tion.


          Volume

               ParameterID:  vl

               See definition of this parameter in the play Signal sec-
               tion.


          ClearDigitBuffer

               ParameterID:  cb

               See definition of this parameter in the playcol Signal
               section.


          PreSpeechTimer

               ParameterID: prt

               The amount of time to wait for the user to initially
               speak.  Specified in units of 100 milliseconds.  Defaults
               to three seconds.


Cromwell                  expires June 1, 2000                 [Page 28]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


               Type: integer

               Possible values:


          PostSpeechTimer

               ParameterID: pst

               The amount of silence necessary after the end of the last
               speech segment for the recording to be considered com-
               plete.  Specified in units of 100 milliseconds.  Defaults
               to two seconds.

               Type: integer

               Possible values:


          RecordLengthTimer

               ParameterID: rlt

               The maximum allowable length of the recording, not
               including pre or post speech silence.  Specified in units
               of 100 milliseconds.

               Type: integer

               Possible values:


          RestartKey

               ParameterID: rsk

               See definition of this parameter in the playcol Signal
               section.

          Reinput Key:

               ParameterID: rik

               See definition of this parameter in the playcol Signal
               section.

          Return Key:


Cromwell                  expires June 1, 2000                 [Page 29]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


               ParameterID: rtk

               See definition of this parameter in the playcol Signal
               section.

          Position Key:

               ParameterID: rtk

               See definition of this parameter in the playcol Signal
               section.


          StopKey

               ParameterID: stk

               See definition of this parameter in the playcol Signal
               section.


          EndInputKey

               ParameterID: eik

               See definition of this parameter in the playcol Signal
               section.


          NumberOfAttempts

               ParameterID: na

               The number of attempts the user is given to enter a valid
               digit pattern or to make a recording.  Defaults to one.

               Type: integer

               Possible values:


          RecordPersistantAudio

               ParameterID: rpa

               If set to true, the recording that is made is persistant
               instead of temporary.


Cromwell                  expires June 1, 2000                 [Page 30]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


               Type: string

               Possible values:  true or false.  Defaults to false.


          DeletePersistantAudio

               ParameterID: dpa

               Indicates that the specified persistant audio segment is
               to be deleted.

               Type: string

               Possible values:


          OverrideAudio

               ParameterID: oa

               Indicates that the specified provisioned audio segment is
               to be overridden with a persistant audio segment to be
               recorded shortly.

               Type: string

               Possible values:


          RestoreAudio

               ParameterID: ra

               Indicates that the provisioned audio segment originally
               associated with the specified segment id is to to be
               restored and that the overriding persistant audio is to
               be deleted.  This parameter is carried by the play record
               signal, although nothing is either played or recorded in
               this case.

               Type: string

               Possible values:


Cromwell                  expires June 1, 2000                 [Page 31]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


2.3.4.  EndAudioSignal

     SignalID: endaudsig

     Causes a play, playcol, or playrec signal to terminate normally.
     This signal would be applied for example to gracefully terminate an
     open ended playrec. Application of a new signal list (possibly
     empty) that did not contain the currently executing signal would
     abort the operation; no recording would be saved and no recording
     id would be returned to the MGC.

     OPEN ISSUE:  This type of event is not currently defined in H.248,
     however it is clear that mechanisms for both aborting and grace-
     fully terminating certain signals are needed.

     Parameters:

          None


2.4.  Statistics

     None


2.5.  Procedures

     None


3.  Syntax Rules

     1.  Parameters to Advanced Audio Package signals may occur in any
     order.

     2.  The following parameters may be have more than one value, i.e.
     the value may be an orderded list of values:  Announcement (an),
     InitialPrompt (ip), Reprompt (rp), NoDigits (nd), No Speech (ns),
     Success Announcement (sa), or FailureAnnouncement (fa) parameter
     value.  These parameters are known collectively as announcement
     parameters.  The values in parameter list are played from left to
     right.

     3.  An announcement parameter value list may contain the following
     types of values:  a audio segment id (integer), TextToSpeech value,
     DisplayText value, Silence value, Tone value, Variable value,
     EmbeddedVariable value, and LocalSelector value.


Cromwell                  expires June 1, 2000                 [Page 32]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


     4.  There are two types of values in an announcement parameter
     list.  The first type is a playable value:  audio segment id,
     TextToSpeech, DisplayText, Silence, Tone, and Variable.  The second
     type qualifies a playable value: EmbeddedVariable and LocalSelec-
     tor.  A qualifier value must always be preceeded by a playable
     value.

     5.  Qualifier values apply to the immediately preceeding value in
     the list.

     6. If an EmbeddedVariable value and a LocalSelector value apply to
     the same value, the EmbeddedVariable value always comes first.  In
     this case the LocalSelector applies to both the value being modi-
     fied and the embedded variable.

     7. LocalSelector values have precedence over GlobalSelector parame-
     ters.


4.  Formal Syntax Description

ABNF specification needs to be added.


5.  Examples

This section presents a number of examples of how to specify Advanced
Audio Package signals.


Play an announcement that consists of a single segment:

     Signals { au/play { an = 39 } }


Play an announcement that consists of multiple segments:

     Signals { au/play { an = 39 , 121 , 67 } }


Play an announcement that consists of a recording followed by silence
followed by text to speech followed by a direct voice variable:

     Signals { au/play { an = [ 39 , si/30 , ts/hello ,
               vb/mny/usd/1995 ] }


Cromwell                  expires June 1, 2000                 [Page 33]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


Play an announcement with an embedded variable.  If the four segments of
the previous announcement were provisioned as segment 239 with the vari-
able type provisioned as mny and the variable subtype provisioned as
usd, the following would be exactly equivalent to the play in the
preceeding example:

     Signals { au/play { an = [ 239 , ev/1995 ] } }


Play an announcement with two embedded variables:

     Signals { au/play { an = [ 113 , ev/3999 , ev/20001015 ]  } }


Play a prompt and collect a single digit (the default).  If need be,
play a reprompt, a no digits prompt, and a success or failure announce-
ment.  Give the user three attempts to enter a digit:

     Signals { au/playcol { ip = 21 , rp = 109 , nd = 102 , sa = 72 ,
                            fa = 81 , na = 3 } }


Play a prompt and collect a single digit.  If the user does not enter a
digit replay the intial prompt.  Give the user three attempts to enter a
digit:

     Signals { au/playcol { ip = 21 , na = 3  } }


Play a prompt and record voice.  If the user does not speak play a no
speech prompt.  Give the user two attempts to record:

     Signals { au/playrec { ip = 22 , ns = 42 , na = 2  } }


Play an announcement ten percent faster than normal speed and five per-
cent softer than normal volume.  Play the announcement three times with
two seconds of silence between plays.

     Signals { au/play { an = 27 , sp = +10 , vl = -5 , it = 3 ,
                         iv = 20  } }


Give the user two attempts to enter a three digit pattern.  Clear the
digit buffer before playing the prompt. The user can signal end of input
using the # key (the default), which is not returned to the call agent
with the collected digits (by default).


Cromwell                  expires June 1, 2000                 [Page 34]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


     Signals { au/playcol { ip = 43 , cb = true , mn = 3 , mx = 3 ,
                            na = 2  } }


Give the user three attempts to enter a three digit pattern.  If the
user enters one digits or two digits on the first or second attempts a
reprompt is played.  If the user enters no digits on the first or second
attempts a no digits reprompt is played.  If all three attempts fail, a
failure announcement is played.  If one of the attempts is successful, a
success announcement is played and the collected digits are returned to
the call agent.  The user can signal end of input using the # key (the
default). If the # key terminates a successful input attempt, the col-
lected digits, but not the # key (the default), are returned to the call
agent.

     Signals { au/playcol { ip = 87 , rp = 5 , nd = 409 , fa = 9 ,
                            sa = 18 , mx = 3 ,  mn = 3 , na = 3  } }


Give the user a single attempt to enter a 1 (the default minimum) to 4
digit pattern, allow 8 seconds for the user to enter the first digit,
and allow 6 seconds for the user to enter each subsequent digit.  If the
subsequent digit timer expires after the user has less than four digits,
the digits collected are returned to the call agent.  The user can sig-
nal end of input using the # key which is not returned to the call agent
with the collected digits.

     Signals { au/playcol { ip = 4 , fdt = 80 , idt = 60 , mx = 4  } }


Give the user one chance to enter 2 digits where the first digit is 3,4,
or 5 and the second digit is any digit except 5, 6, or 7.

     Signals { au/playcol { ip = 8 , dp = [3-5][^567]  } }


Give the user three chances to enter an 11 digit number that begins with
0 or 1.  If the user makes a mistake while entering digits, he can press
the * key to discard any digits already collected, replay the prompt,
and resume collection.

     Signals { au/playcol { ip = 33 , mn = 11 , mx = 11 , sik = 01 ,
                            rsk = * , na = 3  } }


Give the user three chances to enter an 11 digit number that begins with
0 or 1.  If the user makes a mistake while entering digits, he can press
the key sequence *11 to discard any digits already collected, replay the


Cromwell                  expires June 1, 2000                 [Page 35]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


prompt, and resume collection.  If the user enters the key sequence *12
the play collect is terminated along with any queued events, and the
terminating key sequence is returned to the call agent for processing.

     Signals { au/playcol { ip = 33 , mn = 11 , mx = 11 , sik = 01 ,
                            rsk = *11 , rtk = *12 , na = 3  } }


Give the user two chances to make a recording.  After playing the
prompt, wait 5 seconds for the user to speak.  If no speech is detected
replay the initial prompt and try again.  If the user does speak, wait
for seven seconds after speech stops to make sure the user is finished.
If the recording is successful, return a reference to the recording to
the call agent.

     Signals { au/playrec {  ip = 6 , prt = 50 , pst = 70 , rlt = 600 ,
                             na = 2 } }


Play an announcement in the default language:

     Signals { au/play { an = 5  } }


Play the same announcement in the English.  In the first example, the
selector applies to the an segment; in the second it applies to the pa
operation.  For these particular examples, the net effect is the same.

     Signals { au/play { an = [ 5 , lsl/lang/eng ]  } }

     Signals { au/play { an = 5 , gsl = lang/eng  } }


Play an announcement in Danish using a female voice.

     Signals { au/play { an = 6 , gsl = lang/dan ,
                         gsl = gender/female  } }

Play the first part of an announcement in English, the second part in
the default language, and the third part in French.

     Signals { au/play { an = [ 5 , lsl/lang/eng , 6 , 7 ,
                         lsl/lang/fra  } }

Play an announcement with an embedded variable in English:

     Signals { au/play { an = [ 5 , ev/20001015 ] , gsl = lang/eng } }


Cromwell                  expires June 1, 2000                 [Page 36]


INTERNET DRAFT       MEGACO Advanced Audio Package      December 1, 1999


6.  References

     [1] Bradner, S., "Key words for use in RFCs to Indicate Requirement
     Levels", BCP 14, RFC 2119, March 1997.

     [2] ITU Proposed Recommendation  H.248,  Version  12,  October  29,
     1999.

     [3] Cromwell, D., Durling, M., "Suggested Requirements For  Control
     Of An IVR Function", Version 0.0, April 1999, INTERNET DRAFT.

     [5] Greene, N., Ramalho, M, and Rosen, B., "Media  Gateway  control
     protocol  architecture  and requirements", Version 0.8, October 21,
     1999, INTERNET DRAFT.

     [6]   ISO  639-2,  "Code  For  The  Representation  Of   Names   Of
     Languages", 1998.

     [7]  ISO 4217, "Currency And Funds Code List", 1981.

     [8]  ISO 8601, "International Date and Time Notation", 1998.

     [9]  Tools.h++ Class Reference Version 7, Rouge Wave Software Inc.,
     1996.

     [10]   ANSI/IEEE  Standard  1003.2   (Portable   Operating   System
     Interface), Version D11.2, September 1991.


7.  Author's Address

David Cromwell
Nortel Networks
Box 130010
35 Davis Drive
Research Triangle Park, NC 27709
Email: cromwell@nortelnetworks.com

Phone: (919) 991-8870


Cromwell                  expires June 1, 2000                 [Page 37]