This document defines a set of ECMAScript APIs in WebIDL to allow media to be sent to and received from another browser or device implementing the appropriate set of real-time protocols. This specification is being developed in conjunction with a protocol specification developed by the IETF RTCWEB group and an API specification to get access to local media devices developed by the Media Capture Task Force.
This document is neither complete nor stable, and as such is not yet suitable for commercial implementation. However, early experimentation is encouraged. The API is based on preliminary work done in the WHATWG. The Web Real-Time Communications Working Group expects this specification to evolve significantly based on:
There are a number of facets to video-conferencing in HTML covered by this specification:
This document defines the APIs used for these features. This specification is being developed in conjunction with a protocol specification developed by the IETF RTCWEB group and an API specification to get access to local media devices [[!GETUSERMEDIA]]developed by the Media Capture Task Force. An overview of the system can be found in [[RTCWEB-OVERVIEW]] and [[RTCWEB-SECURITY]].
This specification defines conformance criteria that apply to a single product: the user agent that implements the interfaces that it contains.
Conformance requirements phrased as algorithms or specific steps may be implemented in any manner, so long as the end result is equivalent. (In particular, the algorithms defined in this specification are intended to be easy to follow, and not intended to be performant.)
Implementations that use ECMAScript to implement the APIs defined in this specification must implement them in a manner consistent with the ECMAScript Bindings defined in the Web IDL specification [[!WEBIDL]], as this specification uses that specification and terminology.
The EventHandler
interface represents a callback used for event handlers as defined in
[[!HTML5]].
The concepts queue a task and fires a simple event are defined in [[!HTML5]].
The terms event, event handlers and event handler event types are defined in [[!HTML5]].
The terms MediaStream, MediaStreamTrack, Constraints, and Consumer are defined in [[!GETUSERMEDIA]].
An RTCPeerConnection
allows two users to
communicate directly, browser to browser. Communications are coordinated
via a signaling channel which is provided by unspecified means, but
generally by a script in the page via the server, e.g. using
XMLHttpRequest
.
An array containing URIs of servers available to be used by ICE, such as STUN and TURN server.
Indicates which candidates the ICE engine is allowed to use.
Sets the target peer identity for the RTCPeerConnection. The RTCPeerConnection will establish a connection to a remote peer unless it can be successfully authenticated with the provided name.
STUN or TURN URI(s) as defined in [[!RFC7064]] and [[!RFC7065]] or other URI types.
If this RTCIceServer
object represents a
TURN server, then this attribute specifies the username to use with
that TURN server.
If this RTCIceServer
object represents a
TURN server, then this attribute specifies the credential to use
with that TURN server.
In network topologies with multiple layers of NATs, it is desirable to have a STUN server between every layer of NATs in addition to the TURN servers to minimize the peer to peer network latency.
An example array of RTCIceServer objects is:
[ { "urls": "stun:stun1.example.net" }, { "urls":
"turn:turn.example.org", "username": "user", "credential": "myPassword"
} ]
These dictionaries describe the options that can be used to control the offer/answer creation process.
In some cases, an RTCPeerConnection
may wish to
receive video but not send any video. The
RTCPeerConnection
needs to know if it should signal to
the remote side whether it wishes to receive video or not. This
option allows an application to indicate its preferences for the
number of video streams to receive when creating an offer.
In some cases, an RTCPeerConnection
may wish to
receive audio but not send any audio. The
RTCPeerConnection
needs to know if it should signal to
the remote side whether it wishes to receive audio. This option
allows an application to indicate its preferences for the number of
audio streams to receive when creating an offer.
Many codecs and system are capable of detecting "silence" and changing their behavior in this case by doing things such as not transmitting any media. In many cases, such as when dealing with emergency calling or sounds other than spoken voice, it is desirable to be able to turn off this behavior. This option allows the application to provide information about whether it wishes this type of processing enabled or disabled.
When the value of this dictionary member is true, the
generated description will have ICE credentials that are different
from the current credentials (as visible in the
localDescription
attribute's SDP). Applying the
generated description will restart ICE.
When the value of this dictionary member is false, and the
localDescription
attribute has valid ICE
credentials, the generated description will have the same ICE
credentials as the current value from the
localDescription
attribute.
setIdentityProvider()
call has been made in JavaScript.
As this is the default value, an identity will be requested if and
only if the user has configured an IdP in some way.The general operation of the RTCPeerConnection is described in [[!RTCWEB-JSEP]].
Calling new RTCPeerConnection(configuration
)
creates an RTCPeerConnection
object.
The configuration has the information to find and access the servers used by ICE. There may be multiple servers of each type and any TURN server also acts as a STUN server.
An RTCPeerConnection
object has an associated
ICE agent [[!ICE]],
RTCPeerConnection signaling state, ICE gathering state, and ICE
connection state. These are initialized when the object is created.
An RTCPeerConnection
object has two associated
stream sets. A local streams set,
representing streams that are currently sent, and a remote streams set, representing streams
that are currently received with this
RTCPeerConnection
object. The stream sets are
initialized to empty sets when the
RTCPeerConnection
object is created.
When the RTCPeerConnection()
constructor
is invoked, the user agent MUST run the following steps:
Validate the RTCConfiguration
argument by
running the steps defined by the updateIce() method.
Let connection be a newly created
RTCPeerConnection
object.
Create an ICE Agent as defined in [[!ICE]] and let
connection's RTCPeerConnection
ICE Agent be
that ICE Agent and provide it the the ICE servers list. The ICE Agent will proceed
with gathering as soon as the ICE
transports setting is not set to none
. At this
point the ICE Agent does not know how many ICE components it needs
(and hence the number of candidates to gather), but it can make a
reasonable assumption such as 2. As the
RTCPeerConnection
object gets more information, the
ICE Agent can adjust the number of components.
Set connection's RTCPeerConnection
signalingState to stable
.
Set connection's RTCPeerConnection
ice connection state to new
.
Set connection's RTCPeerConnection
ice gathering state to new
.
Initialize an internal variable to represent a queue of
operations
with an empty set.
Return connection.
Once the RTCPeerConnection object has been initialized, for every
call to createOffer
, setLocalDescription
,
createAnswer
and setRemoteDescription
;
execute the following steps:
Append an object representing the current call being handled
(i.e. function name and corresponding arguments) to the
operations
array.
If the length of the operations
array is exactly 1,
execute the function from the front of the queue
asynchronously.
When the asynchronous operation completes (either successfully
or with an error), remove the corresponding object from the
operations
array. After removal, if the array is
non-empty, execute the first object queued asynchronously and
repeat this step on completion.
The general idea is to have only one among createOffer
,
setLocalDescription
, createAnswer
and
setRemoteDescription
executing at any given time. If
subsequent calls are made while one of them is still executing, they
are added to a queue and processed when the previous operation is fully
completed. It is valid, and expected, for normal error handling
procedures to be applied.
Additionally, during the lifetime of the RTCPeerConnection object, the following procedures are followed when an ICE event occurs:
If the RTCPeerConnection
ice gathering state is new
and the ICE transports setting is not
set to none
, the user agent MUST
queue a task to start gathering ICE addresses and set the ice gathering state
to gathering
.
If the ICE Agent has found one or more candidate pairs for each MediaStreamTrack that forms a valid connection, the ICE connection state is changed to "connected".
When the ICE Agent finishes checking all candidate pairs, if at
least one connection has been found for each MediaStreamTrack, the
RTCPeerConnection
ice connection state is changed to "completed"; otherwise
"failed".
When the ICE Agent needs to notify the script about the candidate gathering progress, the user agent must queue a task to run the following steps:
Let connection be the
RTCPeerConnection
object associated with this
ICE Agent.
If connection's RTCPeerConnection
signalingState is closed
, abort these steps.
If the intent of the ICE Agent is to notify the script that:
A new candidate is available.
Add the candidate to connection's
localDescription
and create a
RTCIceCandidate
object to represent the
candidate. Let newCandidate be that object.
The gathering process is done.
Set connection's ice gathering
state to completed
and let
newCandidate be null.
Fire a icecandidate event named icecandidate
with
newCandidate at connection.
User agents negotiate the codec resolution, bitrate, and other media
parameters. It is RECOMMENDED that user agents initially negotiate for
the maximum resolution of a video stream. For streams that are then
rendered (using a video
element), it is RECOMMENDED that
user agents renegotiate for a resolution that matches the rendered
display size.
The word "components" in this context refers to an RTP media flow and does not have anything to do with how [[ICE]] uses the term "component".
When a user agent has reached the point where a
MediaStream
can be created to represent incoming
components, the user agent MUST run the following steps:
Let connection be the
RTCPeerConnection
expecting this media.
Create a MediaStream
object
stream, to represent the incoming media stream.
Run the algorithm to represent an incoming component with a track for each incoming component.
The creation of new incoming
MediaStream
s may be triggered either by SDP
negotiation or by the receipt of media on a given flow.
Queue a task to run the following substeps:
If the connection's RTCPeerConnection
signalingState is closed
, abort these
steps.
Add stream to connection's remote streams set.
Fire a stream event named
addstream
with
stream at the connection
object.
When a user agent has negotiated media for a component that belongs
to a media stream that is already represented by an existing
MediaStream
object, the user agent MUST associate
the component with that MediaStream
object.
When an RTCPeerConnection
finds that a stream
from the remote peer has been removed, the user agent MUST follow these
steps:
Let connection be the
RTCPeerConnection
associated with the stream
being removed.
Let stream be the MediaStream
object that represents the media stream being removed, if any. If
there isn't one, then abort these steps.
By definition, stream is now ended.
A task is thus queued to update stream and fire an event.
Queue a task to run the following substeps:
If the connection's RTCPeerConnection
signalingState is closed
, abort these
steps.
Remove stream from connection's remote streams set.
Fire a stream event named
removestream
with
stream at the connection
object.
The task source for the tasks listed in this section is the networking task source.
If something in the browser changes that causes the
RTCPeerConnection
object to need to initiate a new
session description negotiation, a negotiationneeded
event is fired at the
RTCPeerConnection
object.
In particular, if an RTCPeerConnection
object is
consuming a MediaStream
on
which a track is added, by, e.g., the addTrack()
method being invoked, the RTCPeerConnection
object
MUST fire the "negotiationneeded" event. Removal of media components
must also trigger "negotiationneeded".
To prevent network sniffing from allowing a fourth party to establish a connection to a peer using the information sent out-of-band to the other peer and thus spoofing the client, the configuration information SHOULD always be transmitted using an encrypted connection.
The createOffer method generates a blob of SDP that contains an
RFC 3264 offer with the supported configurations for the session,
including descriptions of the local MediaStream
s
attached to this RTCPeerConnection
, the codec/RTP/RTCP
options supported by this implementation, and any candidates that
have been gathered by the ICE Agent. The options parameter may
be supplied to provide additional control over the offer generated.
As an offer, the generated SDP will contain the full set of capabilities supported by the session (as opposed to an answer, which will include only a specific negotiated subset to use); for each SDP line, the generation of the SDP must follow the appropriate process for generating an offer. In the event createOffer is called after the session is established, createOffer will generate an offer that is compatible with the current session, incorporating any changes that have been made to the session since the last complete offer-answer exchange, such as addition or removal of streams. If no changes have been made, the offer will include the capabilities of the current local description as well as any additional capabilities that could be negotiated in an updated offer.
Session descriptions generated by createOffer MUST be immediately usable by setLocalDescription without causing an error as long as setLocalDescription is called within the successCallback function. If a system has limited resources (e.g. a finite number of decoders), createOffer needs to return an offer that reflects the current state of the system, so that setLocalDescription will succeed when it attempts to acquire those resources. The session descriptions MUST remain usable by setLocalDescription without causing an error until at least end of the successCallback function. Calling this method is needed to get the ICE user name fragment and password.
If the RTCPeerConnection
is configured to generate
Identity assertions, then the session description SHALL contain an
appropriate assertion.
If this RTCPeerConnection
object is closed before
the SDP generation process completes, the USER agent MUST suppress
the result and not call any of the result callbacks.
If the SDP generation process completed successfully, the user
agent MUST queue a task to invoke successCallback with a
newly created RTCSessionDescription
object,
representing the generated offer, as its argument.
If the SDP generation process failed for any reason, the user
agent MUST queue a task to invoke failureCallback with
an DOMError
object of type TBD as its argument.
To Do: Discuss privacy aspects of this from a fingerprinting point of view - it's probably around as bad as access to a canvas :-)
The createAnswer method generates an [[!SDP]] answer with the supported configuration for the session that is compatible with the parameters in the remote configuration. Like createOffer, the returned blob contains descriptions of the local MediaStreams attached to this RTCPeerConnection, the codec/RTP/RTCP options negotiated for this session, and any candidates that have been gathered by the ICE Agent. The options parameter may be supplied to provide additional control over the generated answer.
As an answer, the generated SDP will contain a specific configuration that, along with the corresponding offer, specifies how the media plane should be established. The generation of the SDP must follow the appropriate process for generating an answer.
Session descriptions generated by createAnswer must be immediately usable by setLocalDescription without generating an error if setLocalDescription is called from the successCallback function. Like createOffer, the returned description should reflect the current state of the system. The session descriptions MUST remain usable by setLocalDescription without causing an error until at least the end of the successCallback function. Calling this method is needed to get the ICE user name fragment and password.
An answer can be marked as provisional, as described in
[[!RTCWEB-JSEP]], by setting the type
to
"pranswer"
.
If the RTCPeerConnection
is configured to generate
Identity assertions, then the session description SHALL contain an
appropriate assertion.
If this RTCPeerConnection
object is closed before
the SDP generation process completes, the USER agent MUST suppress
the result and not call any of the result callbacks.
If the SDP generation process completed successfully, the user
agent MUST queue a task to invoke successCallback with a
newly created RTCSessionDescription
object,
representing the generated answer, as its argument.
If the SDP generation process failed for any reason, the user
agent MUST queue a task to invoke failureCallback with
an DOMError
object of type TBD as its argument.
The setLocalDescription()
method instructs the RTCPeerConnection
to apply
the supplied RTCSessionDescription
as the local
description.
This API changes the local media state. In order to successfully
handle scenarios where the application wants to offer to change
from one media format to a different, incompatible format, the
RTCPeerConnection
must be able to
simultaneously support use of both the old and new local
descriptions (e.g. support codecs that exist in both descriptions)
until a final answer is received, at which point the
RTCPeerConnection
can fully adopt the new local
description, or rollback to the old description if the remote side
denied the change.
ISSUE: how to indicate to rollback?
To Do: specify what parts of the SDP can be changed between the createOffer and setLocalDescription
When the method is invoked, the user agent must follow the processing model described by the following list:
If this RTCPeerConnection
object's
signaling
state is closed
, the user agent MUST throw an
InvalidStateError
exception and abort this
operation.
If a local description contains a different set of ICE
credentials, then the ICE Agent MUST trigger an ICE restart.
When ICE restarts, the gathering state will be changed back to
"gathering", if it was not already gathering. If the
RTCPeerConnection
ice connection state was "completed", it will be changed
back to "connected".
If the process to apply the
RTCSessionDescription
argument fails for
any reason, then user agent must queue a task runs the
following steps:
Let connection be the
RTCPeerConnection
object on with this
method was invoked.
If connection's signaling state
is closed
, then abort these steps.
If the reason for the failure is:
The content of the
RTCSessionDescription
argument is
invalid or the type
is
wrong for the current signaling
state of connection.
Let errorType be
InvalidSessionDescriptionError
.
The RTCSessionDescription
is a
valid description but cannot be applied at the media
layer.
TODO ISSUE - next few points are probably wrong. Make sure to check this in setRemote too.
This can happen, e.g., if there are insufficient resources to apply the SDP. The user agent MUST then rollback as necessary if the new description was partially applied when the failure occurred.
If rollback was not necessary or was completed
successfully, let errorType be
IncompatibleSessionDescriptionError
. If
rollback was not possible, let errorType be
InternalError
and set
connection's signaling
state to closed
.
Invoke the failureCallback with an
DOMError
object, whose name
attribute is errorType, as its argument.
If the RTCSessionDescription
argument is
applied successfully, then user agent must queue a task runs
the following steps:
Let connection be the
RTCPeerConnection
object on with this
metod was invoked.
If connection's signaling state
is closed
, then abort these steps.
Set connection's description attribute
(localDescription
or
remoteDescription
depending on the
setting operation) to the
RTCSessionDescription
argument.
If the local description was set,
connection's ice gathering
state is new
, and the local description
contains media, then set connection's ice gathering
state to gathering
.
If the local description was set with content that
caused an ICE restart, then set connection's
ice
gathering state to gathering
.
Set connection's signalingState accordingly.
If connection's signalingState
changed, fire a simple event named signalingstatechange
at connection.
Queue a new task that, if connection's
signalingState is
not closed
, invokes the
successCallback.
The localDescription
attribute MUST return the RTCSessionDescription
that was most recently passed to setLocalDescription()
,
plus any local candidates that have been generated by the ICE Agent
since then.
A null object will be returned if the local description has not yet been set.
The setRemoteDescription()
method instructs the RTCPeerConnection
to apply
the supplied RTCSessionDescription
as the
remote offer or answer. This API changes the local media state.
When the method is invoked, the user agent must follow the processing model of setLocalDescription()
,
with the following additional conditions:
If an a=identity
attribute is present in the
session description, the browser validates the identity
assertion.. Identity validation completes asynchronously
and does not block the completion of
setRemoteDescription
, unless there is a target peer identity.
The target peer identity
cannot be changed once set. Once set, if a different value is
provided, the user agent MUST throw an
InvalidStateError
exception and abort this
operation.
If the "peerIdentity" configuration is applied to the
RTCPeerConnection
, this establishes a target peer identity.
Alternatively, if the RTCPeerConnection
has
previously authenticated the identity of the peer (that is,
there is a current value for peerIdentity
),
then this also establishes a target peer identity.
If there is a target peer
identity, then setRemoteDescription
fails
unless it contains an identity assertion that matches the target peer identity. The
RTCPeerConnection
MAY be closed if the
validated peer identity does not match the target peer identity.
The remoteDescription
attribute MUST return the RTCSessionDescription
that was most recently passed to setRemoteDescription()
,
plus any remote candidates that have been supplied via
addIceCandidate()
since then.
A null object will be returned if the remote description has not yet been set.
The signalingState
attribute MUST return the RTCPeerConnection
object's RTCPeerConnection
signaling state.
The updateIce method updates the ICE Agent process of gathering local candidates and pinging remote candidates.
This call may result in a change to the state of the ICE Agent, and may result in a change to media state if it results in connectivity being established.
When the updateIce()
method is invoked, the user MUST run the following steps to
process the RTCConfiguration
dictionary:
If the iceTransports member is present, let its value be the ICE Agent's ICE transports setting.
If the iceTransports member was omitted and the ICE Agent's ICE transports setting is unset, set the ICE Agent's ICE transports setting to the iceTransports dictionary member default value.
If the iceServers dictionary
member is present, but its value is an empty list, then throw
an InvalidAccessError
and abort these steps. If
the list, on the other hand, has elements, each element must be
validated by running the following sub-steps:
Let server be the current list element.
If the server.urls dictionary member is
omitted or an empty list, then throw an
InvalidAccessError
and abort these steps.
If server.urls is a string, let urls be a list consisting of just that string. Otherwise, let urls refer to the server.urls list.
For each url in urls, parse the url and
obtain scheme name. If the parsing fails or if
scheme name is not implemented by the browser,
throw a SyntaxError
and abort these steps.
If scheme name is "turn" and either of the
dictionary members server.username or
server.credential are omitted, then throw an
InvalidAccessError
and abort these steps.
After passing the validation, let the iceServers dictionary member be the ICE Agent's ICE servers list.
If a new list of servers replaces the ICE Agent's existing
ICE servers list, no action will taken until the
RTCPeerConnection
's ice gathering
state transitions to gathering
. If a script
wants this to happen immediately, it should do an ICE restart.
If the iceServers dictionary
member was omitted, and the ICE Agent's ICE servers list is unset, throw an
InvalidAccessError
and abort these steps.
The addIceCandidate()
method provides a remote candidate to the ICE Agent. In addition to
being added to the remote description, connectivity checks will be
sent to the new candidates as long as the ICE Transports setting is not
set to none
. This call will result in a change
to the connection state of the ICE Agent, and may result in a
change to media state if it results in different connectivity being
established.
If the candidate parameter is malformed, throw a
SyntaxError
exception and abort these steps.
If the candidate is successfully applied, the user agent MUST queue a task to invoke successCallback.
If the candidate could not be successfully applied, the user
agent MUST queue a task to invoke failureCallback with a
DOMError
object whose name
attribute has
the value TBD (TODO InvalidCandidate and InvalidMidIndex).
The iceGatheringState
attribute MUST return the gathering state of the RTCPeerConnection
ICE
Agent.
The iceConnectionState
attribute MUST return the state of the RTCPeerConnection
ICE
Agent ICE state.
Returns a RTCConfiguration
object representing the current configuration of this RTCPeerConnection
object.
When this method is call, the user agent MUST construct new RTCConfiguration
object to be returned, and initialize it using the ICE Agent's ICE transports setting and ICE servers list.
Returns a sequence of MediaStream
objects
representing the streams that are currently sent with this
RTCPeerConnection
object.
The getLocalStreams()
method MUST return a new sequence that represents a snapshot of all
the MediaStream
objects in this
RTCPeerConnection
object’s local streams set. The conversion from the
streams set to the sequence, to be returned, is user agent defined
and the order does not have to stable between calls.
Returns a sequence of MediaStream
objects
representing the streams that are currently received with this
RTCPeerConnection
object.
The getRemoteStreams()
method MUST return a new sequence that represents a snapshot of all
the MediaStream
objects in this
RTCPeerConnection
object’s remote streams set. The conversion from
the streams set to the sequence, to be returned, is user agent
defined and the order does not have to stable between calls.
If a MediaStream
object, with an
id
equal to streamId, exists in this
RTCPeerConnection
object’s stream sets
(local streams set or remote streams set), then the getStreamById()
method MUST return that MediaStream
object. The
method MUST return null if no stream matches the
streamId argument.
For this method to make sense, we need to make sure that ids are unique within the two stream sets of a RTCPeerConnection. This is not the case today when a peer re-adds a stream that is received. Two different stream instances will now have the same id at both peers; one in the remote stream set and one in the local stream set.
One way to resolve this is to not allow re-adding a stream instance that is received (guard on id). If an application really needs this functionality it's really easy to make a clone of the stream, which will give it a new id, and send the clone.
Adds a new stream to the RTCPeerConnection.
When the addStream()
method is invoked, the user agent MUST
run the following steps:
Let connection be the
RTCPeerConnection
object on which the
MediaStream
, stream, is to be
added.
If connection's RTCPeerConnection
signalingState is closed
, throw an
InvalidStateError
exception and abort these
steps.
If stream is already in connection's local streams set, then abort these steps.
Add stream to connection's local streams set.
A stream could have contents that are inaccessible to the application. This can be due to being marked with a peerIdentity option or anything that would make a stream CORS cross-origin. These streams can be added to the local streams set but content MUST NOT be transmitted, though streams marked with peerIdentity can be transmitted if they meet the requirements for sending (see ) .
All other streams that are not accessible to the application MUST NOT be sent to the peer, with silence (audio), black frames (video) or equivalently absent content being sent in place of stream content.
Note that this property can change over time.
If connection's RTCPeerConnection
signalingState is stable
, then fire a negotiationneeded event at
connection.
Removes the given stream from the
RTCPeerConnection
.
When the other peer stops sending a stream in this manner, a
removestream
event is
fired at the RTCPeerConnection
object.
When the removeStream()
method is invoked, the user agent
MUST run the following steps:
Let connection be the
RTCPeerConnection
object on which the
MediaStream
, stream, is to be
removed.
If connection's RTCPeerConnection
signalingState is closed
, throw an
InvalidStateError
exception.
If stream is not in connection's local streams set, then abort these steps.
Remove stream from connection's local streams set.
If connection's RTCPeerConnection
signalingState is stable
, then fire a negotiationneeded event at
connection.
When the RTCPeerConnection close()
method is invoked, the
user agent MUST run the following steps:
RTCPeerConnection
object's RTCPeerConnection
signalingState
is closed
, abort these steps.
Destroy the RTCPeerConnection
ICE Agent, abruptly ending any active ICE processing and
any active streaming, and releasing any relevant resources
(e.g. TURN permissions).
Set the object's RTCPeerConnection
signalingState to closed
.
negotiationneeded
, MUST be supported
by all objects implementing the RTCPeerConnection
interface.icecandidate
, MUST be supported by
all objects implementing the RTCPeerConnection
interface.signalingstatechange
, MUST
be supported by all objects implementing the
RTCPeerConnection
interface. It is called any
time the readyState
changes, i.e., from a call to
setLocalDescription
, a call to
setRemoteDescription
, or code. It does not fire for the
initial state change into new
.addstream
, MUST be fired by
all objects implementing the RTCPeerConnection
interface. It is called any time a MediaStream
is added
by the remote peer. This will be fired only as a result of
setRemoteDescription
. Onnaddstream happens as early as
possible after the setRemoteDescription
. This callback
does not wait for a given media stream to be accepted or rejected via
SDP negotiation.removestream
, MUST be
fired by all objects implementing the
RTCPeerConnection
interface. It is called any
time a MediaStream
is removed by the remote peer. This
will be fired only as a result of
setRemoteDescription
.iceconnectionstatechange
,
MUST be fired by all objects implementing the
RTCPeerConnection
interface. It is called any
time the RTCPeerConnection
ice connection state changes.A Window
object has a strong reference to any
RTCPeerConnection
objects created from the
constructor whose global object is that Window
object.
The non-normative peer state transitions are:
An example set of transitions might be:
Caller transition:
stable
have-local-offer
have-remote-pranswer
stable
closed
Callee transition:
stable
have-remote-offer
have-local-pranswer
stable
closed
failed
, and may trigger
intermittently (and resolve itself without action) on a flaky
network.States take either the value of any component or all components, as outlined below:
checking
occurs if ANY component has received a
candidate and can start checkingconnected
occurs if ALL components have established
a working connectioncompleted
occurs if ALL components have finalized
the running of their ICE processesfailed
occurs if ANY component has given up trying
to connectdisconnected
occurs if ANY component has failed
liveness checksclosed
occurs only if
RTCPeerConnection.close()
has been called.If a component is discarded as a result of signaling (e.g. RTCP mux
or BUNDLE), the state may advance directly from checking
to completed
.
Some example transitions might be:
new
new
, remote candidates received):
checking
checking
, found usable connection):
connected
checking
, gave up): failed
connected
, finished all checks):
completed
completed
, lost connectivity):
disconnected
new
closed
The non-normative ICE state transitions are:
Errors are indicated in two ways: exceptions and objects passed to
error callbacks. Exceptions are thrown to indicate invalid state and
other programming errors. For example when a method is called when the
RTCPeerConnection
is in an invalid state, or a
state in which that particular method is not allowed to be executed. In
all other cases, an error object MUST be provided to the error
callback.
RTCSessionDescription
at which the error was encountered.Ask the DOM team to extend their list with the following errors. The error names and their descriptions are directly copied from the old RTCErrorName enum and might need some adjustment before being added to the public list of errors.
The RTCSdpType enum describes the type of an
RTCSessionDescription
instance.
An RTCSdpType of "offer" indicates that a description should be treated as an [[!SDP]] offer.
An RTCSdpType of "pranswer" indicates that a description should be treated as an [[!SDP]] answer, but not a final answer. A description used as an SDP "pranswer" may be applied as a response to a SDP offer, or an update to a previously sent SDP "pranswer".
An RTCSdpType of "answer" indicates that a description should be treated as an [[!SDP]] final answer, and the offer-answer exchange should be considered complete. A description used as an SDP answer may be applied as a response to an SDP offer or as an update to a previously sent SDP "pranswer".
RTCSessionDescription()
constructor takes an optional dictionary argument,
descriptionInitDict, whose content is used to initialize
the new RTCSessionDescription
object. If a
dictionary key is not present in descriptionInitDict, the
corresponding attribute will be initialized to null. If the
constructor is run without the dictionary argument, all attributes
will be initialized to null. This class is a future extensible
carrier for the data contained in it and does not perform any
substantive processing.This class is a future extensible carrier for the data contained in it and does not perform any substantive processing.
RTCIceCandidate()
constructor
takes an optional dictionary argument, candidateInitDict,
whose content is used to initialize the new
RTCIceCandidate
object. If a dictionary key is
not present in candidateInitDict, the corresponding
attribute will be initialized to null. If the constructor is run
without the dictionary argument, all attributes will be initialized
to null.The icecandidate
event of the RTCPeerConnection uses
the RTCPeerConnectionIceEvent
interface.
Firing an
RTCPeerConnectionIceEvent
event named
e with an RTCIceCandidate
candidate means that an event with the name e,
which does not bubble (except where otherwise stated) and is not
cancelable (except where otherwise stated), and which uses the
RTCPeerConnectionIceEvent
interface with the
candidate
attribute set to the new ICE candidate, MUST be
created and dispatched at the given target.
The candidate
attribute is the
RTCIceCandidate
object with the new ICE
candidate that caused the event.
TODO
The Peer-to-peer Data API lets a web application send and receive generic application data peer-to-peer. The API for sending and receiving data models the behavior of WebSockets [[WEBSOCKETS-API]].
The Peer-to-peer data API extends the
RTCPeerConnection
interface as described below.
Creates a new RTCDataChannel
object with the
given label. The RTCDataChannelInit
dictionary
can be used to configure properties of the underlying channel such as
data reliability.
When the createDataChannel()
method is invoked, the user agent MUST run the following steps.
If the RTCPeerConnection
object’s RTCPeerConnection
signalingState is closed
, throw an
InvalidStateError
exception and abort these
steps.
Let channel be a newly created
RTCDataChannel
object.
Initialize channel's label
attribute to the value
of the first argument.
If the second (dictionary) argument is present, set
channel's ordered
, maxPacketLifeTime
,
maxRetransmits
,
protocol
,
negotiated
and id
attributes
to the values of their corresponding dictionary members (if
present in the dictionary).
If both the maxPacketLifeTime
and maxRetransmits
attributes are set (not null), then throw a
SyntaxError
exception and abort these steps.
If an attribute, either maxPacketLifeTime
or maxRetransmits
, has
been set to indicate unreliable mode, and that value exceeds the
maximum value supported by the user agent, the value must be set
to the user agents maximum value.
If id
attribute
is uninitialized (not set via the dictionary), initialize it to a
value generated by the user agent, according to the WebRTC
DataChannel Protocol specification, and skip to the next step.
Otherwise, if the value of the id
attribute is taken by an
existing RTCDataChannel
, throw a
ResourceInUse
exception and abort these steps.
Return channel and continue the following steps in the background.
Create channel's associated underlying data transport and configure it according to the relevant properties of channel.
datachannel
, MUST be supported by all
objects implementing the RTCPeerConnection
interface.The RTCDataChannel
interface represents a
bi-directional data channel between two peers. A
RTCDataChannel
is created via a factory method on an
RTCPeerConnection
object. The messages sent between
the browsers are described in [[!RTCWEB-DATA]] and [[!RTCWEB-DATA-PROTOCOL]].
There are two ways to establish a connection with
RTCDataChannel
. The first way is to simply create a
RTCDataChannel
at one of the peers with the
negotiated
RTCDataChannelInit
dictionary member unset or set to
its default value false. This will announce the new channel in-band and
trigger a RTCDataChannelEvent
with the corresponding
RTCDataChannel
object at the other peer. The second
way is to let the application negotiate the
RTCDataChannel
. To do this, create a
RTCDataChannel
object with the negotiated
RTCDataChannelInit
dictionary member set to true, and
signal out-of-band (e.g. via a web server) to the other side that it
should create a corresponding RTCDataChannel
with the
negotiated
RTCDataChannelInit
dictionary member set to true and
the same id
. This will
connect the two separately created RTCDataChannel
objects. The second way makes it possible to create channels with
asymmetric properties and to create channels in a declarative way by
specifying matching ids
.
Each RTCDataChannel
has an associated
underlying data transport that is used to transport actual
data to the other peer. The transport properties of the underlying
data transport, such as in order delivery settings and reliability
mode, are configured by the peer as the channel is created. The
properties of a channel cannot change after the channel has been created.
The actual wire protocol between the peers is specified by the WebRTC
DataChannel Protocol specification (TODO: reference needed).
A RTCDataChannel
can be configured to operate in
different reliability modes. A reliable channel ensures that the data is
delivered at the other peer through retransmissions. An unreliable
channel is configured to either limit the number of retransmissions
(maxRetransmits
) or
set a time during which transmissions (including retransmissions) are allowed (maxPacketLifeTime
).
These properties can not be used simultaneously and an attempt to do so
will result in an error. Not setting any of these properties results in a
reliable channel.
A RTCDataChannel
, created with createDataChannel()
or
dispatched via a RTCDataChannelEvent
, MUST initially
be in the connecting
state. When the
RTCDataChannel
object’s underlying data
transport is ready, the user agent MUST announce the RTCDataChannel
as
open.
When the user agent is to announce
a RTCDataChannel
as open, the user agent MUST queue a
task to run the following steps:
If the associated RTCPeerConnection
object's
RTCPeerConnection
signalingState is closed
, abort these steps.
Let channel be the RTCDataChannel
object to be announced.
Set channel's readyState
attribute to
open
.
Fire a simple event named open
at channel.
When an underlying data transport is to be announced (the other
peer created a channel with negotiated
unset or set
to false), the user agent of the peer that did not initiate the creation
process MUST queue a task to run the following steps:
If the associated RTCPeerConnection
object's
RTCPeerConnection
signalingState is closed
, abort these steps.
Let channel be a newly created
RTCDataChannel
object.
Let configuration be an information bundle received from the other peer as a part of the process to establish the underlying data transport described by the WebRTC DataChannel Protocol specification.
Initialize channel's label
, ordered
, maxPacketLifeTime
,
maxRetransmits
,
protocol
,
negotiated
and
id
attributes to their
corresponding values in configuration.
Set channel's readyState
attribute to
connecting
.
Fire a datachannel event named datachannel
with channel
at the RTCPeerConnection
object.
An RTCDataChannel
object's underlying data
transport may be torn down in a non-abrupt manner by running the
closing procedure. When
that happens the user agent MUST, unless the procedure was initiated by
the close()
method,
queue a task that sets the object's readyState
attribute to
closing
. This will eventually render the data transport closed.
When a RTCDataChannel
object's underlying data
transport has been closed, the
user agent MUST queue a task to run the following steps:
Let channel be the RTCDataChannel
object whose transport
was closed.
Set channel's readyState
attribute to
closed
.
If the transport was closed with an error, fire an NetworkError event at channel.
Fire a simple event named close
at
channel.
The RTCDataChannel.label
attribute represents a label that can be used to distinguish this
RTCDataChannel
object from other
RTCDataChannel
objects. Scripts are allowed to
create multiple RTCDataChannel
objects with the
same label. The attribute MUST return the value to which it was set
when the RTCDataChannel
object was created.
The RTCDataChannel.ordered
attribute returns true if the RTCDataChannel
is
ordered, and false if other of order delivery is allowed. The
attribute MUST be initialized to true by default and MUST return the
value to which it was set when the RTCDataChannel
was created.
The RTCDataChannel.maxPacketLifeTime
attribute returns the length of the time window (in milliseconds)
during which transmissions and retransmissions may occur in
unreliable mode, or null if unset. The attribute MUST be initialized
to null by default and MUST return the value to which it was set when
the RTCDataChannel
was created.
The RTCDataChannel.maxRetransmits
attribute returns the maximum number of retransmissions that are
attempted in unreliable mode, or null if unset. The attribute MUST be
initialized to null by default and MUST return the value to which it
was set when the RTCDataChannel
was created.
The RTCDataChannel.protocol
attribute returns the name of the sub-protocol used with this
RTCDataChannel
if any, or the empty string
otherwise. The attribute MUST be initialized to the empty string by
default and MUST return the value to which it was set when the
RTCDataChannel
was created.
The RTCDataChannel.negotiated
attribute returns true if this RTCDataChannel
was
negotiated by the application, or false otherwise. The attribute MUST
be initialized to false by default and MUST return the value to which
it was set when the RTCDataChannel
was
created.
The RTCDataChannel.id
attribute
returns the id for this RTCDataChannel
. The id
was either assigned by the user agent at channel creation time or
selected by the script. The attribute MUST return the value to which
it was set when the RTCDataChannel
was
created.
The RTCDataChannel.readyState
attribute represents the state of the RTCDataChannel
object. It MUST return the value to which the user agent last set it
(as defined by the processing model algorithms).
The bufferedAmount
attribute MUST return the number of bytes of application data (UTF-8
text and binary data) that have been queued using send()
but that, as of the last
time the event loop started executing a task, had not yet been
transmitted to the network. (This thus includes any text sent during
the execution of the current task, regardless of whether the user
agent is able to transmit text asynchronously with script execution.)
This does not include framing overhead incurred by the protocol, or
buffering done by the operating system or network hardware. If the
channel is closed, this attribute's value will only increase with
each call to the send()
method (the attribute does
not reset to zero once the channel closes).
open
, MUST be supported by all
objects implementing the RTCDataChannel
interface.error
, MUST be supported by all
objects implementing the RTCDataChannel
interface.close
, MUST be supported by all
objects implementing the RTCDataChannel
interface.Closes the RTCDataChannel
. It may be called
regardless of whether the RTCDataChannel
object
was created by this peer or the remote peer.
When the RTCDataChannel
close()
method is called, the user agent MUST run the
following steps:
Let channel be the
RTCDataChannel
object which is about to be
closed.
If channel's readyState
is
closing
or closed
, then abort these
steps.
Set channel's readyState
attribute to
closing
.
If the closing procedure
has not started yet, start it.
message
,MUST be supported by
all objects implementing the RTCDataChannel
interface.The binaryType
attribute
MUST, on getting, return the value to which it was last set. On
setting, the user agent must set the IDL attribute to the new value.
When a RTCDataChannel
object is created, the
binaryType
attribute MUST be initialized to the string "blob
".
This attribute controls how binary data is exposed to scripts. See the [[WEBSOCKETS-API]] for more information.
Run the steps described by the send()
algorithm with argument
type string
object.
Run the steps described by the send()
algorithm with argument
type Blob
object.
Run the steps described by the send()
algorithm with argument
type ArrayBuffer
object.
Run the steps described by the send()
algorithm with argument
type ArrayBufferView
object.
If set to false, data is allowed to be delivered out of order. The default value of true, guarantees that data will be delivered in order.
Limits the time during which the channel will transmit or retransmit data if not acknowledged. This value may be clamped if it exceeds the maximum value supported by the user agent.
Limits the number of times a channel will retransmit data if not successfully delivered. This value may be clamped if it exceeds the maximum value supported by the user agent..
Subprotocol name used for this channel.
The default value of false tells the user agent to announce the
channel in-band and instruct the other peer to dispatch a
corresponding RTCDataChannel
object. If set to
true, it is up to the application to negotiate the channel and create
a RTCDataChannel
object with the same
id
at the other
peer.
Overrides the default selection of id for this channel.
The send()
method is
overloaded to handle different data argument types. When any version of
the method is called, the user agent MUST run the following steps:
Let channel be the RTCDataChannel
object on which data is to be sent.
If channel’s readyState
attribute
is connecting
, throw an InvalidStateError
exception and abort these steps.
Execute the sub step that corresponds to the type of the methods argument:
string
object:
Let data be the result of converting the argument
object to a sequence of Unicode characters and increase the
bufferedAmount
attribute by the number of bytes needed to express
data as UTF-8.
Blob
object:
Let data be the raw data represented by the
Blob
object and increase the bufferedAmount
attribute by the size of data, in bytes.
ArrayBuffer
object:
Let data be the data stored in the buffer described
by the ArrayBuffer
object and increase the
bufferedAmount
attribute by the length of the ArrayBuffer
in
bytes.
ArrayBufferView
object:
Let data be the data stored in the section of the
buffer described by the ArrayBuffer
object that the
ArrayBufferView
object references and increase the
bufferedAmount
attribute by the length of the ArrayBufferView
in
bytes.
If channel’s underlying data transport is not
established yet, or if the closing procedure
has
started, then abort these steps.
Attempt to send data on channel’s underlying data transport; if the data cannot be sent, e.g. because it would need to be buffered but the buffer is full, the user agent MUST abruptly close channel’s underlying data transport with an error.
The user agent is attempting to establish the underlying data
transport. This is the initial state of a
RTCDataChannel
object created with createDataChannel()
.
The underlying data transport is established and
communication is possible. This is the initial state of a
RTCDataChannel
object dispatched as a part of a
RTCDataChannelEvent
.
The procedure
to close
down the underlying data transport has started.
The underlying data transport has been closed
or could not be
established.
The datachannel
event
uses the RTCDataChannelEvent
interface.
Firing a datachannel event named
e with a RTCDataChannel
channel means that an event with the name e, which
does not bubble (except where otherwise stated) and is not cancelable
(except where otherwise stated), and which uses the
RTCDataChannelEvent
interface with the channel
attribute set to
channel, MUST be created and dispatched at the given
target.
The channel
attribute
represents the RTCDataChannel
object associated
with the event.
TODO
A RTCDataChannel
object MUST not be garbage
collected if its
readyState
is connecting
and at least one event listener is
registered for open
events, message
events,
error
events, or close
events.
readyState
is open
and at least one event listener is registered
for message
events, error
events, or
close
events.
readyState
is closing
and at least one event listener is registered
for error
events, or close
events.
underlying data transport is established and data is queued to be transmitted.
In order to send DTMF (phone keypad) values across an
RTCPeerConnection
, the user agent needs to know which
MediaStreamTrack
on which
RTCPeerConnection
will carry the DTMF. This section
describes an interface on RTCPeerConnection
to
associate DTMF capability with a MediaStreamTrack
for
that RTCPeerConnection
. Details of how DTMF is sent to
the other peer are described in [[!RTCWEB-AUDIO]].
The Peer-to-peer DTMF API extends the
RTCPeerConnection
interface as described below.
The createDTMFSender() method creates an RTCDTMFSender
that references the given MediaStreamTrack. The MediaStreamTrack MUST
be an element of a MediaStream that's currently in the
RTCPeerConnection
object's local streams set; if not, throw an
InvalidParameter
exception and abort these steps.
An RTCDTMFSender
is created by calling the
createDTMFSender()
method on an
RTCPeerConnection
. This constructs an object that
exposes the functions required to send DTMF on the given
MediaStreamTrack
.
The canInsertDTMF
attribute MUST indicate if the RTCDTMFSender
is
capable of sending DTMF.
An RTCDTMFSender
object’s insertDTMF()
method
is used to send DTMF tones.
The tones parameter is treated as a series of characters. The characters 0 through 9, A through D, #, and * generate the associated DTMF tones. The characters a to d are equivalent to A to D. The character ',' indicates a delay of 2 seconds before processing the next character in the tones parameter. All other characters MUST be considered unrecognized.
The duration parameter indicates the duration in ms to use for each character passed in the tones parameters. The duration cannot be more than 6000 ms or less than 40 ms. The default duration is 100 ms for each tone.
The interToneGap parameter indicates the gap between tones. It MUST be at least 30 ms. The default value is 70 ms.
The browser MAY increase the duration and interToneGap times to cause the times that DTMF start and stop to align with the boundaries of RTP packets but it MUST not increase either of them by more than the duration of a single RTP audio packet.
ISSUE: How are invalid values handled?
When the insertDTMF()
method is invoked, the
user agent MUST run the following steps:
MediaStreamTrack
is not
connected to the associated RTCPeerConnection
,
return.canInsertDTMF
attribute is false, return.toneBuffer
attribute to
the value of the first argument, the duration
attribute to the
value of the second argument, and the interToneGap
attribute
to the value of the third argument.toneBuffer
contains any
unrecognized characters, throw an
InvalidCharacterError
exception and abort these
steps.toneBuffer
is an empty
string, return.duration
attribute is less
than 40, set it to 40. If, on the other hand, the value is greater
than 6000, set it to 6000.interToneGap
attribute
is less than 30, set it to 30.toneBuffer
is an
empty string, fire an event named tonechange
with an
empty string at the RTCDTMFSender
object
and abort these steps.toneBuffer
and let
that character be tone.duration
ms on the
associated RTP media stream, using the appropriate codec.duration
+
interToneGap
ms
from now that runs the steps labelled Playout
task.tonechange
with a
string consisting of tone at the
RTCDTMFSender
object.Calling insertDTMF()
with an empty
tones parameter can be used to cancel all tones queued to play after
the currently playing tone.
The track
attribute MUST return the
MediaStreamTrack
given as argument to the
createDTMFSender()
method.
This event handler uses the
RTCDTMFToneChangeEvent
interface to return the
character for each tone as it is played out. See
RTCDTMFToneChangeEvent
for details.
The toneBuffer
attribute MUST return a list of the tones remaining to be played out.
For the syntax, content, and interpretation of this list, see
insertDTMF
.
The duration
attribute
MUST return the current tone duration value. This value will be the
value last set via the insertDTMF()
method, or
the default value of 100 ms if insertDTMF()
was
called without specifying the duration.
The interToneGap
attribute MUST return the current value of the between-tone gap. This
value will be the value last set via the
insertDTMF()
method, or the default value of 70
ms if insertDTMF()
was called without specifying
the interToneGap.
The tonechange
event uses the
RTCDTMFToneChangeEvent
interface.
Firing a tonechange event named
e with a DOMString
tone means
that an event with the name e, which does not bubble (except
where otherwise stated) and is not cancelable (except where otherwise
stated), and which uses the RTCDTMFToneChangeEvent
interface with the tone
attribute set to
tone, MUST be created and dispatched at the given target.
The tone
attribute contains the character for the tone that has just begun
playout (see insertDTMF()
). If the value is the
empty string, it indicates that the previous tone has completed
playback.
TODO
The basic statistics model is that the browser maintains a set of
statistics referenced by a selector. The
selector may, for example, be a MediaStreamTrack
. For a
track to be a valid selector, it must be a member of a
MediaStream
that is sent or received by the
RTCPeerConnection
object on which the stats request
was issued. The calling Web application provides the selector to the
getStats()
method
and the browser emits (in the JavaScript) a set of statistics that it
believes is relevant to the selector.
The statistics returned are designed in such a way that repeated
queries can be linked by the RTCStats
id dictionary member. Thus, a Web application can
make measurements over a given time period by requesting measurements at
the beginning and end of that period.
The Statistics API extends the RTCPeerConnection
interface as described below.
Gathers stats for the given selector and reports the result asynchronously.
When the getStats()
method is
invoked, the user agent MUST queue a task to run the following
steps:
If the RTCPeerConnection
object's RTCPeerConnection
signalingState is closed
, throw an
InvalidStateError
exception.
Return, but continue the following steps in the background.
Let selectorArg be the methods first argument.
If selectorArg is an invalid selector, the user agent MUST queue a task to invoke the failure callback (the method's third argument).
Start gathering the stats indicated by selectorArg.
In case selectorArg is null, stats MUST be gathered
for the whole RTCPeerConnection
object.
When the relevant stats have been gathered, queue a task to
invoke the success callback (the method's second argument) with a
new RTCStatsReport
object, representing the
gathered stats, as its argument.
A RTCStatsReport
representing the gathered
stats.
The getStats()
method delivers a successful result in the form of a
RTCStatsReport
object. A
RTCStatsReport
object represents a map between
strings, identifying the inspected objects (RTCStats.id), and their corresponding
RTCStats
objects.
An RTCStatsReport
may be composed of several
RTCStats
objects, each reporting stats for one
underlying object that the implementation thinks is relevant for the
selector. One achieves the total for the
selector by summing over all the stats of a
certain type; for instance, if a MediaStreamTrack
is carried
by multiple SSRCs over the network, the
RTCStatsReport
may contain one RTCStats
object per SSRC (which can be distinguished by the value of the "ssrc"
stats attribute).
Getter to retrieve the RTCStats
objects that
this stats report is composed of.
The set of supported property names [[!WEBIDL]] is defined as the
ids of all the RTCStats
objects that has been
generated for this stats report. The order of the property names is
left to the user agent.
An RTCStats
dictionary represents the stats
gathered by inspecting a specific object relevant to a selector. The RTCStats
dictionary is a base type that specifies as set of default attributes,
such as timestamp and type. Specific stats are added by extending the
RTCStats
dictionary.
Note that while stats names are standardized, any given implementation may be using experimental values or values not yet known to the Web application. Thus, applications MUST be prepared to deal with unknown stats.
Statistics need to be synchronized with each other in order to yield
reasonable values in computation; for instance, if "bytesSent" and
"packetsSent" are both reported, they both need to be reported over the
same interval, so that "average packet size" can be computed as "bytes /
packets" - if the intervals are different, this will yield errors. Thus
implementations MUST return synchronized values for all stats in a
RTCStats
object.
The timestamp
,
of type DOMHiResTimeStamp
[[!HIGHRES-TIME]], associated
with this object. The time is relative to the UNIX epoch (Jan 1,
1970, UTC).
The type of this object.
The type
attribute
MUST be initialized to the name of the most specific type this
RTCStats
dictionary represents.
A unique id
that is
associated with the object that was inspected to produce this
RTCStats
object. Two RTCStats
objects, extracted from two different
RTCStatsReport
objects, MUST have the same id if
they were produced by inspecting the same underlying object. User
agents are free to pick any format for the id as long as it meets the
requirements above.
...
The remoteId
can be used to look up the corresponding
RTCStats
object that represents stats reported by
the other peer.
...
...
...
...
Consider the case where the user is experiencing bad sound and the application wants to determine if the cause of it is packet loss. The following example code might be used:
var baselineReport, currentReport; var selector = pc.getRemoteStreams()[0].getAudioTracks()[0]; pc.getStats(selector, function (report) { baselineReport = report; }, logError); // ... wait a bit setTimeout(function () { pc.getStats(selector, function (report) { currentReport = report; processStats(); }, logError); }, aBit); function processStats() { // compare the elements from the current report with the baseline for each (var now in currentReport) { if (now.type != "outbund-rtp") continue; // get the corresponding stats from the baseline report base = baselineReport[now.id]; if (base) { remoteNow = currentReport[now.remoteId]; remoteBase = baselineReport[base.remoteId]; var packetsSent = now.packetsSent - base.packetsSent; var packetsReceived = remoteNow.packetsReceived - remoteBase.packetsReceived; // if fractionLost is > 0.3, we have probably found the culprit var fractionLost = (packetsSent - packetsReceived) / packetsSent; } } } function logError(error) { log(error.name + ": " + error.message); }
WebRTC offers and answers (and hence the channels established by
RTCPeerConnection
objects) can be authenticated by
using a web-based Identity Provider (IdP). The idea is that the entity
sending the offer/answer acts as the Authenticating Party (AP) and obtains
an identity assertion from the IdP which it attaches to the offer/answer.
The consumer of the offer/answer (i.e., the
RTCPeerConnection
on which
setRemoteDescription()
is called) acts as the Relying Party
(RP) and verifies the assertion.
The interaction with the IdP is designed to decouple the browser from any particular identity provider; the browser need only know how to load the IdP's JavaScript—which is deterministic from the IdP's identity—and the generic protocol for requesting and verifying assertions. The IdP provides whatever logic is necessary to bridge the generic protocol to the IdP's specific requirements. Thus, a single browser can support any number of identity protocols, including being forward compatible with IdPs which did not exist at the time the browser was written. The generic protocol details are described in [[!RTCWEB-SECURITY-ARCH]]. This document specifies the procedures required to instantiate the IdP proxy, request identity assertions, and consume the results.
In order to communicate with the IdP, the browser instantiates an isolated interpreted context, effectively an invisible IFRAME. The initial contents of the context are loaded from a URI derived from the IdP's domain name, as described in [[!RTCWEB-SECURITY-ARCH]].
For purposes of generating assertions, the IdP shall be chosen as follows:
setIdentityProvider()
method has been called,
the IdP provided shall be used.setIdentityProvider()
method has not been
called, then the browser can use an IdP configured into the
browser.In order to verify assertions, the IdP domain name and protocol are
taken from the domain
and protocol
fields of
the identity assertion.
The browser creates an IdP proxy by loading an isolated, invisible
IFRAME with HTML content from the IdP URI. The URI for the IdP is a
well-known URI formed from the domain
and protocol
fields, as specified in [[!RTCWEB-SECURITY-ARCH]].
When an IdP proxy is requiured, the browser performs the following steps:
sandbox
attribute is set to
"allow-forms allow-scripts allow-same-origin" to limit the
capabilities available to the IdP. The browser MUST prevent the IdP
proxy from navigating the browsing context to a different location.
The browser MUST prevent the IdP proxy from interacting with the user
(this includes, in particular, popup windows and user dialogs).MessageChannel
[[!webmessaging]] within the context of
the IdP proxy and assigns one port from the channel to a variable
named rtcwebIdentityPort on the window. This
message channel forms the basis of communication between the browser
and the IdP proxy. Since it is an essential security property of the
web sandbox that a page is unable to insert objects into content from
another origin, this ensures that the IdP proxy can trust that
messages originating from window.rtcwebIdentityPort are
from RTCPeerConnection
and not some other page. This
protection ensures that pages from other origins are unable to
instantiate IdP proxies and obtain identity assertions.RTCPeerConnection
that it is ready by sending a "READY"
message to the message channel port [[!RTCWEB-SECURITY-ARCH]]. Once
this message is received by the RTCPeerConnection
, the
IdP is considered ready to receive requests to generate or verify
identity assertions.[TODO: This is not sufficient unless we expect the IdP to protect this information. Otherwise, the a=identity information can be copied from a session with "good" properties to any other session with the same fingerprint information. Since we want to reuse credentials, that would be bad.] The identity mechanism MUST provide an indication to the remote side of whether it requires the stream contents to be protected. Implementations MUST have an user interface that indicates the different cases and identity for these.
The identity assertion request process involves the following steps:
RTCPeerConnection
instantiates an IdP proxy as
described in Identity
Provider Selection section and waits
for the IdP to signal that it is ready.RTCPeerConnection
desires to be bound to the user's
identity.RTCPeerConnection
over the message channel.RTCPeerConnection
MAY store the identity assertion
for use with future offers or answers.createOffer()
or createAnswer()
, then the
assertion is inserted in the offer/answer SDP.The format and contents of the messages that are exchanged are described in detail in [[!RTCWEB-SECURITY-ARCH]].
The IdP proxy can return an "ERROR" response. If an error is
encountered, the browser MUST generate
an idpassertionerror
event. No "a=identity" attribute is added to SDP as a result.
The browser SHOULD limit the time that it will allow for this process. This includes both the loading of the IdP proxy and the identity assertion generation. Failure to do so potentially causes the corresponding operation to take an indefinite amount of time. This timer can be cancelled when the IdP produces a response. The timer running to completion can be treated as equivalent to an error from the IdP.
An IdP could respond to a request to generate an identity assertion with a "LOGINNEEDED" error. This indicates that the site does not have the necessary information available to it (such as cookies) to authorize the creation of an identity assertion.
The "LOGINNEEDED" response includes a URL for a page where the
authorization process can be completed. This URL is exposed to the
application through the loginUrl
attribute
of the idpassertionerror event.
This URL might be to a page where a user is able to enter their (IdP)
username and password, or otherwise provide any information the IdP
needs to authorize a assertion request.
An application can load the login URL in an IFRAME or popup; the resulting page then provides the user with an opportunity to provide information necessary to complete the authorization process.
Once the authorization process is complete, the page loaded in the IFRAME or popup sends a message using postMessage [[!webmessaging]] to the page that loaded it (through the window.opener attribute for popups, or through window.parent for pages loaded in an IFRAME). The message MUST be the DOMString "LOGINDONE". This message informs the application that another attempt at generating an identity assertion is likely to be successful.
Identity assertion validation happens
when setRemoteDescription
is invoked on RTCPeerConnection
. The process runs
asynchronously, meaning that validation of an identity assertion does not
block the completion of setRemoteDescription
.
The identity assertion request process involves the following steps:
RTCPeerConnection
instantiates an IdP proxy as
described in Identity
Provider Selection section and waits
for the IdP to signal that it is ready.RTCPeerConnection
over the message channel.RTCPeerConnection
validates that the fingerprint
provided by the IdP in the validation response matches the certificate
fingerprint that is, or will be, used for communications. This is either by:
RTCPeerConnection
validates that the domain portion
of the identity matches the domain of the IdP as described in [[!RTCWEB-SECURITY-ARCH]].RTCPeerConnection
stores the assertion in the
peerIdentity
attribute and raises a simple event
named peeridentity at itself. The assertion
information to be displayed MUST contain the domain name of the IdP as
provided in the assertion.The IdP might fail to validate the identity assertion by providing an "ERROR" response to the validation request. Validation can also fail due to the additional checks performed by the browser. In both cases, the process terminates and no identity information is exposed to the application or the user.
The browser MUST raise an idpvalidationerror
event if
validation of an identity assertion fails for any reason.
If the "peerIdentity" constraint is applied to the
RTCPeerConnection
, any error MUST
cause setRemoteDescription
to fail.
The browser SHOULD limit the time that it will allow for this process. This includes both the loading of the IdP proxy and the identity assertion validation. Failure to do so potentially causes the corresponding operation to take an indefinite amount of time. This timer can be cancelled when the IdP produces a response. The timer running to completion can be treated as equivalent to an error from the IdP.
It is possible that different values for the "a=identity" attribute is provided at a media level in SDP. A browser MAY either choose to treat this as an error or ignore the attribute. If multiple different assertions are validated, then they MUST produce identical identity values.
The format and contents of the messages that are exchanged are described in detail in [[!RTCWEB-SECURITY-ARCH]].
The Identity API extends the RTCPeerConnection
interface as described below.
Sets the identity provider to be used for a given
RTCPeerConnection
object. Applications need not make
this call; if the browser is already configured for an IdP, then that
configured IdP will be used to get an assertion.
When the setIdentityProvider()
method is invoked, the user agent MUST run the following steps:
If the connection's RTCPeerConnection
signalingState is closed
, throw an
InvalidStateError
exception and abort these
steps.
Set the current identity provider values to the triplet
(provider
, protocol
,
username
).
If any identity provider value has changed, discard any stored identity assertion.
Identity provider information is not used until an identity
assertion is required, either in response to a call to
getIdentityAssertion
, or the need to generate SDP with
either createOffer
or createAnswer
.
Initiates the process of obtaining an identity assertion.
Applications need not make this call. It is merely intended to allow
them to start the process of obtaining identity assertions before a
call is initiated. If an identity is needed, either because the
browser has been configured with a default identity provider or
because the setIdentityProvider()
method was called,
then an identity will be automatically requested when an offer or
answer is created.
When getIdentityAssertion
is invoked, queue a task to
run the following steps:
If the connection's RTCPeerConnection
signalingState is closed
, abort these steps.
Request an identity assertion from the IdP.
Contains the peer identity assertion information if an identity assertion was provided and verified. Once this value is set to a non-null value, it cannot change.
identityresult
, MUST be fired by all
objects implementing the RTCPeerConnection
interface. This event is fired when an identity assertion is
successfully generated. Note: this event is fired when an identity
assertion is generated during the creation of an offer or answer.peeridentity
MUST be fired when a
peer identity from a peer is successfully validated.idpassertionerror
MUST be
fired when an IdP encounters an error in generating an identity
assertion.idvalidationperror
MUST be
fired when an IdP encounters an error in validating an identity
assertion.A domain name representing the identity provider.
An RFC5322-conformant [[RFC5322]] representation of the verified peer identity. This identity will have been verified via the procedures described in [RTCWEB-SECURITY-ARCH].
The RTCIdentiytEvent
is raised when an IdP
produces an identity assertion.
A string containing the encoded identity assertion (the information that would be added to the "a=identity" line in SDP [[!RTCWEB-SECURITY-ARCH]]).
The RTCIdentityErrorEvent
is raised when an
IdP fails to successfully produce an identity assertion.
The identity system is designed so that applications need not take any special action in order for users to generate and verify identity assertions; if a user has configured an IdP into their browser, then the browser will automatically request/generate assertions and the other side will automatically verify them and display the results. However, applications may wish to exercise tighter control over the identity system as shown by the following examples.
This example shows how to configure the identity provider and protocol.
pc.setIdentityProvider("example.com", "default", "alice@example.com");
This example shows how to consume identity assertions inside a Web application.
pc.onpeeridentity = function(e) { console.log("IdP= " + e.target.peerIdentity.idp + " identity=" + e.target.peerIdentity.name); };
The MediaStream
interface, as defined in the
[[!GETUSERMEDIA]] specification, typically represents a stream of data of
audio and/or video. A MediaStream
may be extended to
represent a stream that either comes from or is sent to a remote node (and
not just the local camera, for instance). The extensions required to
enable this capability on the MediaStream
object will be
described in this section. How the media is transmitted to the peer is
described in [[!RTCWEB-RTP]], [[!RTCWEB-AUDIO]], and
[[!RTCWEB-TRANSPORT]].
A MediaStream
as defined in [[!GETUSERMEDIA]] may contain
zero or more MediaStreamTrack
objects. A
MediaStreamTrack
sent to another peer will appear as one and
only one MediaStreamTrack
to the recipient. A peer is
defined as a user agent that supports this specification.
Channels are the smallest unit considered in the
MediaStream
specification. Channels are intended to be
encoded together for transmission as, for instance, an RTP payload type.
All of the channels that a codec needs to encode jointly MUST be in the
same MediaStreamTrack
and the codecs SHOULD be able to
encode, or discard, all the channels in the track.
The concepts of an input and output to a given
MediaStream
apply in the case of MediaStream
objects transmitted over the network as well. A
MediaStream
created by an
RTCPeerConnection
object (described later in this
document) will take as input the data received from a remote peer.
Similarly, a MediaStream
from a local source, for instance a
camera via [[!GETUSERMEDIA]], will have an output that represents what is
transmitted to a remote peer if the object is used with an
RTCPeerConnection
object.
The concept of duplicating MediaStream
objects as
described in [[!GETUSERMEDIA]] is also applicable here. This feature can
be used, for instance, in a video-conferencing scenario to display the
local video from the user’s camera and microphone in a local monitor,
while only transmitting the audio to the remote peer (e.g. in response to
the user using a "video mute" feature). Combining tracks from different
MediaStream
objects into a new
MediaStream
is useful in certain situations.
In this document, we only specify aspects of the
following objects that are relevant when used along with an
RTCPeerConnection
. Please refer to the original
definitions of the objects in the [[!GETUSERMEDIA]] document for general
information on using MediaStream
and
MediaStreamTrack
.
The id
attribute
specified in MediaStream
returns an id that is unique to
this stream, so that streams can be recognized after they are sent
through the RTCPeerConnection
API.
When a MediaStream
is
created to represent a stream obtained from a remote peer, the
id
attribute is initialized from information provided by the remote
source.
The id of a MediaStream
object is
unique to the source of the stream, but that does not mean it is not
possible to end up with duplicates. For example, a locally generated
stream could be sent from one user agent to a remote peer using
RTCPeerConnection
and then sent back to the
original user agent in the same manner, in which case the original user
agent will have multiple streams with the same id (the
locally-generated one and the one received from the remote peer).
A new media track may be associated with an existing
MediaStream
. For example, if a remote peer adds a
new MediaStreamTrack
object to a
MediaStream
that is being sent over an
RTCPeerConnection
, this is observed on the local
user agent. If this happens for the reason exemplified, or for any
other reason than the addTrack()
method being invoked locally on a MediaStream
or
tracks being added as the stream is created (i.e. the stream is
initialized with tracks), the user agent MUST run the following
steps:
Let stream be the target
MediaStream
object.
Represent component with track: Run the following steps to create a track representing the incoming component:
Create a MediaStreamTrack
object
track to represent the component.
Initialize track’s kind
attribute to "audio
" or "video
"
depending on the media type of the incoming component.
Initialize track’s id
attribute to the component track id.
Initialize track’s label
attribute to "remote audio
" or "remote
video
" depending on the media type of the incoming
component.
Initialize track’s readyState
attribute to muted
.
Add track to stream’s track set.
Fire a track event named addtrack
with the newly created MediaStreamTrack
object
at stream.
An existing media track may also be disassociated from a
MediaStream
. If this happens for any other reason
than the removeTrack()
method being invoked locally on a MediaStream
or
the stream being destroyed, the user agent MUST run the following
steps:
Let stream be the target
MediaStream
object.
Let track be the MediaStreamTrack
object representing the media component about to be removed.
Remove track from stream’s track set.
Fire a track event named removetrack
with track at stream.
The event source for the onended
event in the networked
case is the RTCPeerConnection
object.
A MediaStreamTrack
object’s reference to its
MediaStream
in the non-local media source case (an RTP
source, as is the case for a MediaStream
received over an
RTCPeerConnection
) is always strong.
When a track belongs to a MediaStream
that comes
from a remote peer and the remote peer has permanently stopped sending
data the ended
event MUST be fired on the track, as
specified in [[!GETUSERMEDIA]].
ISSUE: How do you know when it has stopped? This seems like an SDP question, not a media-level question.
A track in a MediaStream
, received with an
RTCPeerConnection
, MUST have its
readyState
attribute [[!GETUSERMEDIA]] set to
muted
until media data arrives.
In addition, a MediaStreamTrack
has its
readyState
set to muted
on the remote peer if
the local user agent disables the corresponding
MediaStreamTrack
in the
MediaStream
that is being sent. When the addstream
event triggers on an RTCPeerConnection
, all
MediaStreamTrack
objects in the resulting
MediaStream
are muted until media data can be read
from the RTP source.
ISSUE: How do you know when it has been disabled? This seems like an SDP question, not a media-level question.
The addstream
and removestream
events use the
MediaStreamEvent
interface.
Firing a
stream event named e with a
MediaStream
stream means that an event
with the name e, which does not bubble (except where otherwise
stated) and is not cancelable (except where otherwise stated), and which
uses the MediaStreamEvent
interface with the
stream
attribute
set to stream, MUST be created and dispatched at the
given target.
The stream
attribute
represents the MediaStream
object associated with
the event.
TODO
A MediaStream acquired using getUserMedia()
is, by
default, accessible to an application. This means that the application is
able to access the contents of tracks, modify their content, and send that
media to any peer it chooses.
WebRTC supports calling scenarios where media is sent to a specifically
identified peer, without the contents of media streams being accessible to
applications. This is enabled by use of the
peerIdentity
parameter to
getUserMedia()
.
An application willingly relinquishes access to media by including a
peerIdentity
parameter in the
MediaStreamConstraints
. This attribute is set to a
DOMString
containing the identity of a specific peer.
The MediaStreamConstraints
dictionary is
expanded to include the peerIdentity
parameter.
If set, peerIdentity
isolates media from the
application. Media can only be sent to the identified peer.
A user that is prompted to provide consent for access to a camera or
microphone can be shown the value of the peerIdentity
parameter, so that they can be informed that the consent is more narrowly
restricted.
When the peerIdentity
option is supplied to
getUserMedia()
, all of the MediaStreamTrack
s in
the resulting MediaStream
are isolated so that content is not
accessible to any application. Isolated MediaStreamTrack
s
can be used for two purposes:
Displayed in an appropriate media tag (e.g., a video or audio element). The browser MUST ensure that content is inaccessible to the application by ensuring that the resulting content is given the same protections as content that is CORS cross-origin, as described in the relevant Security and privacy considerations section of [[HTML5]].
Used as the argument to addStream()
on an RTCPeerConnection
instance, subject to the
restrictions in .
A MediaStreamTrack
that is added to another
MediaStream
remains isolated. When an isolated
MediaStreamTrack
is added to a MediaStream
with
a different peerIdentity, the MediaStream
gets a combination
of isolation restrictions. A MediaStream
containing
MediaStreamTrack
instances with mixed isolation properties
can be displayed, but cannot be sent using
RTCPeerConnection
.
Any peerIdentity
property MUST be retained on cloned
copies of MediaStreamTrack
s.
MediaStreamTrack
is expanded to include
an isolated attribute and a corresponding event. This allows
an application to quickly and easily determine whether a track is
accessible.
A MediaStreamTrack
is isolated (and the
corresponding isolated attribute set to true)
when content is inaccessible to the owning document. This occurs as
a result of setting the peerIdentity option. A track is
also isolated if it comes from a cross origin source.
This event handler, of type isolationchange, is fired when the value of the isolated attribute changes.
A MediaStreamTrack
with a peerIdentity option
set can be added to any RTCPeerConnection
. However,
the content of an isolated track MUST NOT be transmitted unless all of
the following constraints are met:
A MediaStreamTrack
from a stream acquired using the
peerIdentity option can be transmitted if the
RTCPeerConnection
has successfully validated the identity of
the peer AND that identity is the same identity that was used in the
peerIdentity option associated with the track. That is,
the name
attribute of the peerIdentity
attribute of the RTCPeerConnection
instance MUST
match the value of the peerIdentity
option passed to
getUserMedia()
.
Rules for matching identity are described in [[!RTCWEB-SECURITY-ARCH]].
The peer has indicated that it will respect the isolation properties of streams. That is, a DTLS connection with a promise to respect stream confidentiality, as defined in [[!WEBRTC-ALPN]] has been established.
Failing to meet these conditions means that no media can be sent for
the affected MediaStreamTrack
. Video MUST be replaced by
black frames, audio MUST be replaced by silence, and equivalently
information-free content MUST be provided for other media types.
Remotely sourced MediaStreamTrack
s MUST be isolated if
they are received over a DTLS connection that has been negotiated with
track isolation. This protects isolated media from the application in
the receiving browser. These tracks MUST only be displayed to a user
using the appropriate media element (e.g., <video> or
<audio>).
Any MediaStreamTrack
that has the peerIdentity option set causes all tracks
sent using the same RTCPeerConnection
to be isolated
at the receiving peer. All DTLS connections created for a
RTCPeerConnection
with isolated local streams MUST
be negotiated so that media remains isolated at the remote peer. This
causes non-isolated media to become isolated at the receiving peer if
any isolated tracks are added to the same
RTCPeerConnection
.
Tracks that are not bound to a particular peerIdentity do not cause other streams to be isolated, these tracks simply do not have their content transmitted.
If a stream becomes isolated after initially being accessible, or an isolated stream is added to an active session, then media for that stream is replaced by information-free content (e.g., black frames or silence).
Media isolation ensures that the content of
a MediaStreamTrack
is not accessible to web applications.
However, to ensure that media with a peerIdentity option set
can be sent to peers, some meta-information about the media will be
exposed to applications.
Applications will be able to observe the parameters of the media that
affect session negotiation and conversion into RTP. This includes the
codecs that might be supported by the track, the bitrate, the number of
packets, and the current settings that are set on the
MediaStreamTrack
.
In particular, the statistics
that RTCPeerConnection
records are not reduced in
capability. New statistics that might compromise isolation MUST be
avoided, or explicitly suppressed for isolated streams.
Most of these data are exposed to the network when the media is
transmitted. Only the settings for the MediaStreamTrack
present a new source of information. This can includes the frame rate
and resolution of video tracks, the bandwidth of audio tracks, and other
information about the source, which would not otherwise be revealed to a
network observer. Since settings don't change at a high frequency or in
response to changes in media content, settings only reveal limited
reveal information about the content of a track. However, any setting
that might change dynamically in response to the content of an
isolated MediaStreamTrack
MUST have changes suppressed.
When two peers decide they are going to set up a connection to each other, they both go through these steps. The STUN/TURN server configuration describes a server they can use to get things like their public IP address or to set up NAT traversal. They also have to send data for the signaling channel to each other using the same out-of-band mechanism they used to establish that they were going to communicate in the first place.
var signalingChannel = new SignalingChannel(); var configuration = { "iceServers": [{ "url": "stun:stun.example.org" }] }; var pc; // call start() to initiate function start() { pc = new RTCPeerConnection(configuration); // send any ice candidates to the other peer pc.onicecandidate = function (evt) { if (evt.candidate) signalingChannel.send(JSON.stringify({ "candidate": evt.candidate })); }; // let the "negotiationneeded" event trigger offer generation pc.onnegotiationneeded = function () { pc.createOffer(localDescCreated, logError); } // once remote stream arrives, show it in the remote video element pc.onaddstream = function (evt) { remoteView.srcObject = evt.stream; }; // get a local stream, show it in a self-view and add it to be sent navigator.mediaDevices.getUserMedia({ "audio": true, "video": true }, function (stream) { selfView.srcObject = stream; pc.addStream(stream); }, logError); } function localDescCreated(desc) { pc.setLocalDescription(desc, function () { signalingChannel.send(JSON.stringify({ "sdp": pc.localDescription })); }, logError); } signalingChannel.onmessage = function (evt) { if (!pc) start(); var message = JSON.parse(evt.data); if (message.sdp) pc.setRemoteDescription(new RTCSessionDescription(message.sdp), function () { // if we received an offer, we need to answer if (pc.remoteDescription.type == "offer") pc.createAnswer(localDescCreated, logError); }, logError); else pc.addIceCandidate(new RTCIceCandidate(message.candidate), function () {}, logError); }; function logError(error) { log(error.name + ": " + error.message); }
This example shows the more complete functionality.
TODO
This example shows how to create a
RTCDataChannel
object and perform the offer/answer
exchange required to connect the channel to the other peer. The
RTCDataChannel
is used in the context of a simple
chat application and listeners are attached to monitor when the channel
is ready, messages are received and when the channel is closed.
This example uses the negotiationneeded
event to initiate the offer/answer dialog. The exact behavior
surrounding the negotiationneeded
event is not specified
in detail at the moment. This example can hopefully help to drive that
discussion. An assumption made in this example is that the event only
triggers when a new negotiation should be started. This means that an
action (such as addStream()) that normally would have fired the
negotiationneeded
event will not do so during an ongoing
offer/answer dialog.
var signalingChannel = new SignalingChannel(); var configuration = { "iceServers": [{ "url": "stun:stun.example.org" }] }; var pc; var channel; // call start(true) to initiate function start(isInitiator) { pc = new RTCPeerConnection(configuration); // send any ice candidates to the other peer pc.onicecandidate = function (evt) { if (evt.candidate) signalingChannel.send(JSON.stringify({ "candidate": evt.candidate })); }; // let the "negotiationneeded" event trigger offer generation pc.onnegotiationneeded = function () { pc.createOffer(localDescCreated, logError); } if (isInitiator) { // create data channel and setup chat channel = pc.createDataChannel("chat"); setupChat(); } else { // setup chat on incoming data channel pc.ondatachannel = function (evt) { channel = evt.channel; setupChat(); }; } } function localDescCreated(desc) { pc.setLocalDescription(desc, function () { signalingChannel.send(JSON.stringify({ "sdp": pc.localDescription })); }, logError); } signalingChannel.onmessage = function (evt) { if (!pc) start(false); var message = JSON.parse(evt.data); if (message.sdp) pc.setRemoteDescription(new RTCSessionDescription(message.sdp), function () { // if we received an offer, we need to answer if (pc.remoteDescription.type == "offer") pc.createAnswer(localDescCreated, logError); }, logError); else pc.addIceCandidate(new RTCIceCandidate(message.candidate), function () {}, logError); }; function setupChat() { channel.onopen = function () { // e.g. enable send button enableChat(channel); }; channel.onmessage = function (evt) { showChatMessage(evt.data); }; } function sendChatMessage(msg) { channel.send(msg); } function logError(error) { log(error.name + ": " + error.message); }
Editors' Note: This example flow needs to be discussed on the list and is likely wrong in many ways.
This shows an example of one possible call flow between two browsers. This does not show the procedure to get access to local media or every callback that gets fired but instead tries to reduce it down to only show the key events and messages.
Examples assume that pc is a connected RTCPeerConnection, and track is an audio track on that connection.
Sending the DTMF signal "1234" with 500 ms duration per tone:
var sender = pc.createDTMFSender(track); if (sender.canInsertDTMF) { var duration = 500; sender.insertDTMF("1234", duration); } else log("DTMF function not available");
Send the DTMF signal "1234", and light up the active key using
lightKey(key)
while the tone is playing (assuming that
lightKey("")
will darken all the keys):
var sender = pc.createDTMFSender(track); sender.ontonechange = function (e) { if (!e.tone) return; // light up the key when playout starts lightKey(e.tone); // turn off the light after tone duration setTimeout(lightKey, sender.duration, ""); }; sender.insertDTMF("1234");
Send a 1-second "1" tone followed by a 2-second "2" tone:
var sender = pc.createDTMFSender(track); sender.ontonechange = function (e) { if (e.tone == "1") sender.insertDTMF("2", 2000); }; sender.insertDTMF("1", 1000);
It is always safe to append to the tone buffer. This example appends before any tone playout has started as well as during playout.
var sender = pc.createDTMFSender(track); sender.insertDTMF("123"); // append more tones to the tone buffer before playout has begun sender.insertDTMF(sender.toneBuffer + "456"); sender.ontonechange = function (e) { if (e.tone == "1") // append more tones when playout has begun sender.insertDTMF(sender.toneBuffer + "789"); };
Send the DTMF signal "123" and abort after sending "2".
var sender = pc.createDTMFSender(track); sender.ontonechange = function (e) { if (e.tone == "2") // empty the buffer to not play any tone after "2" sender.insertDTMF(""); }; sender.insertDTMF("123");
The following events fire on RTCDataChannel
objects:
Event name | Interface | Fired when... |
---|---|---|
open
|
Event
|
The RTCDataChannel object's underlying data
transport has been established (or re-established).
|
MessageEvent
|
Event
|
A message was successfully received. TODO: Ref where MessageEvent is defined? |
error
|
Event
|
TODO. |
close
|
Event
|
The RTCDataChannel object's underlying data
transport has bee closed.
|
The following events fire on RTCPeerConnection
objects:
Event name | Interface | Fired when... |
---|---|---|
connecting
|
Event
|
TODO |
addstream
|
MediaStreamEvent
|
A new stream has been added to the remote streams set. |
removestream
|
MediaStreamEvent
|
A stream has been removed from the remote streams set. |
negotiationneeded
|
Event
|
The browser wishes to inform the application that session negotiation needs to be done at some point in the near future. |
signalingstatechange
|
Event
|
The RTCPeerConnection
signalingState has changed. This state change is the result of
either setLocalDescription()
or setRemoteDescription()
being invoked.
|
iceconnectionstatechange
|
Event
|
The RTCPeerConnection
ice connection state has changed.
|
icecandidate
|
RTCPeerConnectionIceEvent
|
A new RTCIceCandidate is made available to
the script. |
datachannel
|
RTCDataChannelEvent
|
A new RTCDataChannel is dispatched to the
script in response to the other peer creating a channel. |
isolationchange
|
Event
|
A new Event is dispatched to the script when
the isolated attribute on a MediaStreamTrack
changes. |
identityresult
|
RTCIdentityEvent
|
A new RTCIdentityEvent is dispatched to the
script when an identity assertion is successfully generated by an
IdP. |
peeridentity
|
Event
|
A new Event is dispatched to the script when
an identity assertion provided by a peer is successfully
validated. |
idpassertionerror
|
RTCIdentityErrorEvent
|
A new RTCIdentityErrorEvent is
dispatched to the script when an IdP encounters an error while
generating an identity assertion. |
idpvalidationerror
|
RTCIdentityErrorEvent
|
A new RTCIdentityErrorEvent is dispatched to the script
when an IdP encounters an error while validating an identity
assertion. |
The following events fire on RTCDTMFSender
objects:
Event name | Interface | Fired when... |
---|---|---|
tonechange
|
Event
|
The RTCDTMFSender object has either just
begun playout of a tone (returned as the tone
attribute) or just ended playout of a tone (returned as an empty
value in the tone attribute). |
TBD
This section will be removed before publication.
The editors wish to thank the Working Group chairs and Team Contact, Harald Alvestrand, Stefan Håkansson and Dominique Hazaël-Massieux, for their support. Substantial text in this specification was provided by many people including Martin Thomson, Harald Alvestrand, Justin Uberti, and Eric Rescorla.