Initial Author of this Specification was Ian Hickson, Google Inc., with the following copyright statement:
© Copyright 2004-2011 Apple Computer, Inc., Mozilla Foundation, and Opera Software ASA. You are granted a license to use, reproduce and create derivative works of this document.
All subsequent changes since 26 July 2011 done by the W3C WebRTC Working Group are under the following Copyright:
© 2011-2012 W3C® (MIT, ERCIM, Keio), All Rights Reserved. Document use rules apply.
For the entire publication on the W3C site the liability and trademark rules apply.
This document defines a set of ECMAScript APIs in WebIDL to allow media to be sent over the network to another browser or device implementing the appropriate set of real-time protocols, and media to be received from another browser or device. This specification is being developed in conjunction with a protocol specification developed by the IETF RTCWEB group and an API specification to get access to local media devices developed by the Media Capture Task Force.
This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at http://www.w3.org/TR/.
This document is not complete. It is subject to major changes and, while early experimentation is encouraged, it is therefore not intended for implementation. The API is based on preliminary work done in the WHATWG. The Web Real-Time Communications Working Group expects this specification to evolve significantly based on:
This document was published by the Web Real-Time Communications Working Group as an Editor's Draft. If you wish to make comments regarding this document, please send them to public-webrtc@w3.org (subscribe, archives). All feedback is welcome.
Publication as an Editor's Draft does not imply endorsement by the W3C Membership. This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.
This document was produced by a group operating under the 5 February 2004 W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.
This section is non-normative.
There are a number of facets to video-conferencing in HTML covered by this specification:
This document defines the APIs used for these features. This specification is being developed in conjunction with a protocol specification developed by the IETF RTCWEB group and an API specification to get access to local media devices developed by the Media Capture Task Force.
As well as sections marked as non-normative, all authoring guidelines, diagrams, examples, and notes in this specification are non-normative. Everything else in this specification is normative.
The key words must, must not, required, should, should not, recommended, may, and optional in this specification are to be interpreted as described in [RFC2119].
This specification defines conformance criteria that apply to a single product: the user agent that implements the interfaces that it contains.
Implementations that use ECMAScript to implement the APIs defined in this specification must implement them in a manner consistent with the ECMAScript Bindings defined in the Web IDL specification [WEBIDL], as this specification uses that specification and terminology.
The EventHandler
interface represents a callback used for event handlers as defined in
[HTML5].
The concepts queue a task and fires a simple event are defined in [HTML5].
The terms event handlers and event handler event types are defined in [HTML5].
The MediaStream interface, as defined in the
[GETUSERMEDIA] specification, typically represents a stream of data of
audio and/or video. A MediaStream may be extended to
represent a stream that either comes from or is sent to a remote node
(and not just the local camera, for instance). The extensions required to
enable this capability on the MediaStream object will be
described in this document.
A MediaStream as defined in [GETUSERMEDIA] may contain
zero or more MediaStreamTrack objects. A
MediaStreamTrack sent to another peer will appear as one and
only one MediaStreamTrack to the recipient. A peer is
defined as a user agent that supports this specification.
Channels are the smallest unit considered in the
MediaStream specification. Channels are intended to be
encoded together for transmission as, for instance, an RTP payload type.
All of the channels that a codec needs to encode jointly must be in the
same MediaStreamTrack and the codecs should be able to
encode, or discard, all the channels in the track.
The concepts of an input and output to a given
MediaStream apply in the case of MediaStream
objects transmitted over the network as well. A
MediaStream created by a
object (later described in this
document) will take as input the data received from a remote peer.
Similarly, a RTCPeerConnectionMediaStream from a local source, for instance a
camera via [GETUSERMEDIA] will have an output that represents what is
transmitted to a remote peer if the object is used with a
object.RTCPeerConnection
The concept of duplicating MediaStream objects as
described in [GETUSERMEDIA] is also applicable here. This feature can
be used, for instance, in a video-conferencing scenario to display the
local video from the user’s camera and microphone in a local monitor,
while only transmitting the audio to the remote peer (e.g. in response to
the user using a "video mute" feature). Combining tracks from different
MediaStream objects into a new
MediaStream is useful in certain cases.
In this section, we only specify aspects of the
following objects that are relevant when used along with a
. Please refer to the original
definitions of the objects in the [GETUSERMEDIA] document for general
information on using RTCPeerConnectionMediaStream and
MediaStreamTrack.
The label attribute specified in
MediaStream returns a label that is unique to this
stream, so that streams can be recognized after they are sent through
the RTCPeerConnection
API.
When a MediaStream is
created to represent a stream obtained from a remote peer, the
label attribute is initialized from information provided
by the remote source.
The label of a MediaStream object
is unique to the source of the stream, but that does not mean it is
not possible to end up with duplicates. For example, a locally
generated stream could be sent from one user agent to a remote peer
using ,and then sent back to the
original user agent in the same manner, in which case the original
user agent will have multiple streams with the same label (the
locally-generated one and the one received from the remote peer).RTCPeerConnection
A new media track may be associated with an existing
MediaStream . For example, if a remote peer
adds a new MediaStreamTrack object to one of the
track lists of a MediaStream that is being sent
over a , this is observed on
the local user agent. If this happens for the reason exemplified, or
for any other reason than the RTCPeerConnectionadd() [GETUSERMEDIA]
method being invoked locally on a
MediaStreamTrackList or tracks are being
added as the stream is created (i.e. the stream is initialized with
tracks), the user agent must run the following steps:
Create a MediaStreamTrack object
track to represent the new media component.
If track’s kind
attribute equals "audio", add it to the
MediaStream object’s audioTracks
MediaStreamTrackList object.
If track’s kind
attribute equals "video", add it to the
MediaStream object’s videoTracks
MediaStreamTrackList object.
Fire a track event named addtrack with
the newly created track at the
MediaStreamTrackList object.
An existing media track may also be disassociated from a
MediaStream . If this happens for any other
reason than the remove() [GETUSERMEDIA] method being
invoked locally on a MediaStreamTrackList or the
stream is being destroyed, the user agent must run the following
steps:
Let track be the
MediaStreamTrack object representing the
media component about to be removed.
Remove track from the
MediaStreamTrackList object.
Fire a track event named removetrack
with track at the
MediaStreamTrackList object.
The event source for the onended event in the
networked case is the
object.RTCPeerConnection
A MediaStreamTrack object’s reference to its
MediaStream in the non-local media source case (an RTP
source, as is the case for a MediaStream received over a
) is always strong.RTCPeerConnection
When a track belongs to a MediaStream that comes
from a remote peer and the remote peer has permanently stopped sending
data the ended event must be fired on the track, as
specified in [GETUSERMEDIA].
ISSUE: How do you know when it has stopped? This seems like an SDP question, not a media-levelquestion.
A track in a MediaStream , received with a
, must have its
RTCPeerConnectionreadyState attribute [GETUSERMEDIA] set to
MUTED (1) until
media data arrives.
In addition, a MediaStreamTrack has its
readyState set to MUTED on the remote peer if
the local user agent disables the corresponding
MediaStreamTrack in the
MediaStream that is being sent. When the addstream
event triggers on a , all
RTCPeerConnectionMediaStreamTrack objects in the resulting
MediaStream are muted until media data can be read
from the RTP source.
ISSUE: How do you know when it has been disabled? This seems like an SDP question, not a media-levelquestion.
The DTMF API is having a bunch of list discussion and will probably change.
The is a specialization of a
normal AudioMediaStreamTrackMediaStreamTrack that only carries audio
and is extended to have the capability to send and/or receive DTMF
codes.
interface AudioMediaStreamTrack : MediaStreamTrack {
readonly attribute boolean canInsertDTMF;
void insertDTMF (DOMString tones, optional long duration);
};canInsertDTMF of type boolean, readonlyThe canInsertDTMF
attribute must indicate if the
is capable of sending
DTMF.AudioMediaStreamTrack
insertDTMFWhen a object’s AudioMediaStreamTrackinsertDTMF()
method is invoked, the user agent must queue a task that sends the
DTMF tones.
The tone parameters is treated as a series of characters. The characters 0 to 9, A to D, #, and * generated the associated DTMF tones. The characters a to d are equivalent to A to D. The character, indicates a an delay of 2 seconds before processing the next character in the tones parameter. Unrecognized characters are ignored.
The duration parameters indicates the duration in ms to play the each DTMF passed in the tones parameters. The duration can not be more than 6000 or less than 70. The default duration is 100 ms for each tone. The gap between tones must be at least 50 ms but should be as short as possible.
ISSUE: How are invalid values handled?
If insertDTMF is called on the same object while an existing task for this object to generate DTMF is still running, the previous task is canceled. Calling insertDTMF with an empty tones parameter can be used to cancel any tones currently being sent.
Editor Note: We need to add a callback that is set on the object that is called after the tones are sent. This is needed to allow the application to know when it can send new tones without canceling the tones that are currently being sent.
Editor Note: It seems we would want a callback or event for incoming tones. The proposal sent to the list had them played as audio to the speaker but I don’t see how that is useful.
| Parameter | Type | Nullable | Optional | Description |
|---|---|---|---|---|
| tones | DOMString | ✘ | ✘ | |
| duration | long | ✘ | ✔ |
voidA allows two users to communicate
directly, browser to browser. Communications are coordinated via a
signaling channel which is provided by unspecified means, but generally by
a script in the page via the server, e.g. using
RTCPeerConnectionXMLHttpRequest.
Calling new creates a RTCPeerConnection(configuration
) object.RTCPeerConnection
The configuration has the information to find and access the [STUN] and [TURN] servers. There may be multiple servers of each type and any TURN server also acts as a STUN server.
A RTCPeerConnection object has an associated ICE agent, RTCPeerConnection state, and ICE state. These are initialized when the object is created.
When the RTCPeerConnection() constructor is
invoked, the user agent must run the following steps. This algorithm has a
synchronous section (which is triggered as part of the event loop
algorithm).
Create an ICE Agent and let connection’s RTCPeerConnection ICE
Agent be that ICE Agent and provide it the STUN and TURN servers
from the configuration array. The [ICE] will proceed with gathering
as soon as the IceTransports constraint is not set to "none". At this
point the ICE Agent does not know how many ICE components it needs (and
hence the number of candidates to gather) but it can make a reasonable
assumption and as the RTCPeerConnection object gets more information,
it can adjust the number of components.
Set connection’s RTCPeerConnection
readiness state to "new" .
Set connection’s RTCPeerConnection ice
state to "new" .
Let connection’s localStreams
attribute be an empty read-only MediaStream
array.
Let connection’s remoteStreams
attribute be an empty read-only MediaStream
array.
Return connection, but continue these steps asynchronously.
Await a stable state. The synchronous section consists of the remaining steps of this algorithm.
During the lifetime of the RTCPeerConnection object, the following procedures are followed:
If iceState is "new" and the IceTransports constraint is not set to "none", it must queue a task to start gathering ICE address and set the iceState to "gathering".
If the ICE Agent has found one or more candidate pairs for any MediaStreamTrack that forms a valid connection, the ICE state is changed to "connected".
When the ICE Agent finishes checking all candidate pairs, if at least one connection has been found for some MediaStreamTrack, the iceState is changed to "completed" and if no connection has been found for any MediaStreamTrack, the iceState is changed to "failed".
ISSUE: Note that this means that if I was able to negotiate audio but not video via ICE, then iceState == "completed". Is this really what is desired?
If the iceState is "connected" or "completed" and both the local and remote session descriptions are set, the RTCPeerConnection state is set to "active".
If the iceState is "failed", a task is queued to calls the close method.
ISSUE:: CJ - this seems wrong to me.
User agents negotiate the codec resolution, bitrate, and other media
parameters. User agents are recommended to initially negotiate for the
maximum resolution of a video stream. For streams that are then rendered
(using a video element), user agents are recommended to
renegotiate for a resolution that matches the rendered display size.
Starting with the native resolution means that if the Web
application notifies its peer of the native resolution as it starts sending
data, and the peer prepares its video element accordingly,
there will be no need for a renegotiation once the stream is flowing.
The word "components" in this context refers to an RTP media flow and does not have anything to do with how [ICE] uses the term "component".
When a user agent has reached the point where a
MediaStream can be created to represent incoming
components, the user agent must run the following steps:
Let connection be the
expecting this media.RTCPeerConnection
Create a MediaStream object to represent the
media stream.
Run the following steps for each component in the media stream.
Create a MediaStreamTrack object
track to represent the component. [[EDITORIAL: Can we
just reference 3.2.1.2 here?]]
If track's kind
attribute equals "audio", add it to the
MediaStream object's audioTracks
MediaStreamTrackList object.
If track's kind
attribute equals "video", add it to the
MediaStream object's videoTracks
MediaStreamTrackList object.
The creation of new incoming MediaStreams
may be triggered either by SDP negotiation or by the receipt of media
on a given flow.
The internal order in the
MediaStreamTrackList objects on the receiving side
should reflect the order on the sending side. One way to enforce this
is to specify the order in the SDP.
Queue a task to run the following substeps:
If the connection’s RTCPeerConnection
readiness state is CLOSED (3), abort these
steps.
Add the newly created MediaStream object to
the end of connection’s remoteStreams
array.
Fire a stream event named
addstream with the newly
created MediaStream object at the connection object.
When a user agent has negotiated media for a component that belongs to a
media stream that is already represented by an existing
MediaStream object, the user agent must associate the
component with that MediaStream object.
When a finds that a stream from
the remote peer has been removed , the user agent must follow these steps:RTCPeerConnection
Let connection be the
associated with the stream being
removed.RTCPeerConnection
Let stream be the MediaStream object
that represents the media stream being removed, if any. If there isn't
one, then abort these steps.
By definition, stream is now finished.
A task is thus queued to update stream and fire an event.
Queue a task to run the following substeps:
If the connection’s RTCPeerConnection
readiness state is CLOSED (3), abort these
steps.
Remove stream from connection’s
remoteStreams
array.
Fire a stream event named
removestream with
stream at the connection object.
The task source for the tasks listed in this section is the networking task source.
If something in the browser changes that causes the
object to need to initiate a new
session description negotiation, an RTCPeerConnectionnegotiationneeded event is fired at the
object.RTCPeerConnection
In particular, if a object is
consuming a RTCPeerConnectionMediaStream and a
track is added to one of the stream's
MediaStreamTrackList objects, by, e.g., the
add() method
being invoked, the object must fire
the "negotiationneeded" event. Removal of media components must also
trigger "negotiationneeded".RTCPeerConnection
To prevent network sniffing from allowing a fourth party to establish a connection to a peer using the information sent out-of-band to the other peer and thus spoofing the client, the configuration information should always be transmitted using an encrypted connection.
The general operation of the RTCPeerConnection is described in [RTCWEB-JSEP].
The RTCSdpType enum describes the type of a
instance.RTCSessionDescription
enum RTCSdpType {
"offer",
"pranswer",
"answer"
};| Enumeration description | |
|---|---|
offer |
An RTCSdpType of "offer" indicates that a description should be treated as an [SDP] offer. |
pranswer |
An RTCSdpType of "pranswer" indicates that a description should be treated as an [SDP] answer, but not a final answer. A description used as a SDP "pranswer" may be applied as a response to a SDP offer, or an update to a previously sent SDP "pranswer". |
answer |
An RTCSdpType of "answer" indicates that a description should be treated as an [SDP] final answer, and the offer-answer exchange should be considered complete. A description used as a SDP answer may be applied as a response to a SDP offer, or an update to a previously send SDP "pranswer". |
The RTCSessionDescription()
constructor takes an optional dictionary argument,
descriptionInitDict, whose content is used to initialize the
new object. If a dictionary
key is not present in descriptionInitDict, the corresponding
attribute will be initialized to null. If the constructor is run without
the dictionary argument, all attributes will be initialized to null.
This class is a future extensible carrier for the data contained in
it and does not perform any substantive processing.RTCSessionDescription
[Constructor (optional RTCSessionDescriptionInit descriptionInitDict)]
interface RTCSessionDescription {
attribute RTCSdpType? type;
attribute DOMString? sdp;
stringifier DOMString ();
};
dictionary RTCSessionDescriptionInit {
RTCSdpType type;
DOMString sdp;
};sdp of type DOMString, nullabletype of type RTCSdpType, nullableDOMStringObjects implementing the interface must stringify by running the steps below. Let
RTCSessionDescription
type
and sdp
be the elements of attributeList, the list of attributes
to include in the string representation.
Let result be U+0028 LEFT PARENTHESIS U+007B LEFT CURLY BRACKET.
For each attribute in attributeList append, to result, the attribute name, U+003A COLON U+0022 QUOTATION MARK, the attribute value, U+0022 QUOTATION MARK and U+002C COMMA. If the attribute was the last element in attributeList, then remove the last U+002C COMMA.
Append U+007D RIGTH CURLY BRACKET U+0029 RIGHT PARENTHESIS to result and return result.
stringifierRTCSessionDescriptionInit Memberssdp of type DOMStringtype of type RTCSdpTypecallback RTCSessionDescriptionCallback = void (RTCSessionDescription sdp);RTCSessionDescriptionCallback Parameterssdp of type RTCSessionDescriptioncallback RTCVoidCallback = void ();
callback RTCPeerConnectionErrorCallback = void (DOMString errorInformation);RTCPeerConnectionErrorCallback ParameterserrorInformation of type DOMStringISSUE: Should this be an enum?
enum RTCPeerState {
"new",
"opening",
"active",
"closing",
"closed"
};| Enumeration description | |
|---|---|
new | The object was just created, and no networking has yet occurred. |
opening |
The user agent is attempting to establish an connection with the
ICE Agent and waiting for local and remote SDP to be set.
Issue 7 ISSUE: do we need more states between "opening" and "active"? |
active | The ICE Agent has found a connection both the local and remote SDP have been set. It is possible for media to flow. |
closing | The object is terminating
all media and is in the process of closing the connection. |
closed | The connection is closed. |
There is active discussion around changing these states.
enum RTCIceState {
"new",
"gathering",
"waiting",
"checking",
"connected",
"completed",
"failed",
"closed"
};| Enumeration description | |
|---|---|
new | The RTCPeerConnection object was just created, and no networking has yet occurred. |
gathering | The ICE Agent is attempting to gather addresses. |
waiting | The ICE Agent is not gathering any addresses and is waiting for candidates from the other side before it can start checking. |
checking | The ICE Agent is checking candidate pairs but has not yet found a connection. In addition to checking, it may also still be gathering. |
connected | The ICE Agent has found a connection but is still checking other candidate pairs to see if there is a better connection. It may also still be gathering. |
completed | The ICE Agent has finished gathering and checking and found a connection. |
failed | The ICE Agent is finished checking all candidate pairs and failed to find a connection. |
closed | The ICE Agent has shut down and is no longer responding to STUN requests. |
The RTCIceCandidate()
constructor takes an optional dictionary argument,
candidateInitDict, whose content is used to initialize the
new object. If a dictionary
key is not present in candidateInitDict, the corresponding
attribute will be initialized to null. If the constructor is run without
the dictionary argument, all attributes will be initialized to null.
This class is a future extensible carrier for the data contained in
it and does not perform any substantive processing.RTCIceCandidate
[Constructor (optional RTCIceCandidateInit candidateInitDict)]
interface RTCIceCandidate {
attribute DOMString? candidate;
attribute DOMString? sdpMid;
attribute unsigned short? sdpMLineIndex;
stringifier DOMString ();
};
dictionary RTCIceCandidateInit {
DOMString candidate;
DOMString sdpMid;
unsigned short sdpMLineIndex;
};candidate of type DOMString, nullablesdpMLineIndex of type unsigned short, nullablesdpMid of type DOMString, nullableDOMStringRTCIceCandidate
interface must stringify as defined by the
RTCSessionDescription stringifier
algorithm with candidate,
sdpMid,
sdpMLineIndex
as the elements of attributeList.stringifierRTCIceCandidateInit Memberscandidate of type DOMStringsdpMLineIndex of type unsigned shortsdpMid of type DOMStringdictionary RTCIceServer {
DOMString url;
nullable DOMString credential;
};RTCIceServer MembersIn network topologies with multiple layers of NATs, it is desirable to have a STUN servers between every layer of NATs in addition to the TURN servers to minimize the number peer to peer network latency.
An example array of RTCIceServer objects is:
[ { url:"stun:stun.example.net"] } , {
url:"turn:user@turn.example.org", credential:"myPassword"} ]
dictionary RTCIceServer {
RTCIceServer[] iceServers;
};RTCIceServer MembersiceServers of type array of RTCIceServerAn array of containing the STUN and TURN servers provided by the JS that can be used by ICE.
typedef MediaStream[] MediaStreamArray;[Constructor (RTCConfiguration configuration, optional MediaConstraints
constraints)]
interface RTCPeerConnection : EventTarget {
void createOffer (RTCSessionDescriptionCallback successCallback, optional RTCPeerConnectionErrorCallback failureCallback, optional MediaConstraints constraints);
void createAnswer (RTCSessionDescription offer, RTCSessionDescriptionCallback successCallback, optional RTCPeerConnectionErrorCallback? failureCallback = null, optional optional MediaConstraints constraints = null, optional optional boolean createProvisionalAnswer = false);
void setLocalDescription (RTCSessionDescription description, optional RTCVoidCallback successCallback, optional RTCPeerConnectionErrorCallback failureCallback);
readonly attribute RTCSessionDescription localDescription;
void setRemoteDescription (RTCSessionDescription description, optional RTCVoidCallback successCallback, optional RTCPeerConnectionErrorCallback failureCallback);
readonly attribute RTCSessionDescription remoteDescription;
readonly attribute RTCPeerState readyState;
void updateIce (optional RTCConfiguration? configuration = null, optional optional MediaConstraints? constraints = null, optional boolean restart = false);
void addIceCandidate (RTCIceCandidate candidate);
readonly attribute RTCIceState iceState;
readonly attribute MediaStreamArray localStreams;
readonly attribute MediaStreamArray remoteStreams;
DataChannel createDataChannel ([TreatNullAs=EmptyString] DOMString label, optional DataChannelInit dataChannelDict);
attribute EventHandler ondatachannel;
void addStream (MediaStream stream, optional MediaConstraints constraints);
void removeStream (MediaStream stream);
void close ();
attribute EventHandler onnegotationneeded;
attribute EventHandler onicecandidate;
attribute EventHandler onopen;
attribute EventHandler onstatechange;
attribute EventHandler onaddstream;
attribute EventHandler onremovestream;
attribute EventHandler onicechange;
};iceState of type RTCIceState, readonlyThe iceState attribute
must return the state of the RTCPeerConnection ICE
Agent ICE state.
localDescription of type RTCSessionDescription, readonlyThe localDescription
attribute must return the
that was most recently passed to RTCSessionDescriptionsetLocalDescription(),
plus any local candidates that have been generated by the ICE Agent
since then.
A null object will be returned if the local description has not yet been set.
localStreams of type MediaStreamArray, readonlyReturns a live array containing the local streams (those that
were added with addStream() ).
onaddstream of type EventHandleraddstream, must be
fired by all objects implementing the
RTCPeerConnection interface It is called any
time a MediaStream is added by the remote peer.
ondatachannel of type EventHandlerdatachannel , must be
supported by all objects implementing the
RTCPeerConnection interface.onicecandidate of type EventHandleronicecandidate , must be supported
by all objects implementing the RTCPeerConnection
interface. It is called any time there is a new ICE candidate
added to a previous offer or answer.onicechange of type EventHandlericechange, must be
fired by all objects implementing the
RTCPeerConnection interface. It is called any
time the iceState changes.
onnegotationneeded of type EventHandlernegotiationneeded , must
be supported by all objects implementing the
RTCPeerConnection interface.
onopen of type EventHandleropen , must be supported by all
objects implementing the RTCPeerConnection
interface.onremovestream of type EventHandlerremovestream, must be
fired by all objects implementing the
RTCPeerConnection interface. It is called any
time a MediaStream is removed by the remote peer.
onstatechange of type EventHandlerstatechange , must be supported
by all objects implementing the RTCPeerConnection
interface. It is called any time the readyState changes.readyState of type RTCPeerState, readonlyThe readyState attribute must return the
object's RTCPeerConnectionRTCPeerConnection
readiness state.
remoteDescription of type RTCSessionDescription, readonlyThe remoteDescription
attribute must return the
that was most recently passed to RTCSessionDescriptionsetRemoteDescription(),
plus any remote candidates that have been supplied via
addIceCandidate()
since then.
A null object will be returned if the remote description has not yet been set.
remoteStreams of type MediaStreamArray, readonlyReturns a live array containing the streams that the remote streams. (those that were added by the remote side).
This array is updated when addstream and
removestream events
are fired.
addIceCandidateThe addIceCandidate()
method provides a remote candidate to the ICE Agent, which will be
added to the remote description. Connectivity checks will be sent
to the new candidates as long as the "IceTransports" constraint is
not set to "none". This call will result in a change to the state
of the ICE Agent, and may result in a change to media state if it
results in different connectivity being established.
A TBD exception will be thrown if candidate parameter is malformed.
| Parameter | Type | Nullable | Optional | Description |
|---|---|---|---|---|
| candidate | | ✘ | ✘ |
voidaddStreamAdds a new stream to the RTCPeerConnection.
When the addStream() method is invoked, the user agent must
run the following steps:
If the object's
RTCPeerConnectionRTCPeerConnection
readiness state is CLOSED (3), throw
an INVALID_STATE_ERR exception.
If stream is already in the
object's RTCPeerConnectionlocalStreams
object, then abort these steps.
Add stream to the end of the
object's RTCPeerConnectionlocalStreams
object.
Parse the constraints provided by the application and apply them to the MediaStream, if possible. NOTE - need to deal with throwing an exception here.
Fire a negotiationneeded event.
ISSUE: Should this fire if the RTCPeerConnection is in "new"?
| Parameter | Type | Nullable | Optional | Description |
|---|---|---|---|---|
| stream | MediaStream | ✘ | ✘ | |
| constraints | MediaConstraints | ✘ | ✔ |
voidcloseWhen the close() method is invoked, the user agent must run
the following steps:
If the object's
RTCPeerConnectionRTCPeerConnection
readiness state is CLOSED (3), throw
an INVALID_STATE_ERR exception.
Destroy the RTCPeerConnection ICE
Agent, abruptly ending any active ICE processing and any
active streaming, and releasing any relevant resources (e.g.
TURN permissions).
Set the object's RTCPeerConnection
readiness state to CLOSED (3).
voidcreateAnswerThe createAnswer method generates a [SDP] answer with the supported configuration for the session that is compatible with the parameters supplied in offer. Like createOffer, the returned blob contains descriptions of the local MediaStreams attached to this RTCPeerConnection, the codec/RTP/RTCP options negotiated for this session, and any candidates that have been gathered by the ICE Agent. The constraints parameter may be supplied to provide additional control over the generated answer.
As an answer, the generated SDP will contain a specific configuration that, along with the offer, specifies how the media plane should be established. The generation of the SDP must follow the appropriate process for generating an answer or provisional answer.
Session descriptions generated by createAnswer must be immediately usable by setLocalDescription without generating an error if setLocalDescription is called from the successCallback function. Like createOffer, the returned description should reflect the current state of the system. The session descriptions must remain usable by setLocalDescription without causing an error until at least the end of the successCallback function. Calling this method is needed to get the ICE user name fragment and password. Provisional offers, as described in [RTCWEB-JSEP], are created if and only if the createProvisionalOffer flag is true.
The failureCallback will be called if the system cannot generate an appropriate answer given the offer.
A TBD exception is thrown if the constraints parameter is malformed.
| Parameter | Type | Nullable | Optional | Description |
|---|---|---|---|---|
| offer | | ✘ | ✘ | |
| successCallback | | ✘ | ✘ | |
| null | RTCPeerConnectionErrorCallback? failureCallback = | ✘ | ✔ | |
| null | optional MediaConstraints constraints = | ✘ | ✔ | |
| false | optional boolean createProvisionalAnswer = | ✘ | ✔ |
voidcreateDataChannelCreates a new object with the
given label. The DataChannel dictionary can
be used to configure properties of underlying channel such as
data reliability. A corresponding
DataChannelInit object is dispatched at the other
peer if the channel setup was successful.DataChannel
When the createDataChannel()
method is invoked, the user agent must run the following steps.
If the object’s
RTCPeerConnectionRTCPeerConnection
readiness state is CLOSED (3), throw
an INVALID_STATE_ERR exception.
Let channel be a newly created
object.DataChannel
Initialize channel’s label attribute to the
value of the first argument.
Initialize channel’s reliable attribute to
true.
If the second argument is present and it contains a
reliable dictionary
member, then set channel’s reliable attribute to
the dictionary member value.
Return channel and continue these steps in the background.
Create channel’s associated underlying data transport.
| Parameter | Type | Nullable | Optional | Description |
|---|---|---|---|---|
| label | DOMString | ✘ | ✘ | |
| dataChannelDict | | ✘ | ✔ |
DataChannelcreateOfferThe createOffer method generates a blob of SDP that contains a RFC 3264 offer with the supported configurations for the session, including descriptions of the local MediaStreams attached to this RTCPeerConnection, the codec/RTP/RTCP options supported by this implementation, and any candidates that have been gathered by the ICE Agent. The constraints parameter may be supplied to provide additional control over the offer generated.
As an offer, the generated SDP will contain the full set of capabilities supported by the session (as opposed to an answer, which will include only a specific negotiated subset to use); for each SDP line, the generation of the SDP must follow the appropriate process for generating an offer. In the event createOffer is called after the session is established, createOffer will generate an offer that is compatible with the current session, incorporating any changes that have been made to the session since the last complete offer-answer exchange, such as addition or removal of streams. If no changes have been made, the offer will be include the capabilities of the current local description as well as any additional capabilities that could be negotiated in an updated offer.
Session descriptions generated by createOffer must be immediately usable by setLocalDescription without causing an error as long as setLocalDiscription is called within the successCallback function. If a system has limited resources (e.g. a finite number of decoders), createOffer needs to return an offer that reflects the current state of the system, so that setLocalDescription will succeed when it attempts to acquire those resources. The session descriptions must remain usable by setLocalDescription without causing an error until at least end of the successCallback function. Calling this method is needed to get the ICE user name fragment and password.
The failureCallback will be called if the system can not generate an appropriate offer given the state of the RTCPeerConnection.
A TBD exception is thrown if the constraints parameter is malformed.
To Do: Discuss privacy aspects of this from a finger printing point of view - it's probably around as bad as access to a canvas :-)
| Parameter | Type | Nullable | Optional | Description |
|---|---|---|---|---|
| successCallback | | ✘ | ✘ | |
| failureCallback | | ✘ | ✔ | |
| constraints | MediaConstraints | ✘ | ✔ |
voidremoveStreamRemoves the given stream from the localStream array in the RTCPeerConnection and fires negotiationneeded.
When the other peer stops sending a stream in this manner, a
removestream event is
fired at the object.RTCPeerConnection
When the removeStream() method is invoked, the user agent
must run the following steps:
If the object's
RTCPeerConnectionRTCPeerConnection
readiness state is CLOSED (3), throw
an INVALID_STATE_ERR exception.
If stream is not in the
object's RTCPeerConnectionlocalStreams
object, then abort these steps. TODO: Do we need an exception
here?
Remove stream from the
object's RTCPeerConnectionlocalStreams
object.
Fire a negotiationneeded event.
| Parameter | Type | Nullable | Optional | Description |
|---|---|---|---|---|
| stream | MediaStream | ✘ | ✘ |
voidsetLocalDescriptionThe setLocalDescription()
method instructs the to apply
the supplied RTCPeerConnection as the local
description.RTCSessionDescription
This API changes the local media state. In order to successfully
handle scenarios where the application wants to offer to change
from one media format to a different, incompatible format, the
must be able to
simultaneously support use of both the old and new local
descriptions (e.g. support codecs that exist in both descriptions)
until a final answer is received, at which point the
RTCPeerConnection can fully adopt the new local
description, or roll back to the old description if the remote side
denied the change.RTCPeerConnection
ISSUE: how to indicate to roll back?
To Do: specify what parts of the SDP can be changed between the createOffer and setLocalDescription
Changes to the state of media transmission will occur when a
final answer is successfully applied. localDescription
must return the previous description until the new
description is successfully applied.
The failureCallback will be called if the
is a valid description but cannot be
applied at the media layer, e.g., if there are
insufficient resources to apply the SDP. The user agent
must roll back as necessary if the new description was
partially applied when the failure occurred.RTCSessionDescription
A TBD exception is thrown if the SDP content is invalid.
| Parameter | Type | Nullable | Optional | Description |
|---|---|---|---|---|
| description | | ✘ | ✘ | |
| successCallback | | ✘ | ✔ | |
| failureCallback | | ✘ | ✔ |
voidsetRemoteDescriptionThe setRemoteDescription()
method instructs the to apply
the supplied RTCPeerConnection as the
remote offer or answer. This API changes the local media state.RTCSessionDescription
Changes to the state of media transmission will occur when a
final answer is successfully applied.
remoteDescription must return the previous
description until the new description is successfully applied.
The failureCallback will be called if the
is a valid description but
cannot be applied at the media layer, e.g., if there are
insufficient resources to apply the SDP. The user agent must roll
back as necessary if the new description was partially applied when
the failure occurred.RTCSessionDescription
A TBD exception is thrown if the SDP content is invalid.
| Parameter | Type | Nullable | Optional | Description |
|---|---|---|---|---|
| description | | ✘ | ✘ | |
| successCallback | | ✘ | ✔ | |
| failureCallback | | ✘ | ✔ |
voidupdateIceThe updateIce method restarts or updates the ICE Agent process of gathering local candidates and pinging remote candidates. If there is a mandatory constraint called "IceTransports" it will control how the ICE engine can act. This can be used to limit the use to TURN candidates by a callee to avoid leaking location information prior to the call being accepted.
This call may result in a change to the state of the ICE Agent, and may result in a change to media state if it results in connectivity being established.
If the restart parameter is set to true, the ICE state machine discards all candidates it has gathered, allocates new ports for the host candidates, and restarts ICE as if there had been no previous ICE session. Applications can use this to reset all ICE negotiation when something has gone terribly wrong.
A TBD exception will be thrown if constraints parameter is malformed.
| Parameter | Type | Nullable | Optional | Description |
|---|---|---|---|---|
| null | RTCConfiguration? configuration = | ✘ | ✔ | |
| null | optional MediaConstraints? constraints = | ✘ | ✔ | |
| false | boolean restart = | ✘ | ✔ |
voidIANA is requested to register the constraints defined in Constraints Section as specified in [RTCWEB-CONSTRAINTS].
TOOD: Need to change the naming and declaration of these constraints to match the constraints draft once that is a bit further along. The names here now are likely not quite right but they serve as a place holder.
ISSUE: there are multiple ways to add constraints. How are multiple values reconciled?
The following new constraints are defined that can be used with a RTCPeerConnection object:
This is a enum type constraint that can take the values "true" and "false". The default is a non mandatory "true" for a RTCPeerConnection object that has a video stream at the point in time when the constraints are being evaluated and is non mandatory "false" otherwise.
In some cases, a RTCPeerConnection may wish to receive video but it is not going to send any video. The RTCPeerConnection needs to know if it should signal to the remote side if it wishes to receive video or not. This constraint allows an application to indicate its preferences for receiving video when creating an offer.
This is a enum type constraint that can take the values "true" and "false". The default is a non mandatory "true".
In some cases, a RTCPeerConnection may wish to receive audio but it is not going to send any audio. The RTCPeerConnection needs to know if it should signal to the remote side if it wishes to receive audio. This constraints allows an application to indicate its preferences for receiving audio when creating an offer.
This is a enum type constraint that can take the values "true" and "false". The default is a non mandatory "true".
Many codecs and system are capable of detecting "silence" and changing their behavior in this case by doing things such as not transmitting any media. In many cases, such as when dealing with sounds other than spoken voice or emergency calling, it is desirable to be able to turn off this behavior. This constraints allows the application to provide information about if it wishes this type of processing enable or disabled.
This is a enum type constraint that can take the values "none", "relay", and "all". The default is a non mandatory "all".
This constraint indicates which candidates the ICE engine is allowed to use. The value "none" means the ICE engine must not send or receive any packets at this point. The value "relay" indicates the ICE engine must only use media relay candidates such as candidates passing through a TURN server. This can be used to reduce leakage of IP addresses in certain use cases. The value of "all" indicates all values can be used.
TODO items - need to register with IANA.
When two peers decide they are going to set up a connection to each other, they both go through these steps. The STUN/TURN server configuration describes a server they can use to get things like their public IP address or to set up NAT traversal. They also have to send data for the signaling channel to each other using the same out-of-band mechanism they used to establish that they were going to communicate in the first place.
var signalingChannel = createSignalingChannel(); var pc; var configuration = ...; // run start(true) to initiate a call function start(isCaller) { pc = new RTCPeerConnection(configuration); // send any ice candidates to the other peer pc.onicecandidate = function (evt) { signalingChannel.send(JSON.stringify({ "candidate": evt.candidate })); }; // once remote stream arrives, show it in the remote video element pc.onaddstream = function (evt) { remoteView.src = URL.createObjectURL(evt.stream); }; // get the local stream, show it in the local video element and send it navigator.getUserMedia({ "audio": true, "video": true }, function (stream) { selfView.src = URL.createObjectURL(stream); pc.addStream(stream); if (isCaller) pc.createOffer(gotDescription); else pc.createAnswer(pc.remoteDescription, gotDescription); function gotDescription(desc) { pc.setLocalDescription(desc); signalingChannel.send(JSON.stringify({ "sdp": desc })); } }); } signalingChannel.onmessage = function (evt) { if (!pc) start(false); var signal = JSON.parse(evt.data); if (signal.sdp) pc.setRemoteDescription(new RTCSessionDescription(signal.sdp)); else pc.addIceCandidate(new RTCIceCandidate(signal.candidate)); };
This example shows the more comples functionality.
TODOEditor Note: This example flow needs to be discussed on the list and is likely wrong in many ways.
This shows an example of one possible call flow between two browsers. This does not show every callback that gets fired but instead tries to reduce it down to only show the key events and messages.
The following flow show a more complete set of the callbacks and events that happen.
Editor Note: This example flow needs to be discussed on the list and is likely wrong in many ways.
This shows an example of one possible call flow between a centralized conferencing server and a browsers. This does not show every callback that gets fired but instead tries to reduce it down to only show the key events and messages.
The Peer-to-peer Data API lets a web application send and receive generic application data peer-to-peer.
The interface represents a
bi-directional data channel between two peers. A
DataChannel is created via a factory method on a
DataChannel object. The corresponding
RTCPeerConnection object is then dispatched at the other
peer if the channel setup was successful.DataChannel
Each has an associated underlying
data transport that is used to transport actual data to the other
peer. The transport properties of the underlying data transport,
such as reliability mode, are configured by the peer taking the
initiative to create the channel. The other peer cannot change any
transport properties of a offered data channel. The actual wire protocol
between the peers is out of the scope for this specification.DataChannel
A created with DataChannelcreateDataChannel()
must initially be in the connecting state. If the
object’s underlying data transport
is successfully set up, the user agent must announce the DataChannelDataChannel as
open.
When the user agent is to announce
a DataChannel as open, the user agent must queue a
task to run the following steps:
If the associated object’s
RTCPeerConnectionRTCPeerConnection
readiness state is CLOSED (3), abort these
steps.
Let channel be the
object to be announced.DataChannel
Set channel’s readyState attribute to
open.
Fire a simple event named open at channel.
When an underlying data transport has been established, the user agent, of the peer that did not initiate the creation process must queue a task to run the following steps:
If the associated object’s
RTCPeerConnectionRTCPeerConnection
readiness state is CLOSED (3), abort these
steps.
Let configuration be an information bundle with key-value pairs, received from the other peer as a part of the process to establish the underlying data channel.
Let channel be a newly created
object.DataChannel
Initialize channel’s label attribute to value that
corresponds to the "label" key in
configuration.
Initialize channel’s reliable attribute to
true.
If configuration contains a key named
"reliable", set channel’s reliable attribute to the
corresponding value.
Set channel’s readyState attribute to
open.
Fire a datachannel event named datachannel with
channel at the
object.RTCPeerConnection
When the process of tearing down a
object’s underlying data
transport is initiated, the user agent must run the following
steps:DataChannel
If the associated object’s
RTCPeerConnectionRTCPeerConnection
readiness state is CLOSED, abort these
steps.
Let channel be the
object which is about to be closed.DataChannel
If channel’s readyState is closing or closed, then abort these
steps.
Set channel’s readyState attribute to
closing.
Queue a task to run the following steps:
Close channel’s underlying data transport.
Set channel’s readyState attribute to
closed
(3).
Fire a simple event named close at
channel.
interface DataChannel {
readonly attribute DOMString label;
readonly attribute boolean reliable;
readonly attribute DataChannelState readyState;
readonly attribute unsigned long bufferedAmount;
attribute EventHandler onopen;
attribute EventHandler onerror;
attribute EventHandler onclose;
void close ();
attribute EventHandler onmessage;
attribute DOMString binaryType;
void send (DOMString data);
void send (ArrayBuffer data);
void send (Blob data);
};binaryType of type DOMStringFIXME: align behavior with WebSocket API
bufferedAmount of type unsigned long, readonlyFIXME: align behavior with WebSocket API
label of type DOMString, readonlyThe DataChannel.label
attribute represents a label that can be used to distinguish this
object from other
DataChannel objects. The attribute must return
the value to which it was set when the
DataChannel object was created.DataChannel
onclose of type EventHandlerclose , must be supported by all
objects implementing the DataChannel
interface.onerror of type EventHandlererror , must be supported by all
objects implementing the DataChannel
interface.onmessage of type EventHandlermessage , must be supported by
all objects implementing the DataChannel
interface.onopen of type EventHandleropen , must be supported by all
objects implementing the DataChannel
interface.readyState of type DataChannelState, readonlyThe DataChannel.readyState
attribute represents the state of the DataChannel
object. It must return the value to which the user agent last set it
(as defined by the processing model algorithms).
reliable of type boolean, readonlyThe DataChannel.reliable
attribute returns true if the is
reliable, and false otherwise. The attribute must return the value to
which it was set when the DataChannel was
created.DataChannel
closeCloses the . It may be called
regardless if the DataChannel object was created
by this peer or the remote peer.DataChannel
When the close() method is called, the
user agent must initiate the
process of tearing down the DataChannel object’s
underlying data transport.
voidsendFIXME: align behavior with WebSocket API
| Parameter | Type | Nullable | Optional | Description |
|---|---|---|---|---|
| data | DOMString | ✘ | ✘ |
voidsendFIXME: align behavior with WebSocket API
| Parameter | Type | Nullable | Optional | Description |
|---|---|---|---|---|
| data | ArrayBuffer | ✘ | ✘ |
voidsendFIXME: align behavior with WebSocket API
| Parameter | Type | Nullable | Optional | Description |
|---|---|---|---|---|
| data | Blob | ✘ | ✘ |
voiddictionary DataChannelInit {
boolean reliable;
};DataChannelInit Membersreliable of type booleanenum DataChannelState {
"connecting",
"open",
"closing",
"closed"
};| Enumeration description | |
|---|---|
connecting |
The user agent is attempting to establish the underlying data
transport. This is the initial state of a
|
open |
The underlying data transport is established and
communication is possible. This is the initial state of a
|
closing |
The process of closing down the underlying data transport has started. |
closed |
The underlying data transport has been closed or could not be established. |
This simple example shows how to create a DataChannel, register an event listener to handle incoming data, and how to send a message.
var chan = peerConn.createDataChannel("mylabel"); chan.onmessage = function (evt) { // use evt.data }; chan.send("hello"); }
This simple example shows how to register an event listener to handle the case when a remote peer creates a new DataChannel.
peerConn.ondatachannel = function (evt) { var chan = evt.channel; chan.onmessage = function (evt) { // use evt.data }; chan.onclose = function () { // remote side closed the data channel }; };
A Window object has
a strong reference to any
objects created from the constructor whose global object is that
RTCPeerConnectionWindow object.
The onicecandidate event of the RTCPeerConnection uses
the interface.RTCPeerConnectionIceEvent
Firing a
event named
e with an RTCPeerConnectionIceEvent
candidate means that an event with the name e,
which does not bubble (except where otherwise stated) and is not
cancelable (except where otherwise stated), and which uses the
RTCIceCandidateRTCPeerConnectionIceEvent interface with the
candidate attribute set to the new ICE candidate must be
created and dispatched at the given target.
[Constructor(DOMString type, RTCPeerConnectionIceEventInit eventInitDict)]
interface RTCPeerConnectionIceEvent : Event {
readonly attribute RTCIceCandidate candidate;
};
dictionary RTCPeerConnectionIceEventInit : EventInit {
RTCIceCandidate candidate;
};candidate of type RTCIceCandidate, readonlyThe candidate attribute is the
object with the new ICE candidate
that caused the event.RTCIceCandidate
RTCPeerConnectionIceEventInit Memberscandidate of type RTCIceCandidate
The addtrack and
removetrack events
use the interface.MediaStreamTrackEvent
Firing a track event named
e with a MediaStreamTrack
track means that an event with the name e, which
does not bubble (except where otherwise stated) and is not cancelable
(except where otherwise stated), and which uses the
interface with the
MediaStreamTrackEventtrack
attribute set to track, must be created and dispatched at the
given target.
[Constructor(DOMString type, MediaStreamTrackEventInit eventInitDict)]
interface MediaStreamTrackEvent : Event {
readonly attribute MediaStreamTrack track;
};
dictionary MediaStreamTrackEventInit : EventInit {
readonly MediaStreamTrack? track;
};track of type MediaStreamTrack, readonlyThe track attribute
represents the MediaStreamTrack object associated
with the event.
MediaStreamTrackEventInit Memberstrack of type readonly MediaStreamTrack, nullable
The addstream
and removestream events use the
interface.MediaStreamEvent
Firing a
stream event named e with a
MediaStream stream means that an event
with the name e, which does not bubble (except where otherwise
stated) and is not cancelable (except where otherwise stated), and which
uses the interface with the
MediaStreamEventstream attribute
set to stream, must be created and dispatched at the
given target.
[Constructor(DOMString type, MediaStreamEventInit eventInitDict)]
interface MediaStreamEvent : Event {
readonly attribute MediaStream? stream;
};
dictionary MediaStreamEventInit : EventInit {
MediaStream stream;
};stream of type MediaStream, readonly, nullableThe stream attribute
represents the MediaStream object associated with
the event.
MediaStreamEventInit Membersstream of type MediaStream
The datachannel event use the
interface.DataChannelEvent
Firing a datachannel event named
e with a
channel means that an event with the name e, which
does not bubble (except where otherwise stated) and is not cancelable
(except where otherwise stated), and which uses the
DataChannel interface with the DataChannelEventchannel attribute set to
channel, must be created and dispatched at the given
target.
[Constructor(DOMString type, DataChannelEventInit eventInitDict)]
interface DataChannelEvent : Event {
readonly attribute DataChannel channel;
};
dictionary DataChannelEventInit : EventInit {
DataChannel channel;
};channel of type DataChannel, readonlyThe channel attribute
represents the object associated with
the event.DataChannel
DataChannelEventInit Memberschannel of type DataChannel
This section is non-normative.
The following event fires on
objects:DataChannel
| Event name | Interface | Fired when... |
|---|---|---|
open |
Event |
The object’s underlying data
transport has been established (or re-established).
|
MessageEvent |
Event |
A message was successfully received. TODO: Ref where MessageEvent is defined? |
error |
Event |
TODO. |
close |
Event |
The object’s underlying data
transport has was closed.
|
The following events fire on
objects:RTCPeerConnection
| Event name | Interface | Fired when... |
|---|---|---|
connecting |
Event |
TODO |
open |
Event |
TODO |
addstream |
|
A new stream has been added to the remoteStreams
array. |
removestream |
|
A stream has been removed from the remoteStreams
array. |
negotiationneeded |
Event |
The browser wishes to inform the application that session negotiation needs to be done at some point in the near future. |
statechange |
Event |
TODO |
icechange |
Event |
TODO |
icecandidate |
|
TODO |
TBD.
This section will be removed before publication.
The editors wish to thank the Working Group chairs, Harald Alvestrand and Stefan Håkansson, for their support.