Making a Zoom Alternative
In this guide, we are going to make a Zoom-like video conferencing system using React, CPaaS APIs, SDKs and other tools.
The full source code for this project is available on GitHub.
We will use:
-
The CPaaS Video SDK will run in the client's browser. It handles the cameras, the microphones, communication with the CPaaS servers, and with other members in the conference. We will also use this SDK to display the video stream in the browser.
-
We will use the CPaaS REST APIs for Video to provision rooms and access tokens for your conference members from the CPaaS server. CPaaS REST APIs are only available on your server, as they require your CPaaS API tokens to operate which shouldn't be exposed client-side.
-
We will use React library from CPaaS Community to handle the integration between the SDK and React.
We will be using Next.js for convenience and brevity, but you should be able to use any React framework to write the frontend and any server-side framework to write the backend. We will use the React Bootstrap framework to make a neat layout without too much boilerplate.
If you are looking for something far simpler to quickly embed on your existing page, please use the Embeddable Video Room widget instead.
Setting Up the project
Our starting point will be the Next.js boilerplate on which we will install the packages discussed above:
yarn create next-app --typescript
cd <your app name>
yarn add @signalwire-community/react
yarn add bootstrap react-bootstrap react-bootstrap-icons swr axios
Backend
While most of the work with respect to capturing and displaying media in the conference happens client-side, you do still need a server to securely proxy the CPaaS REST API. The client SDK needs a token be able to access the CPaaS servers hosting the conference. Your server can query for this token using CPaaS REST API, given that you have the API credentials.
Note that this is not the server where all the video streaming and processing happens. All those complex tasks will be handled by powerful CPaaS servers elsewhere. The figure below illustrates how all parts fit.
Diagram of the interaction between the client, your server, and CPaaS.
In a production setting, your server should authenticate your users, manage their permissions, get appropriate tokens for members and relay the tokens from the CPaaS's Video REST APIs to the client's browser.
The following code will create a new endpoint at /api/token
, which will query CPaaS and serve tokens given at least a valid
room_name
. It also takes additional user_name
and mod
parameters. The user_name
parameter simply sets the display name
for the user requesting the token. The mod
parameter (short for "moderator" in this case) selects between the two sets of
permissions defined in permissions.ts
which can be assigned to the user.
Note that the location of this file ensures that this will run server-side at api/token
endpoint.
Learn more about Next.js routing here.
- token.ts
- permissions.ts
import axios from "axios";
import { FULL_PERMISSIONS, GUEST_PERMISSIONS } from "../../data/permissions";
const AUTH = {
username: process.env.PROJECT_ID as string,
password: process.env.API_TOKEN as string,
};
const SPACE_NAME = process.env.SPACE_NAME as string;
export default async function handler(req: any, res: any) {
const { room_name, user_name, mod } = req.query;
if (room_name === undefined) return res.status(422).json({ error: true });
try {
const tokenResponse = await axios.post(
`https://${SPACE_NAME}.signalwire.com/api/video/room_tokens`,
{
room_name,
user_name,
enable_room_previews: true,
permissions: mod === "true" ? FULL_PERMISSIONS : GUEST_PERMISSIONS,
},
{ auth: AUTH } // pass {username: project_id, password: api_token} as basic auth
);
const token = tokenResponse.data.token;
if (token !== undefined) res.json({ token, error: false });
else res.status(400).json({ error: true });
} catch (e) {
res.status(400).json({ error: true });
}
}
export const FULL_PERMISSIONS = [
"room.hide_video_muted",
"room.show_video_muted",
"room.list_available_layouts",
"room.playback",
"room.recording",
"room.set_layout",
"room.set_position",
//Members
"room.member.audio_mute",
"room.member.audio_unmute",
"room.member.deaf",
"room.member.undeaf",
"room.member.remove",
"room.member.set_input_sensitivity",
"room.member.set_input_volume",
"room.member.set_output_volume",
"room.member.video_mute",
"room.member.video_unmute",
"room.member.set_position",
//Self
"room.self.additional_source",
"room.self.audio_mute",
"room.self.audio_unmute",
"room.self.deaf",
"room.self.undeaf",
"room.self.screenshare",
"room.self.set_input_sensitivity",
"room.self.set_input_volume",
"room.self.set_output_volume",
"room.self.video_mute",
"room.self.video_unmute",
"room.self.set_position",
];
export const GUEST_PERMISSIONS = [
//Members
"room.member.remove",
//Self
"room.self.additional_source",
"room.self.audio_mute",
"room.self.audio_unmute",
"room.self.deaf",
"room.self.undeaf",
"room.self.screenshare",
"room.self.set_input_sensitivity",
"room.self.set_input_volume",
"room.self.set_output_volume",
"room.self.video_mute",
"room.self.video_unmute",
"room.self.set_position",
];
In a production setting, you would want this endpoint to be behind an authentication middleware to make sure only your intended users can use it. For Next.js, an easy addition would be next-auth.
You might also want to check if the users requesting mod permissions have the authorization to actually do so in your system.
To quickly go over various parts of this code:
-
The constants
FULL_PERMISSIONS
andGUEST_PERMISSIONS
are arrays of strings representing the permissions given to the user. So whileFULL_PERMISSIONS
might look like[..., 'room.member.video.mute', 'room.member.remove', ...]
,GUEST_PERMISSIONS
would look like[..., 'room.self.video.mute']
, indicating that guest is not allowed to mute or remove any other user.CPaaS offers a flexible permission system so you can give users all combination of permissions as required. Permissions are described here.
-
The constant
AUTH
is a structure that assigns your CPaaS Project ID as the username, and the API token as password. You will find the Project ID and API token at your CPaaS Dashboard (explained here). We will use this for basic auth to authenticate with the CPaaS REST API.The constant
SPACE_NAME
is your CPaaS username which you also use as the subdomain to access your Dashboard. -
We perform an HTTP POST request using Axios to the room_tokens endpoint. We will send the name of the room, the name of the user, and the array of permissions for the user to this endpoint. We will also give axios the Project ID and the API token to be encoded as basic authentication header.
If all goes well, the CPaaS server will send us a token that we can forward to the client.
Testing the `/api/token` endpoint with Thunder Client
This simple backend will suffice to be able to conduct video conferences. But we will have one more endpoint to add here to support room previews.
Frontend
We will rely heavily on the CPaaS Community React library (@signalwire-community/react) to write the frontend.
Basic Video Feed
Consider the following piece of code.
// other imports
import { Video } from "@signalwire-community/react";
export default function Room() {
const router = useRouter();
const { roomName, userName, mod } = router.query;
const [roomSession, setRoomSession] = useState<any>();
const { data: token } = useSWRImmutable(
roomName !== undefined
? `/api/token?room_name=${roomName}&user_name=${userName}&mod=${mod}`
: null
);
if (!router.isReady) {
return <div>Loading</div>;
}
if (roomName === undefined || roomName === "undefined") return <div>Error</div>;
return (
<Container>
{token?.token && (
<Video
token={token.token}
onRoomReady={(r) => {
setRoomSession(r);
}}
onRoomLeft={() => router.push("/")}
/>
)}
</Container>
);
}
A few things to note about this code are:
-
Next.js router places it at
/rooms/[roomName]
whereroomName
can be any URL-safe string. So/rooms/guest
should take you to the guest room automatically. The dynamicroomName
parameter is accessible atuseRouter().query.roomName
. TheuserName
andmod
parameters should come from the URL query string (/rooms/guest?userName=user&mod=false
) -
We are using the immutable variant of the swr library to load the token. The React hook
useSWRImmutable
sends a GET request to/api/token
just once after it is instantiated. we made/api/token
in the previous section. We are using swr for convenience here, but you are free to use any way toHTTP GET /api/token
. -
The
<Video />
component from@signalwire-community/react
is supplied the token from the backend. It uses the token to connect to the video feed for the room, and it asks for permission to access camera and microphone from the user. With this component alone, you should be able to video conference in the room by just navigating tolocalhost:3000/rooms/guest
in multiple tabs.