kukudy

Prerequisites

Node.js 18
openvpn

Installation

git clone https://github.com/hy-chou/kukudy.git
cd kukudy/
npm install

The .env file

Kukudy uses Twitch API to interact with Twitch. Create a .env file inside the kukudy/ directory with the following content:

CLIENT_ID="q5tfahmk1wwd5f7hi1jef3fmd3s0f4"
CLIENT_SECRET="ctylihpd07ik133uxeue644y3hcrpc"
ACCESS_TOKEN="fe4lgy6mtrah3s45vaesmh0thc0egx"

CLIENT_ID_GQL="kimne78kx3ncx6brgo4mv6wki5h1ko"

The lines above are example credentials, except for the last line. So, let's get your own credentials.

Client ID and Client Secret

Sign up for a Twitch account, log in and go to Twitch Developers' Console. Click on Register Your Application in the Application section, fill out the form and click Create. You should see your application in the Application section. Click on Manage and you should see your Client ID and Client Secret.

Replace the Client ID and Client Secret in the .env file with yours.

Access Token

As specified in Twitch Developers' Document, to get an access token, run the following command.

curl -X POST 'https://id.twitch.tv/oauth2/token' \
     -F 'grant_type=client_credentials' \
     -F 'client_id=<CLIENT_ID goes here>' \
     -F 'client_secret=<CLIENT_SECRET goes here>'

The reply is in the json format.

{
  "access_token": "0vbuo8rvancxeuvon7k975jf66b5sq",
  "expires_in": 4533330,
  "token_type": "bearer"
}

Replace the Access Token in .env file with yours.

Note that an access token expires in about two months, and if you do not get a new one, every request to Twitch API would return an HTTP error code 401. We recommend getting a new one every month. Check out the Cron guide for a possible solution.

Introductory Tutorials

These tutorials, designed for first-time kukudy users, provides descriptions of the fundamental scripts inside kukudy/.

Before we start, go to the kukudy/ directory and do the following steps:

mkdir playground
cd playground

Let's go!

1. updateStreams.js

node ../updateStreams.js [NUMBER_OF_CHANNELS]

NUMBER_OF_CHANNELS is 100 by default.

The updateStreams.js script uses Twitch API's Get Streams to get a list of active streams, which is in descending order by the number of viewers watching the stream.

<details><summary>data format</summary>

The data is stored in two formats inside two directories, ulgs/ and dump/getStreams/.

ULGS/

Inside ulgs/ are .txt files.

Each filename is the time updateStreams.js starts running in UTC.

Each line is a user login, namely the name of a channel.

DUMPS/REQSTREAMS/

Inside dumps/reqStreams/ are .tsv files.

Each filename is the time updateStreams.js starts running in UTC.

Each line has five tab-separated elements:

timestamp - sent time
timestamp - received time
cursor

HTTP headers (json) - all value are strings

{
  "connection": "",
  "content-type": "",
  "access-control-allow-origin": "",
  "ratelimit-limit": "",
  "ratelimit-remaining": "",
  "ratelimit-reset": "",
  "timing-allow-origin": "",
  "date": "",
  "x-served-by": "",
  "x-cache": "",
  "x-cache-hits": "",
  "x-timer": "",
  "vary": "",
  "strict-transport-security": "",
  "transfer-encoding": ""
}

HTTP body (json) - Get Streams API reference

</details>

2. updateInfo.js

node ../updateInfo.js

The updateInfo.js script reads the latest stream list inside ulgs/ to get information of the edge servers distributing the streams in the list.

<details><summary>data format</summary>

The data is stored in two formats inside three directories, info/, dumps/reqPlaybackAccessToken/ and dumps/reqUsherM3U8/.

INFO/

Inside info/ are .tsv files.

Each filename is the time updateInfo.js starts running in UTC.

Each line has three tab-separated elements:

timestamp
user login

info (json) - all value are strings

{
  "NODE": "video-edge-{six hexes}.{IATA airport code}{two digits}",
  "MANIFEST-NODE-TYPE": "weaver_cluster",
  "MANIFEST-NODE": "video-weaver.{IATA airport code}{two digits}",
  "SUPPRESS": "",
  "SERVER-TIME": "",
  "TRANSCODESTACK": "",
  "USER-IP": "{IPv4}",
  "SERVING-ID": "",
  "CLUSTER": "{IATA airport code}{two digits}",
  "ABS": "",
  "VIDEO-SESSION-ID": "",
  "BROADCAST-ID": "",
  "STREAM-TIME": "",
  "B": "",
  "USER-COUNTRY": "{ISO 3166-1 alpha-2 code}",
  "MANIFEST-CLUSTER": "{IATA airport code}{two digits}",
  "ORIGIN": "{IATA airport code}{two digits}",
  "C": "",
  "D": ""
}

DUMPS/REQPLAYBACKACCESSTOKEN/

Inside the dumps/reqPlaybackAccessToken/ directory are .tsv files.

Each filename is the UTC time shortly after updateInfo.js starts running.

Each line has five tab-separated elements:

timestamp - sent time
timestamp - received time
user login

HTTP headers (json) - all value are strings

{
  "connection": "",
  "content-length": "",
  "content-type": "",
  "access-control-allow-origin": "",
  "date": ""
}

HTTP body (json)

{
  "data": {
    "streamPlaybackAccessToken": {
      "value": "",
      "signature": "",
      "__typename": ""
    }
  },
  "extensions": {
    "durationMilliseconds": "",
    "operationName": "",
    "requestID": ""
  }
}

The data.streamPlaybackAccessToken.value object is a JSON string.

{
  "adblock": false,
  "authorization": {
    "forbidden": false,
    "reason": ""
  },
  "blackout_enabled": false,
  "channel": "",
  "channel_id": 999999999,
  "chansub": {
    "restricted_bitrates": [],
    "view_until": 9999999999
  },
  "ci_gb": false,
  "geoblock_reason": "",
  "device_id": null,
  "expires": 9999999999,
  "extended_history_allowed": false,
  "game": "",
  "hide_ads": false,
  "https_required": true,
  "mature": false,
  "partner": false,
  "platform": "web",
  "player_type": "site",
  "private": {
    "allowed_to_view": true
  },
  "privileged": false,
  "role": "",
  "server_ads": true,
  "show_ads": true,
  "subscriber": false,
  "turbo": false,
  "user_id": null,
  "user_ip": "{IPv4}",
  "version": 2
}

DUMPS/REQUSHERM3U8/

Inside the dumps/reqUsherM3U8/ directory are .tsv files.

Each filename is the UTC time shortly after updateInfo.js starts running.

Each line has five tab-separated elements:

timestamp - sent time
timestamp - received time
user login

HTTP headers (json) - all value are strings

{
  "content-type": "",
  "content-length": "",
  "connection": "",
  "vary": "",
  "date": "",
  "x-amzn-trace-id": "",
  "x-cache": "",
  "via": "",
  "x-amz-cf-pop": "",
  "x-amz-cf-id": ""
}

HTTP body (m3u8)

</details>

Intermediate Tutorial

The scripts below cache the list of active streams for 10 minutes.

scripts/book.sh

bash book.sh DIRECTORY CHANNEL_COUNT

book.sh collects at least CHANNEL_COUNT channels and stores the data inside the DIRECTORY under the kukudy/ directory.

<details><summary>example</summary>
To collect 100 channels and store the data inside the kukudy/playground/ directory, run
bash book.sh playground 100
</details>

Advanced Tutorials

For beginners, read the VPN guide below to set up the envoronment first.

updateInfo.js

node ../updateInfo.js TARGET_COUNTRY

The updateInfo.js script checks if TARGET_COUNTRY is the user country detected by Twitch. Exit status is 0 if they are the same, 1 if different, 2 otherwise.

scripts/bookvpn.sh

sudo bash bookvpn.sh DIRECTORY CHANNEL_COUNT CONFIG_ID...

bookvpn.sh connects to the VPN server(s) with CONFIG_ID(s) consecutively, collects at least CHANNEL_COUNT channels and stores the data inside the DIRECTORY under the kukudy/ directory.

<details><summary>example</summary>
To connect to tw168.nordvpn.com and us9487.nordvpn.com consecutively to collect 100 channels and store the data inside kukudy/playground/, run
sudo bash bookvpn.sh playground 100 tw168 us9487
</details>

scripts/pickybookvpn.sh

sudo bash pickybookvpn.sh DIRECTORY CHANNEL_COUNT CONFIG_ID...

pickybookvpn.sh connects to the VPN server(s) with CONFIG_ID(s) consecutively. In each round, it checks if the user country detected by Twitch is the country of the CONFIG_ID. If they do not match, it reconnects to the same CONFIG_ID and checks again. If it fails to be correctly recognized for seven times, the current CONFIG_ID is skipped. It collects at least CHANNEL_COUNT channels and stores the data inside the DIRECTORY under the kukudy/ directory.

scripts/bookvpnbycountry.sh

sudo bash bookvpnbycountry.sh DIRECTORY CHANNEL_COUNT COUNTRY...

Use ISO 3166-1 alpha-2 codes for COUNTRY....

bookvpnbycountry.sh connects to the VPN server(s) in the COUNTRY(ies) consecutively, collects at least CHANNEL_COUNT channels and stores the data inside the DIRECTORY under the kukudy/ directory.

<details><summary>example</summary>
To connect to VPNs in Taiwan and the United States consecutively to collect 100 channels and store the data inside kukudy/playground/, run
sudo bash bookvpnbycountry.sh playground 100 TW US
</details>

Kukudy

Install / Use

README

kukudy

Prerequisites

Installation

The .env file

Client ID and Client Secret

Access Token

Introductory Tutorials

1. updateStreams.js

ULGS/

DUMPS/REQSTREAMS/

2. updateInfo.js

INFO/

DUMPS/REQPLAYBACKACCESSTOKEN/

DUMPS/REQUSHERM3U8/

Intermediate Tutorial

scripts/book.sh

Advanced Tutorials

updateInfo.js

scripts/bookvpn.sh

scripts/pickybookvpn.sh

scripts/bookvpnbycountry.sh

scripts/bookvpnbycity.sh