Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Introduction:

  • This document describes the enhanced/derived Device and Extended User Profile schemas, design of how the profile data is captured and how it can be used across various systems.

Design Flow:

Image RemovedImage Added

Schema Design for

...

User and Device Profile

Code Block
languagesql
themeRDark
borderStylesolid
linenumberstrue
#Schema of User Profile

TABLE user_db.user_profile (
   user_id text,
   user_role text,
   user_type text,
   sign_in_type text,
   user_state text,
   user_district text,
   user_block text,
   first_access timestamp,
   last_access timestamp,
   device_ids list<text>,
   last_updated_by text,
   last_content_played text,
   products_list list<text>,
   last_device_accessed text,
   total_ts bigint,
   PRIMARY KEY (userid)
);
Code Block
languagesql
themeRDark
borderStylesolid
linenumberstrue
collapsetrue
#Schema of Device Profile

TABLE device_db.device_profile (
    device_id text,
    avg_ts double,
    city text,
    country text,
    country_code text,
    device_spec map<text, text>,
    district_custom text,
    first_access timestamp,
    last_access timestamp,
    state text,
    state_code text,
    state_code_custom text,
    state_custom text,
    total_launches bigint,
    total_ts double,
    uaspec map<text, text>,
    updated_date timestamp,
    last_updated_by text,
    products_list list<text>,
    user_count bigint,
    producer_id text,
    producer_ver text,
    fcm_token text,
   PRIMARY KEY (device_id)
);

Computation Table:

...

Field

...

Type

...

Computation Table:

Device Profile -


Field

Type

Description

Source

Release
1device_idString

Device

Unique id of device

From first source which inserts a record

1.11
2

total_ts

Double

Device

Total Time spent on device till date

Device profile updater data product

1.11
3

total_launches

Long

Device

Number app/portal launches from the device

Device profile updater data product

1.11
4

avg_ts

Double

Device

Average time spent on device

Device profile updater data product

1.11
5

first_access

Timestamp

Device

Timestamp on which device was first accessed

Device profile updater data product

1.11
6

last_access

Timestamp

Device

Timestamp on which device was last accessed

Device profile updater data product

1.11
7

state

String

Device

State from which device is accessed

Device Register API

1.12
8

state_code

String

Device

Device

state code

Device Register API

1.14
9

state_code_custom

String

Device

Device

custom state code

Device Register API

1.14
10

state_custom

String

Device

Device

custom state name

Device Register API

1.14
11

city

String

Device

City from which device is accessed

Device Register API

1.12
12

country

String

Device

Country from which device is accessed

Device Register API

1.14
13

country_code

String

Device

Device

country code

Device Register API

1.14
14

district_custom

String

Device

Device custom district name

Device Register API

1.14
15

device_spec

Map<String,String>

Device

Device Specification

Device Register API

1.12
16

uaspec

Map<String,String>

Device

Device user agent data

Device Register API

1.12
17

fcm_token

String

Device

FCM token for app devices

Device Register API

2.1.0
18

producer_id

String

Device

Producer id for which device belongs to(App/Portal)

Device Register API

2.1.0
19location_updated_dateTimestamplast location details updated dateDevice Register APITBD
20

producer_ver

String

Device

Version of producer

Device profile updater data product

TBD
21

user_ids

List<String>

User

List of unique users accessed from the device

Device profile updater data product

TBD
22

user_count

Long

Device

Unique users accessed from the device

Device profile updater data product

TBD
23

last_updated_by

String

Device

Source from which record was last updated

Source which is updating the record

products_list

TBD
24

sources

List<String>

Device

List of sources from which record gets updated

From all the sources

TBD
25

updated_date

Timestamp

Last updated date by the source

From all sources1.12


User Profile -


FieldTypeDescriptionSourceRelease
1user_idString

User

Unique id
for user
of the userFrom first source which inserts a record2.2.0
2

user_type

String

User

Type of user(Anonymous/SelfSignedIn/ValidatedUser)

User_Org Cassandra via Samza job

2.2.0
3

user_role

String

User

Role of user(Student/Teacher)

AUDIT events via Samza job

2.2.0
4

sign_in_type

String

User

Type of sign-in(GoogleAuth/

SingleSignIn)

Samza job

user

SingleSignon/Login)

AUDIT events via Samza job

2.2.0
5boardList<String>Board the user has selectedINTERACT event via Samza job2.3.0
6subjectList<String>Subject the user has selectedINTERACT event via Samza job2.3.0
7mediumList<String>Medium the user has selectedINTERACT event via Samza job2.3.0
8gradeList<String>Grade the user has selectedINTERACT event via Samza job

2.3.0

9languageStringUser preferred languageINTERACT event via Samza job2.3.0
10profile_stateString
User

State name of the User

user

User_Org Cassandra via Samza job

TBD
11profile_districtString

User

District name of the UserUser
user
_Org Cassandra via Samza jobTBD
12profile_blockString

User

Block name of the UserUser_Org Cassandra via Samza jobTBD
13

first_access

Timestamp

User

Timestamp on which user was first seen

User Profile Updater data product

TBD
14

last_access

Timestamp

User

Timestamp on which user was last seen

User Profile Updater data product

TBD
15

last_device_accessed

String

User

Device id on which user was last seen

User Profile Updater data product

TBD
16

device_ids

List<String>

User

List

Map<String,String>

list of unique devices

on which user was seen

with device_id and producer_id

User Profile Updater data product

TBD
17

last_updated_by

String

User

Source from which record was last updated

Source which is updating the record

products_list

TBD
18

sources

List<String>

User

List of sources from which record gets updated

From all the sources

TBD
19

last_content_played

String

User

Last content played by user

User Profile Updater data product

TBD
20

total_ts

Double

User

Total

overall time spent till date by user

User Profile Updater data product

TBD
21updated_dateTimestampLast updated date by the sourceFrom all sources2.2.0
22qr_scansIntegerTotal scans done by the uservia data product?
23...




Questions:

  • Possibility of switch in user_role for APP users, Is it ok to have multiple values for user_role in the telemetry for same user_id?
  • Can we enhance the existing user table or create new one in analytics database.