Introduction
This wiki explain the current design and implementation of tracking and monitoring collections. The challenges we have at scale and the proposed design to handle them.
Background & Problem Statement
The sunbird platform supports collection tracking and monitoring. It uses the below APIs to capture the content tracking data, generates progress and score metrics and provide the summary.
Content State Update API - To capture content progress and submit assessment.
Content State Read API - To read the individual content consumption and assessment attempts status.
Enrolment List API - To access all the enrolment metrics of a given user.
The content state update API capture the content progress and assessment data. It generate events for score and overall progress computation by activity-aggregator and assessment-aggregator jobs.
We have a single API (Content State Update) to capture all the tracking information. So, it has a complex logic to identify the given input is for content progress or assessment submission and etc,.
Key Design Problems:
Single API to capture all the tracking data.
Read after write of consumption data and basic summary.
Design
Viewing Service:
Viewing Service collects the “content view updates” and generate events to process and provide summary to the users.
When a user starts viewing a content, a view entry created. There are three stages when a user view the content. They are start, progress and end. Considering these three stages we have 3 API endpoints to capture this information for each stage.
An event will be generated when a content view ends. The summary computation jobs will read these event to process and compute the overall summary of the collection.
The computed summary will be available from API interface to download and view.
Summary Computation Jobs - Flink:
The Flink jobs are used to read and compute the summary of a collection consumption progress when the user view ends. It also computes the score for the current view and best score using all the previous views.
Info |
---|
The event is just a trigger to initiate the computation of the collection progress. The job fetches the raw data from DB to compute the overall progress. When an assessment type content (Ex: QuestionSet) view ends, it expects the ASSESS events data to assessment submit API for score metrics computation.
|
Once the view ends, the progress and score will be updated asynchronously by the flink jobs.
Image ModifiedViewer-Service - Content Consumption Scenarios:
The user can consume a content by searching it in our platform (organically) or via a collection when the user enrolled to a course.
With Viewer-Service, we will support tracking individual content consumption also. Below details explain how the data will be stored for a content consumption in different scenarios.
User | Accessing Content | Table - user_content_consumption |
---|
A | <By enrolling to Collection> | |
A | <By using platform search> | userId: A collectionId: single-digit-addition batchId: single-digit-addition contentId: single-digit-addition
|
Content View Lifecycle:
When the user view the content in context of a collection and batch, for the first time its start, progress update and end triggers are processed. Revisit (2nd - nth view) of the content will be ignored to process and update the DB.
Note |
---|
Shall we enable force ‘view end’ to handle the collection progress update sync issues? |
Handling collection and batch dependencies:
For view start, end and update, courseId and batchId are non-mandatory. This would enable to track the progress for any content which is not part of a course.
This is handled in two ways:
If, collectionId and batchId are part of the request, then, individual content progress and overall collection progress is captured and computed.
Handling Collection Data types in DB:
With normal collection types, the map values gets distributed to multiple sstables with append, which might lead to read latency issues
To the handle the scenario, will consider the frozen collection types, which will helpful in tombstone and avoid multiple sstable reads
V1 vs V2 APIs:
We need to continue supporting the current APIs(v1) before deprecate and delete. So, it requires to work with both the APIs with backward compatibility.
Enhance V1 APIs to read summary from aggregate table.
Enhance the below APIs to read the progress and score metrics from user_activity_agg
table.
Enrolment List API.
Content State Read API.
One time Data migration:
The content status
and score
metrics data should be updated to user_activity_agg
table from user_enrolemnts
and assessment_agg
table for all the existing enrolment records.
API Spec
Expand |
---|
title | POST - /v2/view/start |
---|
|
Request: Code Block |
---|
| {
"request": {
"userId": "{{userId}}",
"collectionId" : "{{collectionId}}",
"batchId": "{{batchId}}",
"contentId": "{{contentId}}"
}
} |
Response: Code Block |
---|
| {
"id": "api.view.start",
"ver": "v2",
"ts": "2021-06-23 05:37:40:575+0000",
"params": {
"resmsgid": null,
"msgid": "5e763bc2-b072-440d-916e-da787881b1b9",
"err": null,
"status": "success",
"errmsg": null
},
"responseCode": "OK",
"result": {
"{{contentId}}": "Progress started"
}
}
|
|
Expand |
---|
title | POST - /v2/view/update |
---|
|
Request: Code Block |
---|
| {
"request": {
"userId": "{{userId}}",
"collectionId" : "{{collectionId}}",
"batchId": "{{batchId}}",
"contentId": "{{contentId}}",
"progress": 34
}
} |
Response: Code Block |
---|
| 200 OK:
{
"id": "api.view.update",
"ver": "v2",
"ts": "2021-06-23 05:37:40:575+0000",
"params": {
"resmsgid": null,
"msgid": "5e763bc2-b072-440d-916e-da787881b1b9",
"err": null,
"status": "success",
"errmsg": null
},
"responseCode": "OK",
"result": {
"{{contentId}}": "SUCCESS"
}
}
4XX or 5XX Error:
{
"id": "api.view.update",
"ver": "v2",
"ts": "2021-06-23 05:37:40:575+0000",
"params": {
"resmsgid": null,
"msgid": "5e763bc2-b072-440d-916e-da787881b1b9",
"err": ERR_Error_Code,
"status": "failed",
"errmsg": ERR_error_msg
},
"responseCode": "BAD_REQUEST"/"SERVER_ERROR",
"result": {
}
} |
|
Expand |
---|
title | POST - /v2/view/assess |
---|
|
Request: Code Block |
---|
| {
"request": {
"userId": "{{userId}}",
"collectionId" : "{{collectionId}}",
"batchId": "{{batchId}}",
"contentId": "{{contentId}}",
"assessments": [{
{{assess_event}} //Mandatory for self-assess contents
}]
}
} |
Response: Code Block |
---|
{
"id": "api.view.assess",
"ver": "v2",
"ts": "2021-06-23 05:37:40:575+0000",
"params": {
"resmsgid": null,
"msgid": "5e763bc2-b072-440d-916e-da787881b1b9",
"err": null,
"status": "success",
"errmsg": null
},
"responseCode": "OK",
"result": {
"{{contentId}}": "SUCCESS"
}
} |
|
Expand |
---|
|
Request: Code Block |
---|
| {
"request": {
"userId": "{{userId}}",
"collectionId" : "{{collectionId}}",
"batchId": "{{batchId}}",
"contentId": "{{contentId}}"
}
} |
Response: Code Block |
---|
| {
"id": "api.view.end",
"ver": "v2",
"ts": "2021-06-23 05:37:40:575+0000",
"params": {
"resmsgid": null,
"msgid": "5e763bc2-b072-440d-916e-da787881b1b9",
"err": null,
"status": "success",
"errmsg": null
},
"responseCode": "OK",
"result": {
"{{contentId}}": "Progress ended"
}
}
|
|
Expand |
---|
title | POST - /v2/view/read |
---|
|
Request: Code Block |
---|
| {
"request": {
"userId": "{{userId}}",
"contentId": ["do_123", "do_1234"],
"collectionId" : "{{collectionId}}", //optional
"batchId": "{{batchId}}" // optional
}
} |
Response: Code Block |
---|
| {
"id": "api.view.read",
"ver": "v2",
"ts": "2021-06-23 05:37:40:575+0000",
"params": {
"resmsgid": null,
"msgid": "5e763bc2-b072-440d-916e-da787881b1b9",
"err": null,
"status": "success",
"errmsg": null
},
"responseCode": "OK",
"result": {
"userId": "{{userId}}",
"collectionId": "{{collectionId}}",
"batchId": "{{batchId}}",
"contents": [{
"identifier": "{contentId}",
"progress": 45,
"score": {{best_score}},
"max_score": {{max_score}}
}]
}
}
|
|
Viewer Summary - All enrolments
Expand |
---|
title | GET - /v2/viewer/summary/list/:userId |
---|
|
Response: Code Block |
---|
| {
"id": "api.viewer.summary.list",
"ver": "v2",
"ts": "2021-06-23 05:59:54:984+0000",
"params": {
"resmsgid": null,
"msgid": "95e4942d-cbe8-477d-aebd-ad8e6de4bfc8",
"err": null,
"status": "success",
"errmsg": null
},
"responseCode": "OK",
"result": {
"summary": [
{
"userId": "{{userId}}",
"collectionId": "{{collectionId}}",
"batchId": "{{batchId}}",
"enrolledDate": 1624275377301,
"active": true,
"contentStatus": {
"{{contentId}}": {{status}}
},
"assessmentStatus": {
"assessmentId": {
"score": {{best_score}},
"max_score": {{max_score}}
}
},
"collection": {
"identifier": "{{collectionId}}",
"name": "{{collectionName}}",
"logo": "{{logo Url}}",
"leafNodesCount": {{leafNodeCount}},
"description": "{{description}}"
},
"issuedCertificates": [{
"name": "{{certName}}",
"id": "certificateId",
"token": "{{certToken}}",
"lastIssuedOn": "{{lastIssuedOn}}"
}],
"completedOn": {{completion_date}},
"progress": {{progress}},
"status": {{status}}
}
]
}
}
|
|
Viewer Summary - Specific enrolment
Expand |
---|
title | POST - /v2/viewer/summary/read |
---|
|
Request: Code Block |
---|
{
"request": {
"userId": "{{userId}}",
"collectionId" : "{{collectionId}}",
"batchId": "{{batchId}}"
}
} |
Response: Code Block |
---|
| {
"id": "api.viewer.summary.read",
"ver": "v2",
"ts": "2021-06-23 05:59:54:984+0000",
"params": {
"resmsgid": null,
"msgid": "95e4942d-cbe8-477d-aebd-ad8e6de4bfc8",
"err": null,
"status": "success",
"errmsg": null
},
"responseCode": "OK",
"result": {
"userId": "{{userId}}",
"collectionId": "{{collectionId}}",
"batchId": "{{batchId}}",
"enrolledDate": 1624275377301,
"active": true,
"contentStatus": {
"{{contentId}}": {{status}}
},
"assessmentStatus": {
"assessmentId": {
"score": {{best_score}},
"max_score": {{max_score}}
}
},
"collection": {
"identifier": "{{collectionId}}",
"name": "{{collectionName}}",
"logo": "{{logo Url}}",
"leafNodesCount": {{leafNodeCount}},
"description": "{{description}}"
},
"issuedCertificates": [{
"name": "{{certName}}",
"id": "certificateId",
"token": "{{certToken}}",
"lastIssuedOn": "{{lastIssuedOn}}"
}],
"completedOn": {{completion_date}},
"progress": {{progress}},
"status": {{status}}
}
}
|
|
Expand |
---|
title | DELETE - /v2/viewer/summary/delete/:userId?all - To Delete all enrolments |
---|
|
Response: Code Block |
---|
| Response:
{
"id": "api.viewer.summary.delete",
"ver": "v2",
"ts": "2021-06-23 05:37:40:575+0000",
"params": {
"resmsgid": null,
"msgid": "5e763bc2-b072-440d-916e-da787881b1b9",
"err": null,
"status": "success",
"errmsg": null
},
"responseCode": "OK",
"result": {}
}
|
|
Expand |
---|
title | DELETE - /v2/viewer/summary/delete/:userId - To Delete specific enrolments |
---|
|
Request: Code Block |
---|
{
"request": {
"userId": "{{userId}}",
"collectionId" : "{{collectionId}}",
"batchId": "{{batchId}}"
}
} |
Response: Code Block |
---|
| Response:
{
"id": "api.viewer.summary.delete",
"ver": "v2",
"ts": "2021-06-23 05:37:40:575+0000",
"params": {
"resmsgid": null,
"msgid": "5e763bc2-b072-440d-916e-da787881b1b9",
"err": null,
"status": "success",
"errmsg": null
},
"responseCode": "OK",
"result": {}
}
|
|
Expand |
---|
title | GET - /v2/viewer/summary/download/:userId?format=csv |
---|
|
Response: Code Block |
---|
{
"id": "api.viewer.summary.download",
"ver": "v2",
"ts": "2021-06-23 05:37:40:575+0000",
"params": {
"resmsgid": null,
"msgid": "5e763bc2-b072-440d-916e-da787881b1b9",
"err": null,
"status": "success",
"errmsg": null
},
"responseCode": "OK",
"result": {
"url": "{{userId}}_viewer_summary.csv"
}
} |
|
Expand |
---|
title | GET - /v2/viewer/summary/download/:userId |
---|
|
Response: Code Block |
---|
{
"id": "api.viewer.summary.download",
"ver": "v2",
"ts": "2021-06-23 05:37:40:575+0000",
"params": {
"resmsgid": null,
"msgid": "5e763bc2-b072-440d-916e-da787881b1b9",
"err": null,
"status": "success",
"errmsg": null
},
"responseCode": "OK",
"result": {
"url": "{{userId}}_viewer_summary.json"
}
} |
|
Conclusion:
<TODO>