This dataset is an extension of MathEDU, tailored for concept-level knowledge tracing tasks. It is designed to predict which specific concepts a student is likely to struggle with in future questions.
The dataset includes the concept annotations associated with each question, as well as the concepts that the student lacked or failed to master on that question.
Each entry in this dataset includes the following fields:
- id: Unique identifier for the problem. Corresponds to the index in the combined MathQA dataset (train + validation + test).
- student_id: The ID of the student who answered the problem.
- associated_concepts: A list of concepts associated with the problem. These represent the knowledge components required to solve the question.
- missing_concepts: A list of concepts that the student failed to demonstrate understanding of when answering the problem. These reflect potential learning gaps.
Each record can be linked back to the corresponding student response in the MathEDU dataset using the id and student_id fields.
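As a minimal sketch of that linkage, the pair (id, student_id) can serve as a composite key. The field names of the MathEDU response records below (answer, correct) are assumptions for illustration, not the actual MathEDU schema:

```python
# Sketch: linking a concept-level record back to a MathEDU student response.
# The MathEDU response fields ("answer", "correct") are assumed for illustration.

concept_records = [
    {"id": 17467, "student_id": 6,
     "associated_concepts": ["Volume Calculation", "Area Calculation"],
     "missing_concepts": ["Volume Calculation"]},
]

mathedu_responses = [
    {"id": 17467, "student_id": 6, "answer": "42 cubic meters", "correct": False},
    {"id": 17467, "student_id": 7, "answer": "40 cubic meters", "correct": True},
]

# Build a lookup keyed on (id, student_id), the shared composite key.
response_by_key = {(r["id"], r["student_id"]): r for r in mathedu_responses}

for rec in concept_records:
    response = response_by_key.get((rec["id"], rec["student_id"]))
    if response is not None:
        print(rec["missing_concepts"], "->", response["answer"])
```

Records with no matching (id, student_id) pair simply produce no lookup hit, so the join is safe to run over the full dataset.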
To access the full content of each question, use the id to match against the combined and indexed MathQA. Here’s how to create the indexed version:
```python
from datasets import load_dataset, concatenate_datasets

# Load the three MathQA splits and concatenate them in order:
# train, then validation, then test.
dataset = load_dataset("json", data_files={
    "train": "MathQA/train.json",
    "validation": "MathQA/dev.json",
    "test": "MathQA/test.json"
})
mathqa = concatenate_datasets([dataset["train"], dataset["validation"], dataset["test"]])

# Add a sequential index column; the `id` field in this dataset refers to it.
mathqa = mathqa.add_column("index", list(range(len(mathqa))))
mathqa.to_json("mathqa_index.json", orient="records", lines=True)
```

Here is an example of an entry in the dataset:
```json
{
    "id": 17467,
    "student_id": 6,
    "associated_concepts": [
        "Volume Calculation",
        "Area Calculation",
        "Unit Conversion"
    ],
    "missing_concepts": [
        "Volume Calculation",
        "Area Calculation"
    ]
}
```
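Given the indexed file, fetching the full question for an entry is a lookup on the index column. A minimal sketch, assuming mathqa_index.json was produced as above; the "Problem" field name is an assumption about the MathQA schema:

```python
import json

# Sketch: resolving an entry's `id` to the full MathQA question.
# The "Problem" field name is an assumption about the MathQA record schema.
entry = {"id": 17467, "student_id": 6,
         "missing_concepts": ["Volume Calculation", "Area Calculation"]}

# Stand-in for reading mathqa_index.json (one JSON record per line).
mathqa_lines = [
    json.dumps({"index": 17466, "Problem": "..."}),
    json.dumps({"index": 17467, "Problem": "A tank measures 2 m by 3 m by 4 m ..."}),
]

# Key each record by its sequential index, then look up the entry's id.
questions = {row["index"]: row for row in map(json.loads, mathqa_lines)}
question = questions[entry["id"]]
print(question["Problem"])
```

In practice the stand-in list would be replaced by iterating over the lines of mathqa_index.json.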