Network Data Source | - |
Network Data Labeled | - |
Host Data Source | Events generated from usage of storage solution “hBox” |
Host Data Labeled | No, data generated in production -> no known attacks |
Overall Setting | Enterprise IT |
OS Types | n/a (focus on subsystem “hBox”) |
Number of Machines | n/a (5000 distinct users) |
Total Runtime | 1910 days |
Year of Collection | 2017-2022 |
Attack Categories | n/a |
Benign Activity | Real users |
Packed Size | 640 MB |
Unpacked Size | 14,9 GB |
Download Link | goto |
Overview
CLUE-LDS (CLoud-based User Entity behavior analytics Log Data Set - that acronym really is a stretch but okay)is a dataset generated by real user activity at the premise of an Austrian IT service provider over the span of five years. It is intended to be used for User and Entity Behavior Analytics (UEBA), which tries to detect cases where an attacker accesses a system through legitimate means, for example by using leaked credentials. Since this doesn’t ring any bells for systems designed to detect malware, exploits, etc., the goal is to recognize this “illegal” access by determining if a user suddenly changed its behavior, which is monitored in form of events originating from that user.
Environment
The environment is that of a live production network of an Austrian IT provider, hosting the storage solution “hBox”.
Activity
Activity consists of actual users accessing and using a cloud storage (hBox) over the span of roughly five years, leading to ~50 million events created by more than 5000 distinct users.
Contained Data
The dataset has been anonymized, though the authors state that his has been done in such a way that no information was lost (see section 3.B in paper). Each event contains a user, the associated action and a couple other relevant fields such as location or paths. More details regarding those fields are linked below. Since this was collected in a productive environment, there are no known attacks contained in this dataset.
Papers
Links
Data Examples
Event logs from clue.json
[...]
{
"params": {
"user": "marvellous-amaranth-fowl-accountant"
},
"type": "login_successful",
"time": "2017-07-07T09:05:57Z",
"uid": "marvellous-amaranth-fowl-accountant",
"id": 58,
"uidType": "name"
}
{
"params": {
"path": "/chinese-teal-meerkat-garagemanager/chosen-blue-marmoset-metallurgist/little-scarlet-warbler-reflexologist"
},
"type": "file_accessed",
"time": "2017-07-07T09:06:00Z",
"uid": "marvellous-amaranth-fowl-accountant",
"id": 59,
"uidType": "name"
}
{
"params": {
"path": "/chinese-teal-meerkat-garagemanager/chosen-blue-marmoset-metallurgist/thoughtful-blush-meerkat-producesupervisor"
},
"type": "file_accessed",
"time": "2017-07-07T09:06:00Z",
"uid": "marvellous-amaranth-fowl-accountant",
"id": 60,
"uidType": "name"
}
[...]