Projects

PROTEUS

PROTEUS was designed to address fundamental scientific challenges related to the scalability and responsiveness of analytics capabilities. Completed.

ETI

The High Frequency Appliance Disaggregation Analysis project analysed real world data from the ETI's Home Energy Management System in five homes to gather detailed energy data from water, gas and electricity use. Completed.

Extreme-XP

ExtremeXP aims to provide accurate, precise, fit-for-purpose, and trustworthy data-driven insights via evaluating different complex analytics variants, considering end users preferences and feedback in an automated way. Active.


ExtremeXP Use Case: Spam Classification

Classification report

Figure 1. Example classification report.

Confusion matrix

Figure 2. Example confusion matrix.

Selected Features

  • pkts_mean: A packet is a small segment of a larger message. Data sent over networks is divided into packets which are then recombined by the receiving device.
  • bytes_mean: A byte is the basic unit of information in computer storage and processing.
  • mean_duration: Commonly measured by MTTC (Mean Time To Contain), encompassing the time to detect, acknowledge, and fully contain a security incident.
  • udp_ratio: The proportion of network traffic that utilises the User Datagram Protocol (UDP) compared to the Transmission Control Protocol (TCP).
  • conn_ratio: The ratio of connections established to a system compared to the number of legitimate users or devices that should be accessing it.