Conveners
High Performance Computing in Science
- Florin Bogdan MANOLACHE (Carnegie Mellon University)
In modern industrial environments, understanding how data flows through complex ETL pipelines is critical for traceability, auditing, and compliance. While traditional lineage tracking tools rely on static metadata or log-based introspection, they often lack semantic expressiveness and offer limited support for automated validation.
This work presents a semantic-aware framework for ETL data...
Large datasets can rarely be presented or used in real time without significantly reducing their size. This paper discusses models of trimming timestamped event datasets while keeping the loss of information to a minimum. The presentation goes gradually from independent event models,
where trimming of events does not change the order of the information contribution of the other events, to...
The structure and usage scenarios of a software package for trimming datasets while having minimum information loss are described. Several information models applied to a large dataset generated by an enterprise information system are analyzed. Different strategies and procedures are compared to obtain the best compromise between computing time and information retention. A set of data...
This paper presents a streamlined and effective methodology for integrating generative artificial intelligence (AI) chatbots into educational and research activities focused on computer networks. The proposed approach leverages the capabilities of generative AI to assist in each phase of a typical network analysis workflow: selecting appropriate software tools, generating and capturing network...
The continuous growth of the space industry and the increasing demand for satellite data across various sectors highlight the need for accessible and user-friendly data integration platforms. However, despite the availability of large volumes of open satellite data, significant barriers remain in making this data accessible to the general public, educators, and non-expert users. This research...