This dataset contains .pcap files collected during the execution of variant calling on large number of human genomes using a cluster. The GATK4 variant calling pipeline was executed using AVAH  in two testbeds, CloudLab and FABRIC. A 16-node cluster was used on CloudLab, and an 8-node cluster was used on FABRIC. The files were collected by running tcpdump on the network interfaces of the nodes.

Dataset Files

You must be an IEEE Dataport Subscriber to access these files. Subscribe now or login.

[1] Manas Das, Khawar Shehzad, Praveen Rao, "A Dataset of Network Traffic Collected During Large-Scale Human Genome Sequence Analysis", IEEE Dataport, 2023. [Online]. Available: http://dx.doi.org/10.21227/y0t5-1w13. Accessed: Jul. 14, 2024.
@data{y0t5-1w13-23,
doi = {10.21227/y0t5-1w13},
url = {http://dx.doi.org/10.21227/y0t5-1w13},
author = {Manas Das; Khawar Shehzad; Praveen Rao },
publisher = {IEEE Dataport},
title = {A Dataset of Network Traffic Collected During Large-Scale Human Genome Sequence Analysis},
year = {2023} }
TY - DATA
T1 - A Dataset of Network Traffic Collected During Large-Scale Human Genome Sequence Analysis
AU - Manas Das; Khawar Shehzad; Praveen Rao
PY - 2023
PB - IEEE Dataport
UR - 10.21227/y0t5-1w13
ER -
Manas Das, Khawar Shehzad, Praveen Rao. (2023). A Dataset of Network Traffic Collected During Large-Scale Human Genome Sequence Analysis. IEEE Dataport. http://dx.doi.org/10.21227/y0t5-1w13
Manas Das, Khawar Shehzad, Praveen Rao, 2023. A Dataset of Network Traffic Collected During Large-Scale Human Genome Sequence Analysis. Available at: http://dx.doi.org/10.21227/y0t5-1w13.
Manas Das, Khawar Shehzad, Praveen Rao. (2023). "A Dataset of Network Traffic Collected During Large-Scale Human Genome Sequence Analysis." Web.
1. Manas Das, Khawar Shehzad, Praveen Rao. A Dataset of Network Traffic Collected During Large-Scale Human Genome Sequence Analysis [Internet]. IEEE Dataport; 2023. Available from : http://dx.doi.org/10.21227/y0t5-1w13
Manas Das, Khawar Shehzad, Praveen Rao. "A Dataset of Network Traffic Collected During Large-Scale Human Genome Sequence Analysis." doi: 10.21227/y0t5-1w13