Captured network traces
The following are lists of proxy traffic traces and unobfuscated (regular) traffic traces captured for testing and evaluating CovertMark during the project. All traffic was captured under realistic internet browsing conditions on the modern web, with the browsing carried out by real humans. This is the most significant difference between the CovertMark datasets and the datasets used in prior research on detecting protocol obfuscation.
Packet source and destination addresses have been scrambled with Crypto-PAn, primarily to protect the exact addresses of Tor PT bridges in the proxy traces. Regular traffic, captured as controls and negative training data, was produced by human volunteers under experimental conditions, with detailed instructions given to protect their privacy. All unencrypted and non-TCP traffic has also been stripped out. However, in contrast with CAIDA traces, all encrypted payloads have been preserved, along with some cleartext metadata such as TLS SNI hostnames. This allows CovertMark to examine encrypted network traffic from the perspective of a state censor performing advanced DPI to detect proxy servers.
Click on the file names below to download associated traces.
All proxy traces are unaffected by TCP segmentation offload (TSO), with longer-than-MTU payloads fully segmented.
|File Name||Packets||IP(s) of Proxy Clients||IP(s) of Proxy Servers||Port(s) of Proxy Servers|
|shadowsocks1_anon||674458||22.214.171.124, 126.96.36.199||188.8.131.52, 184.108.40.206||443, 995|
|meek1_anon||756548||220.127.116.11, 18.104.22.168||22.214.171.124, 126.96.36.199||443|
|obfs4-1_anon||619754||188.8.131.52, 184.108.40.206||220.127.116.11, 18.104.22.168||443, 38224|
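As a minimal sketch of how the client and server columns above might be used to label packet direction when preparing a proxy trace, the following uses the addresses and ports listed for shadowsocks1_anon. The function name `label_packet` and its return strings are hypothetical, chosen for illustration; this is not CovertMark's own API.

```python
from ipaddress import ip_address

# Values copied from the shadowsocks1_anon row of the table above.
PROXY_CLIENTS = {ip_address("22.214.171.124"), ip_address("126.96.36.199")}
PROXY_SERVERS = {ip_address("188.8.131.52"), ip_address("184.108.40.206")}
PROXY_PORTS = {443, 995}


def label_packet(src, sport, dst, dport,
                 clients=PROXY_CLIENTS, servers=PROXY_SERVERS, ports=PROXY_PORTS):
    """Label one TCP packet by direction (hypothetical helper).

    Returns "client_to_proxy", "proxy_to_client", or None when the packet
    does not match a listed client/server/port combination.
    """
    s, d = ip_address(src), ip_address(dst)
    if s in clients and d in servers and dport in ports:
        return "client_to_proxy"
    if s in servers and d in clients and sport in ports:
        return "proxy_to_client"
    return None


# Usage: a client packet sent to one of the listed proxy servers on port 443.
direction = label_packet("22.214.171.124", 51000, "188.8.131.52", 443)
```

Matching on the server port as well as the addresses keeps unrelated connections between the same hosts out of the labelled set; whether that matters depends on how a given trace was captured.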
Negative / Regular Traffic Traces
The longer, multi-client cantab_anon.pcap was captured on machines with TCP segmentation offloading enabled due to technical limitations, which means that longer-than-MTU payloads will appear unsegmented. This does not, however, affect CovertMark’s default detection strategies. lso_anon.pcap is unaffected, with all longer-than-MTU payloads properly segmented.
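Whether a given capture shows this effect can be checked by looking for frames larger than the link MTU. The sketch below is an illustration only, under several assumptions not stated in the original: the file is in the classic pcap format (not pcapng) with an Ethernet link type, frames carry no VLAN tags, and the MTU is 1500 bytes. The function name `count_oversize_ipv4` is hypothetical.

```python
import struct

ETH_HEADER_LEN = 14  # untagged Ethernet II header (assumption: no VLAN tags)


def count_oversize_ipv4(pcap_bytes, mtu=1500):
    """Return (ipv4_frames, oversize_frames) for a classic pcap file.

    A frame counts as oversize when its original on-wire length, minus the
    Ethernet header, exceeds `mtu`. This is what unsegmented payloads
    captured before segmentation offload would look like.
    """
    magic = pcap_bytes[:4]
    if magic == b"\xd4\xc3\xb2\xa1":
        endian = "<"  # file written in little-endian byte order
    elif magic == b"\xa1\xb2\xc3\xd4":
        endian = ">"  # file written in big-endian byte order
    else:
        raise ValueError("not a classic microsecond-resolution pcap file")

    (linktype,) = struct.unpack(endian + "I", pcap_bytes[20:24])
    if linktype != 1:
        raise ValueError("only Ethernet captures (link type 1) are handled")

    total = oversize = 0
    offset = 24  # the global header is 24 bytes long
    while offset + 16 <= len(pcap_bytes):
        # Per-packet record header: ts_sec, ts_usec, incl_len, orig_len.
        _, _, incl_len, orig_len = struct.unpack(
            endian + "IIII", pcap_bytes[offset:offset + 16])
        frame = pcap_bytes[offset + 16:offset + 16 + incl_len]
        offset += 16 + incl_len
        # Skip truncated frames and anything that is not IPv4 (EtherType 0x0800).
        if len(frame) < ETH_HEADER_LEN or frame[12:14] != b"\x08\x00":
            continue
        total += 1
        if orig_len - ETH_HEADER_LEN > mtu:
            oversize += 1
    return total, oversize


# Usage (the path is a placeholder for a downloaded trace):
# with open("cantab_anon.pcap", "rb") as f:
#     print(count_oversize_ipv4(f.read()))
```

If the description above holds and these assumptions match the files, the second value would be expected to be non-zero for cantab_anon.pcap and zero for lso_anon.pcap. The sketch reads the whole file into memory, so a streaming reader would be preferable for large traces.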
|File Name||Packets||Concurrent Users||Client IPs/Subnets|