Accession Number : AD1046497


Title :   Entropy based file type identification and partitioning


Descriptive Note : Technical Report


Corporate Author : Naval Postgraduate School Monterey United States


Personal Author(s) : Paul,Calvin B


Full Text : https://apps.dtic.mil/dtic/tr/fulltext/u2/1046497.pdf


Report Date : 01 Jun 2017


Pagination or Media Count : 107


Abstract : The need for file identification and partitioning in the digital forensic, reverse engineering, and security analyst fields cannot be overstated. In this research, we investigate the use of the Shannon entropy profile derived from the file expressed in byte format to characterize specific file types and identify file segments based on entropy-level changes. The process consists of two stages. In the first stage, a binary representation of the file is partitioned into chunks of fixed-length data bytes and processed to extract the entropy profile. In the second stage, the detrended fluctuation analysis (DFA) method is applied to determine the level of structure in the entropy profile. The Haar continuous wavelet transform (CWT) is then used to partition the files identified as highly structured into areas of distinct changes in entropy level. Experimental results show that the proposed approach is effective in identifying file types and partitioning in segments of different entropy levels.


Descriptors :   entropy , WAVELET TRANSFORMS , identification , STATISTICAL ANALYSIS , DATA ENCRYPTION , FOURIER TRANSFORMATION , cYBERATTACKS , computer security


Subject Categories : Computer Systems Management and Standards


Distribution Statement : APPROVED FOR PUBLIC RELEASE