This is the mail archive of the binutils@sourceware.org mailing list for the binutils project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Fuzzing objdump (PR 17512) and readelf (PR 17531)

From: Alexander Cherepanov <cherepan at mccme dot ru>
To: binutils at sourceware dot org
Cc: oss-security at lists dot openwall dot com
Date: Fri, 07 Nov 2014 07:43:27 +0300
Subject: Fuzzing objdump (PR 17512) and readelf (PR 17531)
Authentication-results: sourceware.org; auth=none

Hi!

I was privately asked how I fuzzed objdump in PR 17512 and I figured itcould be interesting to others too. So here it is.


Short version: I used the most naive way.

Longer version: I started with the most simple approach I could getresults with and improved it only a little bit so far. There was just noneed for improvements -- until recently I was getting more crashes thanI can analyze (i.e. run through valgrind:-). Thanks to the excellentwork of Nick Clifton, crashes are harder to get now. But there is a longway to go.

Hardware resources: one several years old desktop. Previous batches ofcrashes required from minutes to half an hour to get. The last one isthe result of one night.

binutils was built like `./configure && make`. Using address-sanitizerwould probably improve the process.

Commands fuzzed: `objdump -x $file` in PR 17512[1] and `readelf -a$file` in PR 17531. Someone more familiar with binutils could choose acommand involving more parsing and hence improve code coverage.


[1] https://sourceware.org/bugzilla/show_bug.cgi?id=17512
[2] https://sourceware.org/bugzilla/show_bug.cgi?id=17531

Fuzzer: zzuf with more-or-less default options. It was used like this:

  zzuf -s 0:1000000 -c -C 0 -q -T 5 -M 100 -j 4 objdump -x "$file" 2> log

Options:

  -s 0:1000000 -- seeds to try, change as you wish
  -c           -- only fuzz files specified in command line (just in case)
  -C 0         -- don't stop after the first crash
  -q           -- suppress output from objdump
  -T 5         -- limit cputime to not hang on infinite loops
  -M 100       -- limit memory to not eat all of it
  -j 4         -- number of simultaneous jobs

Another option of interest is -r -- ratio of changed bits.

After you get a crash like this:

  zzuf[s=1448,r=0.004]: signal 11 (SIGSEGV)

you can get a fuzzed sample with the following command:

  zzuf -s 1448 -r 0.004 < "$file" > fuzzed-file

In fact, I run zzuf from a script against different samples in batchesof 10K seeds, then run random selection from found crashes undervalgrind and collect unique errors. When crashes are rare it's possibleto run all of them through valgrind. This part of the process is alsoquite naive but it you are fixing crashes as you find them it's notneeded at all.

Samples: I started with clam.exe from ClamAV, then used 'main() { return0; }' compiled in different ways and then switched to samples fromhttps://github.com/radare/radare2-regressions . Looking at code coverageand optimizing the set of samples could improve the process. My uploadedsamples named as (\d+)-(0.004) are clam.exe zzufed with seed $1 andratio $2, samples named as (\d+)-(\d+)-(0.004) are fromradare2-regressions zzufed in the same way. If someone is interested Ican provide a precise list.

License -- short version: AFAICT clam.exe is under GPLv2 and "wasentirely written by hand using HIEW"[1], radare2-regressions is underGPLv3+. Feel free to use in testsuites etc.


[1] http://lurker.clamav.net/message/20071225.173819.0255a1c0.en.html

--
Alexander Cherepanov

Follow-Ups:
- Re: Fuzzing objdump (PR 17512) and readelf (PR 17531)
  - From: Yury Gribov

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]