This benchmark is from "malloc() Performance in a Multithreaded Linux
Environment":

  http://www.citi.umich.edu/techreports/reports/citi-tr-00-5.pdf

Try the following parameters, first with P = 1 and then with P equal to
the number of processors on your system:

  ./linux-scalability 512 10000000 P
  ./linux-scalability-hoard 512 10000000 P
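
The two runs described above (P = 1, then P = number of processors) can be scripted. The following is a minimal sketch, not part of the benchmark itself. It assumes both binaries are in the current directory and that `getconf _NPROCESSORS_ONLN` reports the processor count on your system. The helper name `bench_cmd` and the variable names `SIZE` and `ITERATIONS` are hypothetical labels chosen for illustration; the text above does not name the first two arguments.

```shell
#!/bin/sh
# Sketch: run both benchmark binaries with P = 1 and P = number of processors.
# SIZE and ITERATIONS are illustrative names for the first two arguments
# (an assumption; the README does not say what they mean).
SIZE=512
ITERATIONS=10000000

# Processor count; falls back to 1 if getconf cannot report it.
NPROC=$(getconf _NPROCESSORS_ONLN 2>/dev/null || echo 1)

# Print the command line for a given binary ($1) and value of P ($2).
bench_cmd() {
    echo "$1 $SIZE $ITERATIONS $2"
}

for p in 1 "$NPROC"; do
    for bin in ./linux-scalability ./linux-scalability-hoard; do
        if [ -x "$bin" ]; then
            echo "== $(bench_cmd "$bin" "$p")"
            "$bin" "$SIZE" "$ITERATIONS" "$p"
        else
            # Binary not built or not in this directory: report and move on.
            echo "skipping $(bench_cmd "$bin" "$p") (binary not found)" >&2
        fi
    done
done
```

On a single-processor machine both passes use P = 1, so each binary simply runs twice.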