Aleksander Adamowski - Blog
Time the syscalls:
strace -c
Sizes and utilization of kernel caches, to see which might need raising limits:
slabtop -o -s -c
Steal in atop is bad
atop
Major page fault counters per process:
ps -e -o maj_flt,comm | sort -n | tail