Librem 13v2 PureOS kernel runs out of memory with make -j
Open, Needs TriagePublic

Description

I tried to build qemu on an up-to-date PureOS system (last dist-upgrade issued on Feb 18) with 'make -j' which should chose the right number of cores to maximize the build but the computer completely freezes with no access to interrupts within one minute.

$ uname -a
Linux fenrir 4.14.0-3-amd64 #1 SMP Debian 4.14.13-1 (2018-01-14) x86_64 GNU/Linux

In syslog, we see the issue starts with Out of Memory:

Feb 20 18:19:31 fenrir kernel: [ 739.742057] Out of memory: Kill process 2316 (evolution-calen) score 2 or sacrifice child
Feb 20 18:19:31 fenrir kernel: [ 739.742082] Killed process 2316 (evolution-calen) total-vm:906756kB, anon-rss:0kB, file-rss:0kB, shmem-rss:0kB
Feb 20 18:19:31 fenrir kernel: [ 745.885385] warn_alloc: 543 callbacks suppressed
Feb 20 18:19:31 fenrir kernel: [ 745.885393] cc1: page allocation stalls for 34244ms, order:0, mode:0x14201ca(GFP_HIGHUSER_MOVABLE|GFP_COLD), nodemask=(null)
Feb 20 18:19:31 fenrir kernel: [ 745.885398] cc1 cpuset=/ mems_allowed=0
Feb 20 18:19:31 fenrir kernel: [ 745.885402] CPU: 1 PID: 17428 Comm: cc1 Tainted: G W 4.14.0-3-amd64 #1 Debian 4.14.13-1
Feb 20 18:19:31 fenrir kernel: [ 745.885403] Hardware name: Purism Librem 13 v2/Librem 13 v2, BIOS 4.6-a86d1b-Purism-5 07/27/2017
Feb 20 18:19:31 fenrir kernel: [ 745.885403] Call Trace:
Feb 20 18:19:31 fenrir kernel: [ 745.885410] dump_stack+0x5c/0x85
Feb 20 18:19:31 fenrir kernel: [ 745.885413] warn_alloc+0x114/0x1b0
Feb 20 18:19:31 fenrir kernel: [ 745.885415]
alloc_pages_slowpath+0x85d/0xe00
Feb 20 18:19:31 fenrir kernel: [ 745.885418] alloc_pages_nodemask+0x233/0x250
Feb 20 18:19:31 fenrir kernel: [ 745.885421] filemap_fault+0x1fa/0x5f0
Feb 20 18:19:31 fenrir kernel: [ 745.885423] ? filemap_map_pages+0x31f/0x340
Feb 20 18:19:31 fenrir kernel: [ 745.885438] ext4_filemap_fault+0x2c/0x40 [ext4]
Feb 20 18:19:31 fenrir kernel: [ 745.885441]
do_fault+0x19/0xb0
Feb 20 18:19:31 fenrir kernel: [ 745.885443] handle_mm_fault+0xc2f/0x10f0
Feb 20 18:19:31 fenrir kernel: [ 745.885445] handle_mm_fault+0xaa/0x1f0
Feb 20 18:19:31 fenrir kernel: [ 745.885447]
do_page_fault+0x25d/0x4e0
Feb 20 18:19:31 fenrir kernel: [ 745.885450] ? page_fault+0x36/0x60
Feb 20 18:19:31 fenrir kernel: [ 745.885451] page_fault+0x4c/0x60
Feb 20 18:19:31 fenrir kernel: [ 745.885453] RIP: 0033:0x60dd60
Feb 20 18:19:31 fenrir kernel: [ 745.885454] RSP: 002b:00007ffe840817c8 EFLAGS: 00010297

There are a lot of call traces that repeat with the final two lines (RIP and RSP) being slightly different. I have the complete syslog failure available upon request (just too large to paste here).

To reproduce this issue:

  1. checkout a large project:

$ git clone git://git.qemu.org/qemu.git

  1. try to build it

$ make -j

  1. see the system freeze
This comment was removed by heather.ellsworth.

Adding the full syslog error containing all of the call traces{F44013}

Add Comment