Koozali.org: home of the SME Server

kernel: Eeek! page_mapcount(page) went negative!

Offline dadou

  • ***
  • 47
  • +0/-0
kernel: Eeek! page_mapcount(page) went negative!
« on: April 22, 2012, 08:07:43 PM »
Hello,

Last night I got a bad crash on our system.

After log-rotate in crontab it just crashed :


Quote
Apr 22 01:12:01 fmcp esmith::event[7374]: S75purge-old-logs=action|Event|logrotate|Action|S75purge-old-logs|Start|1335049921 555497|End|1335049921 640327|Elapsed|0.08483
Apr 22 01:12:01 fmcp esmith::event[7374]: Running event handler: /etc/e-smith/events/actions/adjust-services
Apr 22 01:12:01 fmcp esmith::event[7374]: adjusting supervised httpd-e-smith (sigusr1)
Apr 22 01:12:01 fmcp esmith::event[7374]: adjusting supervised httpd-e-smith (up)
Apr 22 01:12:01 fmcp esmith::event[7374]: adjusting non-supervised syslog (reload)
Apr 22 01:12:01 fmcp kernel: Eeek! page_mapcount(page) went negative! (-1)
Apr 22 01:12:01 fmcp kernel:   page->flags = 4002016c
Apr 22 01:12:01 fmcp kernel:   page->count = 1
Apr 22 01:12:01 fmcp kernel:   page->mapping = d6c8dcf0
Apr 22 01:12:01 fmcp kernel: ------------[ cut here ]------------
Apr 22 01:12:01 fmcp kernel: kernel BUG at mm/rmap.c:508!
Apr 22 01:12:01 fmcp kernel: invalid operand: 0000 [#1]
Apr 22 01:12:01 fmcp kernel: SMP
Apr 22 01:12:01 fmcp kernel: Modules linked in: appletalk(U) r8169 bonding(U) ipt_ULOG ipt_REJECT ipt_MASQUERADE ipt_state ip_nat_ftp ip_conntrack_ftp iptable_mangle iptable_nat ip_conntrack iptable_filter ip_tables loop button battery ac hw_random mii sr_mod dm_snapshot dm_zero dm_mirror ext3 jbd raid1 dm_mod usb_storage uhci_hcd ohci_hcd ehci_hcd ata_piix libata 3w_9xxx sd_mod scsi_mod
Apr 22 01:12:01 fmcp kernel: CPU:    0
Apr 22 01:12:01 fmcp kernel: EIP:    0060:[page_remove_rmap+111/258]    Not tainted VLI
Apr 22 01:12:01 fmcp kernel: EIP:    0060:[<c0155197>]    Not tainted VLI
Apr 22 01:12:01 fmcp kernel: EFLAGS: 00010246   (2.6.9-89.31.1.ELsmp)
Apr 22 01:12:01 fmcp kernel: EIP is at page_remove_rmap+0x6f/0x102
Apr 22 01:12:01 fmcp kernel: eax: 0000001e   ebx: c95fa860   ecx: f3a7eeac   edx: c02f2938
Apr 22 01:12:01 fmcp kernel: esi: 00000001   edi: e0c85000   ebp: fffcf8d8   esp: f3a7eea8
Apr 22 01:12:01 fmcp kernel: ds: 007b   es: 007b   ss: 0068
Apr 22 01:12:01 fmcp kernel: Process httpd (pid: 2154, threadinfo=f3a7e000 task=f6ac4d70)
Apr 22 01:12:01 fmcp kernel: Stack: c02f2938 d6c8dcf0 c02f2921 00000001 085fa860 00000001 c014eb8e 2fd43425
Apr 22 01:12:01 fmcp kernel:        00000004 c95fa860 0000a000 00111000 c969b180 c0120f01 f3a7eee0 00111000
Apr 22 01:12:01 fmcp kernel:        00191000 e0c85008 c969b180 c014ecb0 00080000 00000000 00111000 f6bda548

I'm on contribs 7.5.1 and it seems to be the first time it happens.
When I google the problems it seems to be a material RAM problem. What do you think ?

Thanks by advance,

David

Offline cactus

  • *
  • 4,880
  • +3/-0
    • http://www.snetram.nl
Re: kernel: Eeek! page_mapcount(page) went negative!
« Reply #1 on: April 22, 2012, 09:43:12 PM »
When I google the problems it seems to be a material RAM problem. What do you think ?
Most of the times Google is very accurate on this. I suggest you run a memory test like memtest86+, to confirm this.
Be careful whose advice you buy, but be patient with those who supply it. Advice is a form of nostalgia, dispensing it is a way of fishing the past from the disposal, wiping it off, painting over the ugly parts and recycling it for more than its worth ~ Baz Luhrmann - Everybody's Free (To Wear Sunscreen)

Offline dadou

  • ***
  • 47
  • +0/-0
Re: kernel: Eeek! page_mapcount(page) went negative!
« Reply #2 on: May 09, 2012, 09:30:22 AM »
Hello Thanks for the answer,

I'll test that.

I also see something I never took care of. At startup system give this message :


kernel: ********************************************************
kernel: * This system has more than 16 Gigabyte of memory.     *
kernel: * It is recommended that you read the release notes    *
kernel: * that accompany your copy of CentOS distribution      *
kernel: * about the recommended kernel for such configurations *
kernel: ********************************************************

I have 4x4096Mo Kingston DDR3 (so it's 16Go ?)

It seems that kernel has to be updated for better configuration about this. Is it necessary ?
If yes how do you suggest to update it ?

It seems to need kernel-hugemem does someone already installed it ?

Thanks by advance,

David

Offline CharlieBrady

  • *
  • 6,918
  • +3/-0
Re: kernel: Eeek! page_mapcount(page) went negative!
« Reply #3 on: May 09, 2012, 02:35:19 PM »
You don't need kernel-hugemem. I'm pretty sure that you don't need 16GB of RAM either.

You do need to test your memory.

Offline jameswilson

  • *
  • 795
  • +0/-0
    • Security Warehouse, professional security equipment
Re: kernel: Eeek! page_mapcount(page) went negative!
« Reply #4 on: May 10, 2012, 09:35:46 AM »
Re 16 gig of ram
I was planning a powerful sme and use virtual box. Would I be better with a modest sme and a seperate xen box for the vm's

I know performance would be better on xen but I love the stability and ease of use of sme

James

Offline Stefano

  • *
  • 10,894
  • +3/-0
Re: kernel: Eeek! page_mapcount(page) went negative!
« Reply #5 on: May 10, 2012, 09:38:39 AM »
Re 16 gig of ram
I was planning a powerful sme and use virtual box. Would I be better with a modest sme and a seperate xen box for the vm's

I know performance would be better on xen but I love the stability and ease of use of sme

James

on such an hw, I wolud install proxmox ve and virtualize all my servers (SME too)

Offline jameswilson

  • *
  • 795
  • +0/-0
    • Security Warehouse, professional security equipment
Re: kernel: Eeek! page_mapcount(page) went negative!
« Reply #6 on: May 10, 2012, 01:36:57 PM »
Ok ta

Offline CharlieBrady

  • *
  • 6,918
  • +3/-0
Re: kernel: Eeek! page_mapcount(page) went negative!
« Reply #7 on: May 10, 2012, 03:07:46 PM »
I was planning a powerful sme and use virtual box.

If you proceed with that, you should use SME8.