Koozali.org: home of the SME Server

Kernel panic - bond0 problem

Offline bpivk

  • *
  • 908
  • +0/-0
    • http://www.bezigrad.com
Kernel panic - bond0 problem
« on: November 29, 2006, 11:13:43 PM »
Hy

I don't know what's wrong with my server but i get a kernel panic message every 6 days or so. I don't know if it could be because of internet traffic  or peerguardian runing on other computers.

Here is the error if someone can make something of it...

Nov 29 19:32:07 wegeland kernel: ------------[ cut here ]------------
Nov 29 19:32:07 wegeland kernel: kernel BUG at mm/rmap.c:479!
Nov 29 19:32:07 wegeland kernel: invalid operand: 0000 [#1]
Nov 29 19:32:07 wegeland kernel: SMP
Nov 29 19:32:07 wegeland kernel: Modules linked in: appletalk(U) 8139too ipt_ULOG ipt_REJECT ipt_MASQUERADE ipt_state ipt_TOS ip_nat_ftp ip_conntrack_ftp iptable_mangle iptable_nat ip_conntrack iptable_filter ip_tables button battery ac ohci_hcd ehci_hcd mii bonding(U) floppy dm_snapshot dm_zero dm_mirror ext3 jbd raid1 dm_mod
Nov 29 19:32:07 wegeland kernel: CPU:    0
Nov 29 19:32:07 wegeland kernel: EIP:    0060:[page_remove_rmap+35/74]    Not tainted VLI
Nov 29 19:32:07 wegeland kernel: EIP:    0060:[<c0152b91>]    Not tainted VLI
Nov 29 19:32:07 wegeland kernel: EFLAGS: 00010286   (2.6.9-42.0.2.ELsmp)
Nov 29 19:32:07 wegeland kernel: EIP is at page_remove_rmap+0x23/0x4a
Nov 29 19:32:07 wegeland kernel: eax: ffffffff   ebx: 000ffd80   ecx: c12041c0   edx: c10ffd80
Nov 29 19:32:07 wegeland kernel: esi: 00000000   edi: c9501248   ebp: c816abd8   esp: c3d97ec0
Nov 29 19:32:07 wegeland kernel: ds: 007b   es: 007b   ss: 0068
Nov 29 19:32:07 wegeland kernel: Process sysmon (pid: 3884, threadinfo=c3d97000 task=ca1fce30)
Nov 29 19:32:07 wegeland kernel: Stack: c014c830 07fec067 00000000 c10ffd80 0003c000 0933f000 c12041c0 c84ef680
Nov 29 19:32:07 wegeland kernel:        c84ef680 0933f000 093bf000 c9501250 c12041c0 c014c952 00080000 00000000
Nov 29 19:32:07 wegeland kernel:        0933f000 ce9562e8 093bf000 c12041c0 c014c9b1 00080000 00000000 c3d97f78
Nov 29 19:32:07 wegeland kernel: Call Trace:
Nov 29 19:32:07 wegeland kernel:  [zap_pte_range+640/841] zap_pte_range+0x280/0x349
Nov 29 19:32:07 wegeland kernel:  [<c014c830>] zap_pte_range+0x280/0x349
Nov 29 19:32:07 wegeland kernel:  [zap_pmd_range+89/124] zap_pmd_range+0x59/0x7c
Nov 29 19:32:07 wegeland kernel:  [<c014c952>] zap_pmd_range+0x59/0x7c
Nov 29 19:32:07 wegeland kernel:  [unmap_page_range+60/95] unmap_page_range+0x3c/0x5f
Nov 29 19:32:07 wegeland kernel:  [<c014c9b1>] unmap_page_range+0x3c/0x5f
Nov 29 19:32:07 wegeland kernel:  [unmap_vmas+241/517] unmap_vmas+0xf1/0x205
Nov 29 19:32:07 wegeland kernel:  [<c014cac5>] unmap_vmas+0xf1/0x205
Nov 29 19:32:07 wegeland kernel:  [exit_mmap+121/328] exit_mmap+0x79/0x148
Nov 29 19:32:07 wegeland kernel:  [<c0150ebb>] exit_mmap+0x79/0x148
Nov 29 19:32:07 wegeland kernel:  [mmput+78/114] mmput+0x4e/0x72
Nov 29 19:32:07 wegeland kernel:  [<c012079c>] mmput+0x4e/0x72
Nov 29 19:32:07 wegeland kernel:  [do_exit+527/1028] do_exit+0x20f/0x404
Nov 29 19:32:07 wegeland kernel:  [<c0124739>] do_exit+0x20f/0x404
Nov 29 19:32:07 wegeland kernel:  [sys_exit_group+0/13] sys_exit_group+0x0/0xd
Nov 29 19:32:07 wegeland kernel:  [<c0124a19>] sys_exit_group+0x0/0xd
Nov 29 19:32:07 wegeland kernel:  [syscall_call+7/11] syscall_call+0x7/0xb
Nov 29 19:32:07 wegeland kernel:  [<c02d47bf>] syscall_call+0x7/0xb
Nov 29 19:32:07 wegeland kernel:  [packet_rcv+394/775] packet_rcv+0x18a/0x307
Nov 29 19:32:07 wegeland kernel:  [<c02d007b>] packet_rcv+0x18a/0x307
Nov 29 19:32:07 wegeland kernel: Code: 3c c0 ff 42 10 51 9d c3 89 c2 8b 00 f6 c4 08 74 08 0f 0b dc 01 61 8c 2e c0 f0 83 42 08 ff 0f 98 c0 84 c0 74 2c 8b 42 08 40 79 08 <0f> 0b df 01 61 8c 2e c0 9c 59 fa b8 00 f0 ff ff 21 e0 8b 40 10
Nov 29 19:32:07 wegeland kernel:  <0>Fatal exception: panic in 5 seconds
Nov 29 19:32:07 wegeland kernel: bad: scheduling while atomic!
Nov 29 19:32:08 wegeland kernel:  [schedule+45/2267] schedule+0x2d/0x8db
Nov 29 19:32:08 wegeland kernel:  [<c02d1e71>] schedule+0x2d/0x8db
Nov 29 19:32:08 wegeland kernel:  [__mod_timer+257/267] __mod_timer+0x101/0x10b
Nov 29 19:32:08 wegeland kernel:  [<c0129e39>] __mod_timer+0x101/0x10b
Nov 29 19:32:08 wegeland kernel:  [poke_blanked_console+143/154] poke_blanked_console+0x8f/0x9a
Nov 29 19:32:08 wegeland kernel:  [<c020c52c>] poke_blanked_console+0x8f/0x9a
Nov 29 19:32:08 wegeland kernel:  [vt_console_print+660/677] vt_console_print+0x294/0x2a5
Nov 29 19:32:08 wegeland kernel:  [<c020b8cd>] vt_console_print+0x294/0x2a5
Nov 29 19:32:08 wegeland kernel:  [__mod_timer+257/267] __mod_timer+0x101/0x10b
Nov 29 19:32:08 wegeland kernel:  [<c0129e39>] __mod_timer+0x101/0x10b
Nov 29 19:32:08 wegeland kernel:  [schedule_timeout+313/340] schedule_timeout+0x139/0x154
Nov 29 19:32:08 wegeland kernel:  [<c02d2f8d>] schedule_timeout+0x139/0x154
Nov 29 19:32:08 wegeland kernel:  [process_timeout+0/5] process_timeout+0x0/0x5
Nov 29 19:32:08 wegeland kernel:  [<c012a6de>] process_timeout+0x0/0x5
Nov 29 19:32:08 wegeland kernel:  [printk+14/17] printk+0xe/0x11
Nov 29 19:32:08 wegeland kernel:  [<c01228ac>] printk+0xe/0x11
Nov 29 19:32:08 wegeland kernel:  [die+346/363] die+0x15a/0x16b
Nov 29 19:32:08 wegeland kernel:  [<c01060c2>] die+0x15a/0x16b
Nov 29 19:32:08 wegeland kernel:  [do_invalid_op+207/242] do_invalid_op+0xcf/0xf2
Nov 29 19:32:08 wegeland kernel:  [<c0106425>] do_invalid_op+0xcf/0xf2
Nov 29 19:32:08 wegeland kernel:  [page_remove_rmap+35/74] page_remove_rmap+0x23/0x4a
Nov 29 19:32:08 wegeland kernel:  [<c0152b91>] page_remove_rmap+0x23/0x4a
Nov 29 19:32:08 wegeland kernel:  [buffered_rmqueue+381/421] buffered_rmqueue+0x17d/0x1a5
Nov 29 19:32:08 wegeland kernel:  [<c0143fa4>] buffered_rmqueue+0x17d/0x1a5
Nov 29 19:32:08 wegeland kernel:  [do_IRQ+418/430] do_IRQ+0x1a2/0x1ae
Nov 29 19:32:08 wegeland kernel:  [<c0107ab4>] do_IRQ+0x1a2/0x1ae
Nov 29 19:32:08 wegeland kernel:  [free_pages_bulk+459/471] free_pages_bulk+0x1cb/0x1d7
Nov 29 19:32:08 wegeland kernel:  [<c014399c>] free_pages_bulk+0x1cb/0x1d7
Nov 29 19:32:08 wegeland kernel:  [do_invalid_op+0/242] do_invalid_op+0x0/0xf2
Nov 29 19:32:08 wegeland kernel:  [<c0106356>] do_invalid_op+0x0/0xf2
Nov 29 19:32:08 wegeland kernel:  [error_code+47/56] error_code+0x2f/0x38
Nov 29 19:32:08 wegeland kernel:  [<c02d52b7>] error_code+0x2f/0x38
Nov 29 19:32:08 wegeland kernel:  [page_remove_rmap+35/74] page_remove_rmap+0x23/0x4a
Nov 29 19:32:08 wegeland kernel:  [<c0152b91>] page_remove_rmap+0x23/0x4a
Nov 29 19:32:08 wegeland kernel:  [zap_pte_range+640/841] zap_pte_range+0x280/0x349
Nov 29 19:32:08 wegeland kernel:  [<c014c830>] zap_pte_range+0x280/0x349
Nov 29 19:32:08 wegeland kernel:  [zap_pmd_range+89/124] zap_pmd_range+0x59/0x7c
Nov 29 19:32:08 wegeland kernel:  [<c014c952>] zap_pmd_range+0x59/0x7c
Nov 29 19:32:08 wegeland kernel:  [unmap_page_range+60/95] unmap_page_range+0x3c/0x5f
Nov 29 19:32:08 wegeland kernel:  [<c014c9b1>] unmap_page_range+0x3c/0x5f
Nov 29 19:32:08 wegeland kernel:  [unmap_vmas+241/517] unmap_vmas+0xf1/0x205
Nov 29 19:32:08 wegeland kernel:  [<c014cac5>] unmap_vmas+0xf1/0x205
Nov 29 19:32:08 wegeland kernel:  [exit_mmap+121/328] exit_mmap+0x79/0x148
Nov 29 19:32:08 wegeland kernel:  [<c0150ebb>] exit_mmap+0x79/0x148
Nov 29 19:32:08 wegeland kernel:  [mmput+78/114] mmput+0x4e/0x72
Nov 29 19:32:08 wegeland kernel:  [<c012079c>] mmput+0x4e/0x72
Nov 29 19:32:08 wegeland kernel:  [do_exit+527/1028] do_exit+0x20f/0x404
Nov 29 19:32:08 wegeland kernel:  [<c0124739>] do_exit+0x20f/0x404
Nov 29 19:32:08 wegeland kernel:  [sys_exit_group+0/13] sys_exit_group+0x0/0xd
Nov 29 19:32:08 wegeland kernel:  [<c0124a19>] sys_exit_group+0x0/0xd
Nov 29 19:32:08 wegeland kernel:  [syscall_call+7/11] syscall_call+0x7/0xb
Nov 29 19:32:08 wegeland kernel:  [<c02d47bf>] syscall_call+0x7/0xb
Nov 29 19:32:08 wegeland kernel:  [packet_rcv+394/775] packet_rcv+0x18a/0x307
Nov 29 19:32:08 wegeland kernel:  [<c02d007b>] packet_rcv+0x18a/0x307
"It should just work" if it doesn't report it. Thanks!

Offline JonB

  • *
  • 351
  • +0/-0
Kernel panic - bond0 problem
« Reply #1 on: November 30, 2006, 12:01:23 AM »
Try replacing the RAM.

Jon
...

Offline bpivk

  • *
  • 908
  • +0/-0
    • http://www.bezigrad.com
Kernel panic - bond0 problem
« Reply #2 on: November 30, 2006, 12:37:12 AM »
I don't think that it's the ram acting up because i didn't have any problems on previous OS.

I googled a little and found out that it could be ram indeed.
So i'll change it and see what happens. The box is runing on 128mb  sdram so i think that it's about time to upgrade to 512ddr.  :D
"It should just work" if it doesn't report it. Thanks!

Offline brentonv

  • ****
  • 71
  • +0/-0
bonding?
« Reply #3 on: November 30, 2006, 03:25:44 AM »
hi there. i too have been having kernal panics ever since i started messing around with network bonding, using realtek 8139 nics. i am curious if you too are using identical nics as this is what was causing my kernal panic and in your log it mentions network bonding

Offline bpivk

  • *
  • 908
  • +0/-0
    • http://www.bezigrad.com
Kernel panic - bond0 problem
« Reply #4 on: November 30, 2006, 10:18:34 AM »
Yes i have two realtec nics just like you.

But i didn't do anything to bond them.

But i did notice that the sistem monitor contrib now shows bond info. But this just appeared. I didn't anything to activate it. The info is empty and it just shows an empty graph.

I read somwhere that this can be turned on if the server is in gateway mode but my server is in server and gateway mode.

Can i change this trough pannel od command line or do i change one of the nic's to stop this.
"It should just work" if it doesn't report it. Thanks!

Offline brentonv

  • ****
  • 71
  • +0/-0
Kernel panic - bond0 problem
« Reply #5 on: November 30, 2006, 11:07:23 PM »
yes. i did not request it to bond. it just happenes automatically. i read that we are supposed to get an option in the console to bond identical nics but so far i have not seen this option. today i am going to try two things.

1-install 2 identical intel nics,
2-reinstall sme twice, once with identical realtek (to see if this bonding option is only in the installer) and again with the intel nics.

i will let you know what happens

Offline bpivk

  • *
  • 908
  • +0/-0
    • http://www.bezigrad.com
Kernel panic - bond0 problem
« Reply #6 on: November 30, 2006, 11:21:51 PM »
Well fresh install isn't an option in my case so i'll have to find a way to remove this thing.

I'll wait for someone to tell me how to turn this off or i'll buy a different NIC and just swap one of them.
"It should just work" if it doesn't report it. Thanks!

Offline brentonv

  • ****
  • 71
  • +0/-0
Kernel panic - bond0 problem
« Reply #7 on: December 01, 2006, 02:40:14 AM »
dual realtek cards were the cause of my kernal panic woes. i swapped the nics for dual intels and everything was just fine. btw. i was just doing a clean install on a spare box for testing, and as it turns out the supposed option for nic bonding did not appear in the setup console on either the dual realteks or the dual intels. not sure why people talk about this option because i have never seen it. perhaps it was in a version 7 beta?. anyway my advice is just get rid of one of the realteks and everything will be fine. don't bother trying to remove the bond0. just change one of the cards.

Offline william_syd

  • *****
  • 1,608
  • +0/-0
  • Nothing to see here.
    • http://www.magicwilly.info
Kernel panic - bond0 problem
« Reply #8 on: December 01, 2006, 04:07:13 AM »
Quote from: "brentonv"
i read that we are supposed to get an option in the console to bond identical nics but so far i have not seen this option.




Available during install(configuration) and server-console.
Regards,
William

IF I give advise.. It's only if it was me....

Offline mrjhb3

  • *
  • 1,188
  • +0/-0
    • John Bennett Services
Kernel panic - bond0 problem
« Reply #9 on: December 01, 2006, 05:50:11 AM »
Quote from: "william_syd"
Quote from: "brentonv"
i read that we are supposed to get an option in the console to bond identical nics but so far i have not seen this option.


Available during install(configuration) and server-console.


I get the nic bonding option too,  but my servers are running in server-only mode.

John
......

Offline chris burnat

  • *****
  • 1,135
  • +2/-0
    • http://www.burnat.com
Kernel panic - bond0 problem
« Reply #10 on: December 01, 2006, 09:33:03 AM »
"dual realtek cards were the cause of my kernal panic woes. i swapped the nics for dual intels and everything was just fine. "

brentonv, would you be so kind as to put a bug report about this issue. Whilst it is not an SME issue per se, it would be good to document this in the FAQ or whaqtever for future reference.  This sort of information tends to get left behind if only in the forum...
Thanks.
chris
- chris
If it does not work out of the box, please fill in a Bug Report @ Bugzilla (http://bugs.contribs.org)  - check: http://wiki.contribs.org/Bugzilla_Help .  Thanks.

Offline william_syd

  • *****
  • 1,608
  • +0/-0
  • Nothing to see here.
    • http://www.magicwilly.info
Kernel panic - bond0 problem
« Reply #11 on: December 01, 2006, 10:12:32 AM »
I've also seen reports around the place that changing PCI slots may also help.
Regards,
William

IF I give advise.. It's only if it was me....

Offline bpivk

  • *
  • 908
  • +0/-0
    • http://www.bezigrad.com
Kernel panic - bond0 problem
« Reply #12 on: December 01, 2006, 02:24:47 PM »
I'll try to change the slots.
But i don't have an option to turn bond on or off. It just turned on and that's it. I didn't do anything. And because i don't have any options to turn it off i can't do that.

P.S.: My server is runing in server and gateway.
"It should just work" if it doesn't report it. Thanks!

Offline william_syd

  • *****
  • 1,608
  • +0/-0
  • Nothing to see here.
    • http://www.magicwilly.info
Kernel panic - bond0 problem
« Reply #13 on: December 01, 2006, 02:41:12 PM »
Quote from: "bpivk"
I'll try to change the slots.
But i don't have an option to turn bond on or off. It just turned on and that's it. I didn't do anything. And because i don't have any options to turn it off i can't do that.

P.S.: My server is runing in server and gateway.


How can you bond 2 nics in server/gateway mode ? One is LAN and the other is WAN.

In server only mode both are LAN.

Code: [Select]
[root@clean-server-only ~]# ifconfig
bond0     Link encap:Ethernet  HWaddr 00:0C:29:AE:51:F2
          inet addr:192.168.2.111  Bcast:192.168.2.255  Mask:255.255.255.0
          UP BROADCAST RUNNING ALLMULTI MASTER MULTICAST  MTU:1500  Metric:1
          RX packets:10280 errors:0 dropped:0 overruns:0 frame:0
          TX packets:4992 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0
          RX bytes:9832691 (9.3 MiB)  TX bytes:350492 (342.2 KiB)

eth0      Link encap:Ethernet  HWaddr 00:0C:29:AE:51:F2
          UP BROADCAST RUNNING SLAVE MULTICAST  MTU:1500  Metric:1
          RX packets:8921 errors:0 dropped:0 overruns:0 frame:0
          TX packets:5000 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:9652926 (9.2 MiB)  TX bytes:351804 (343.5 KiB)
          Interrupt:177 Base address:0x1400

eth1      Link encap:Ethernet  HWaddr 00:0C:29:AE:51:F2
          UP BROADCAST RUNNING SLAVE MULTICAST  MTU:1500  Metric:1
          RX packets:1367 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:180293 (176.0 KiB)  TX bytes:0 (0.0 b)
          Interrupt:185 Base address:0x1480

lo        Link encap:Local Loopback
          inet addr:127.0.0.1  Mask:255.0.0.0
          UP LOOPBACK RUNNING  MTU:16436  Metric:1
          RX packets:308 errors:0 dropped:0 overruns:0 frame:0
          TX packets:308 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0
          RX bytes:27916 (27.2 KiB)  TX bytes:27916 (27.2 KiB)

[root@clean-server-only ~]#
Regards,
William

IF I give advise.. It's only if it was me....

Offline bpivk

  • *
  • 908
  • +0/-0
    • http://www.bezigrad.com
Kernel panic - bond0 problem
« Reply #14 on: December 01, 2006, 03:09:58 PM »
Take a look. And again my server is in "server and gateway" mode and everything works as it should. Apart from kernell errors. :)

Code: [Select]

bond0     Link encap:Ethernet  HWaddr 00:00:00:00:00:00
          inet addr:1.1.1.1  Bcast:1.255.255.255  Mask:255.0.0.0
          UP BROADCAST RUNNING MASTER MULTICAST  MTU:1500  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0
          RX bytes:0 (0.0 b)  TX bytes:0 (0.0 b)

eth0      Link encap:Ethernet  HWaddr 00:50:BF:01:0D:52
          inet addr:192.168.0.1  Bcast:192.168.0.255  Mask:255.255.255.0
          UP BROADCAST RUNNING ALLMULTI MULTICAST  MTU:1500  Metric:1
          RX packets:6238578 errors:685 dropped:3592 overruns:352 frame:0
          TX packets:6462232 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:1848246777 (1.7 GiB)  TX bytes:3315664738 (3.0 GiB)
          Interrupt:5 Base address:0xdc00

eth1      Link encap:Ethernet  HWaddr 00:50:FC:3A:A5:F6
          inet addr:89.212.16.101  Bcast:89.xxx.xxx.xxx  Mask:255.255.0.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:6514401 errors:0 dropped:0 overruns:0 frame:0
          TX packets:6348565 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:3326750078 (3.0 GiB)  TX bytes:1884561109 (1.7 GiB)
          Interrupt:11 Base address:0xd800

lo        Link encap:Local Loopback
          inet addr:127.0.0.1  Mask:255.0.0.0
          UP LOOPBACK RUNNING  MTU:16436  Metric:1
          RX packets:20703 errors:0 dropped:0 overruns:0 frame:0
          TX packets:20703 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0
          RX bytes:4136414 (3.9 MiB)  TX bytes:4136414 (3.9 MiB)
"It should just work" if it doesn't report it. Thanks!