mce: [Hardware Error]: Machine check events logged


OCDKV

I keep getting these messages in my syslog. Everything seems to be working fine, but I've noticed some hiccups when streaming to my Xbox or TV, though no problems streaming to my iPad. I was blaming it on a pretty crappy router (Belkin N600 Play), but seeing these logs around the same time I'm trying to stream my shows makes me a bit paranoid (these errors also occur when nobody's home). I've run memtest for 18 hrs with 0 errors, and SMART tests on the drives show no errors, so is it the mobo or the CPU? Kinda at a loss here, as unRAID and Linux are still pretty new to me. Also, where are these machine check events logged?

 

Nov  6 04:03:39 Tower kernel: mdcmd (24): spindown 0

Nov  6 04:40:01 Tower su[16030]: Successful su for unraid-plex by root

Nov  6 04:40:01 Tower su[16030]: + ??? root:unraid-plex

Nov  6 04:41:00 Tower kernel: mdcmd (25): spindown 1

Nov  6 19:21:39 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  6 20:08:46 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  6 20:21:29 Tower in.telnetd[27531]: connect from 192.168.2.23 (192.168.2.23)

Nov  6 20:21:31 Tower login[27532]: ROOT LOGIN  on '/dev/pts/0' from '192.168.2.23'

Nov  6 20:41:29 Tower kernel: mce_notify_irq: 1 callbacks suppressed

Nov  6 20:41:29 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  6 20:46:47 Tower last message repeated 2 times

 

Nov  6 21:03:46 Tower emhttp: Restart SMB...

Nov  6 21:03:46 Tower emhttp: shcmd (59): killall -HUP smbd

Nov  6 21:03:46 Tower emhttp: shcmd (60): ps axc | grep -q rpc.mountd

Nov  6 21:03:46 Tower emhttp: _shcmd: shcmd (60): exit status: 1

Nov  6 21:03:46 Tower emhttp: shcmd (61): /usr/local/sbin/emhttp_event svcs_restarted

Nov  6 21:03:46 Tower emhttp_event: svcs_restarted

Nov  6 22:00:36 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  6 22:45:17 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  6 22:45:32 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  6 22:46:54 Tower kernel: mce_notify_irq: 2 callbacks suppressed

Nov  6 22:46:54 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  6 22:48:03 Tower last message repeated 2 times

Nov  6 22:53:39 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  6 22:55:36 Tower kernel: mce_notify_irq: 1 callbacks suppressed

Nov  6 22:55:36 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  6 22:57:23 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  6 22:58:40 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  6 22:59:20 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  6 23:00:56 Tower kernel: mdcmd (26): spindown 2

Nov  6 23:01:16 Tower kernel: mdcmd (27): spindown 0

Nov  6 23:10:00 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  6 23:15:41 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  6 23:34:30 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  7 00:35:17 Tower kernel: mdcmd (28): spindown 1

Nov  7 01:42:48 Tower kernel: mdcmd (29): spindown 2

 

Nov  7 04:21:20 Tower kernel: mdcmd (30): spindown 0

Nov  7 04:40:01 Tower su[23845]: Successful su for unraid-plex by root

Nov  7 04:40:01 Tower su[23845]: + ??? root:unraid-plex

Nov  7 04:43:00 Tower kernel: mdcmd (31): spindown 1

Nov  7 08:47:22 Tower kernel: mdcmd (32): spindown 1

Nov  7 09:12:35 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  7 11:31:24 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  7 12:22:43 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  7 12:25:23 Tower kernel: mdcmd (33): spindown 2

Nov  7 12:26:34 Tower kernel: mdcmd (34): spindown 1

Nov  7 15:32:18 Tower kernel: mce: [Hardware Error]: Machine check events logged

Nov  7 15:38:51 Tower kernel: mce: [Hardware Error]: Machine check events logged
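
A rough way to check whether the errors really cluster around streaming times is to tally them by hour. Below is a minimal Python sketch, assuming the classic syslog timestamp format seen in the lines above. The name `count_mce_by_hour` is made up for illustration, and a "last message repeated N times" line is credited to the hour in which that repeat notice was written, which is only an approximation.

```python
import re
from collections import Counter

MCE_TEXT = "Machine check events logged"
# Classic syslog prefix, e.g. "Nov  6 19:21:39 Tower ..."
STAMP = re.compile(r"^(\w{3})\s+(\d{1,2})\s+(\d{2}):\d{2}:\d{2}\s")
REPEAT = re.compile(r"last message repeated (\d+) times")


def count_mce_by_hour(lines):
    """Return a Counter mapping (month, day, hour) to the number of MCE notices."""
    counts = Counter()
    previous_was_mce = False
    for line in lines:
        match = STAMP.match(line)
        if not match:
            continue  # blank or non-syslog line: ignore it
        key = (match.group(1), int(match.group(2)), int(match.group(3)))
        if MCE_TEXT in line:
            counts[key] += 1
            previous_was_mce = True
            continue
        repeat = REPEAT.search(line)
        if repeat and previous_was_mce:
            # syslog collapsed N further copies of the preceding MCE notice
            counts[key] += int(repeat.group(1))
        else:
            previous_was_mce = False
    return counts


# A few lines copied from the log above, used as a small demonstration.
SAMPLE = [
    "Nov  6 19:21:39 Tower kernel: mce: [Hardware Error]: Machine check events logged",
    "Nov  6 20:21:29 Tower in.telnetd[27531]: connect from 192.168.2.23 (192.168.2.23)",
    "Nov  6 20:41:29 Tower kernel: mce_notify_irq: 1 callbacks suppressed",
    "Nov  6 20:41:29 Tower kernel: mce: [Hardware Error]: Machine check events logged",
    "Nov  6 20:46:47 Tower last message repeated 2 times",
]

for (month, day, hour), total in sorted(count_mce_by_hour(SAMPLE).items()):
    print(f"{month} {day:2d} {hour:02d}:00  {total}")
```

To run it against a saved syslog, pass the open file object to `count_mce_by_hour` instead of `SAMPLE`.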

 

Help!

 

Thank you

 

Link to comment

Spoke too soon :'(

 

Everything was fine yesterday, but I got up this morning to error lines galore.  I won't post the logs because they're all the same as in my original post.  I've upgraded my router as well, so no problems there.

 

Anybody else?

 

Thank you.

Link to comment


Are you sure the ntfs-3g driver is the right one?

 

Delete all ntfs-3g packages from /boot/extra and /boot/packages, then reboot to be sure the right driver is installed.

Link to comment

No luck :(

 

Within 1.5 hrs the errors started showing up again.  The server was not touched during this time.

 

Thanks for trying, I appreciate it greatly.  Any other thoughts?

 

I do have to say that I cannot see any performance issues so far; unRAID appears to be functioning great (I know appearances can be deceiving ;)).  I'm still concerned about the CPU & mobo, but again nothing 'appears' to be wrong.  I guess I'm just a bit paranoid.

 

Perhaps I should disable Sab, SB & CP and see what happens; however, I seem to recall these errors occurring before I added them, though not as persistently.  Trial and error... seems to be a theme since I started the whole unRAID project :-\  Thank god for this forum, I'd be lost without it and everybody's help.  Most of my 'issues' were solved with a simple search, but this one's being a pain in the ____ ;D.

 

Thanks

Link to comment

OK, you'll have to excuse my newness, but by 'Rename' do you mean to delete everything within the folder ???

Will this have any effect on the plugin files on my cache drive?

Is this much like doing a clean install?

I've backed up my flash drive just in case; should I back up my files & media on the server?  That may take a while :o

 

I'm willing to learn and try anything. At this point it's only costing me more time, but having a worry-free unRAID server will be worth it (not sure if the wife agrees!).  Plus, knowledge is power.

 

Thanks for your help dgaschk, really appreciate it.

 

Link to comment

See my sig to revert to a stock system and then test.

 

25 hrs into testing and no errors.  Fingers are crossed, but it appears to have been a plugin issue.  I removed all plugins and cleared my cache.  I have added unMenu back.  I will report back tomorrow, and if all's clear I'll start adding Plex, Sab, SB & CP one at a time and test for 24 hrs between additions.

 

Thanks again to the community :)

 

I will update as the testing goes.

 

Thank you.

Link to comment

>:( >:(>:(

 

OK, so I added unMenu yesterday with a few additions from the pkg manager (Email, Powerdown, bwm-ng, status alerts, monthly parity check and Modify "mover" to not invoke "sync").  All was fine, and today I find 1 solitary error right in the middle of the Mover.

 

Nov 15 03:00:01 SkyNet logger: mover started

Nov 15 03:00:01 SkyNet logger: skipping Apps/

Nov 15 03:00:01 SkyNet logger: moving Movies/

Nov 15 03:00:02 SkyNet logger: ./Movies/ELYSIUM_1080p_BLUEBIRD.mkv

Nov 15 03:00:02 SkyNet logger: >f+++++++++ Movies/ELYSIUM_1080p_BLUEBIRD.mkv

Nov 15 03:00:28 SkyNet kernel: mce: [Hardware Error]: Machine check events logged (Errors)

Nov 15 03:02:18 SkyNet logger: ./Movies/Red.2.2013.720p.WEB-DL.H264-PublicHD.mkv

Nov 15 03:02:18 SkyNet logger: >f+++++++++ Movies/Red.2.2013.720p.WEB-DL.H264-PublicHD.mkv

Nov 15 03:04:14 SkyNet logger: ./Movies/

Nov 15 03:04:14 SkyNet logger: mover finished

 

My morning email told me the server was OK, so I thought nothing of it till I checked the syslog.  I also noticed that since I upgraded my router I get a lot of these:

 

Nov 15 09:03:55 SkyNet dhcpcd[1137]: eth0: renewing lease of 192.168.1.155 (Network)

Nov 15 09:03:55 SkyNet dhcpcd[1137]: eth0: acknowledged 192.168.1.155 from 192.168.1.1 (Network)

Nov 15 09:03:55 SkyNet dhcpcd[1137]: eth0: leased 192.168.1.155 for 86400 seconds (Network)

 

This happens every few hours (not every 86400 seconds as it states).  I don't know if this is related or even a bad thing. Should I start a new thread?
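
On the lease messages: with default timers a DHCP client tries to renew at half the lease time (T1) and falls back to rebinding at 7/8 of it (T2), per RFC 2131, so a "renewing lease" line well before the 86400 seconds are up is expected. Here is a quick sketch of that arithmetic, assuming the router does not override the defaults (it can, via DHCP options 58 and 59). The name `dhcp_timers` is only illustrative.

```python
def dhcp_timers(lease_seconds):
    """Return the default (T1, T2) renewal and rebinding times in seconds.

    Assumes the RFC 2131 defaults: T1 = 0.5 * lease, T2 = 0.875 * lease.
    A DHCP server is free to send different values.
    """
    t1 = lease_seconds // 2        # client starts renewing with its server
    t2 = lease_seconds * 7 // 8    # client starts rebinding with any server
    return t1, t2


t1, t2 = dhcp_timers(86400)
print(f"renew after {t1 / 3600:.1f} h, rebind after {t2 / 3600:.1f} h")
```

For an 86400-second lease this gives a renewal after 12 hours and a rebind after 21 hours. Renewals that show up noticeably more often than that would point to something other than the default timers, though what exactly is not something the log lines above can tell us.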

 

I've attached a syslog just in case someone sees something else that might be important. 

 

I'm going to remove the Modify "mover" to not invoke "sync" pkg and see if that does anything.

 

And yes, I named my tower SkyNet. I saw it somewhere else and thought it was the perfect name, so I borrowed it. I hope you don't mind, whoever you are ;D ;D

 

Thank you all, but I'm really starting to question the CPU or mobo. I just don't have the coin or resources to change them out unless I dismantle my desktop and use its mobo & CPU, but I don't think the wife will be too happy about that!

syslog-2013-11-15.txt

Link to comment

CRAP!  I'm sure I read this when I first started noticing the errors.

 

Is there any way to use mcelog in unraid?

 

Any suggestions on how to obtain an mcelog with this hardware?
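
As a starting point, it may help to check whether the running kernel exposes the machine check interfaces at all. This is a minimal Python sketch, assuming the usual x86 Linux locations (`/dev/mcelog` for the event device and the `machinecheck` directory in sysfs). Whether a given unRAID build provides them, or ships the `mcelog` tool itself, is not something this thread confirms, and `find_mce_interfaces` is a made-up helper name.

```python
import os

# Paths assumed from the standard x86 Linux machine check documentation;
# they may be absent on a stripped-down kernel.
CANDIDATE_PATHS = (
    "/dev/mcelog",
    "/sys/devices/system/machinecheck/machinecheck0",
)


def find_mce_interfaces(root="/"):
    """Map each candidate path to True if it exists under the given root."""
    found = {}
    for path in CANDIDATE_PATHS:
        full_path = os.path.join(root, path.lstrip("/"))
        found[path] = os.path.exists(full_path)
    return found


for path, present in find_mce_interfaces().items():
    print(f"{path}: {'present' if present else 'missing'}")
```

If `/dev/mcelog` is present, the `mcelog` utility is what decodes the records read from it. If it is missing, booting another distribution on the same hardware is one way to get at the decoded events.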

 

I'm thinking of running Ubuntu off a flash drive and trying to stress the CPU somehow to provoke an error or two... does that sound plausible to those who have much more experience with this than I do?

 

Any input will help,

 

Thanks :'(

Link to comment

Thank you to those who tried to help, but I have decided that this subject was taking up too much time and effort to really be worth it.  I'm certain that my troubles and errors were a result of my inexperience, so I decided to dismantle my server and re-purpose the Haswell hardware to build my wife a new desktop computer.

 

I reclaimed the old desktop components, attached my HDDs, an old SSD and the flash drive, and 'rebuilt' my unRAID server using a 5-year-old mobo & CPU.  I've added all my plugins, as well as a few others, and have been testing all weekend, and she's been pretty much rock solid (a couple of hiccups, but I definitely know those were user error). I've had no problem streaming to all my devices (Roku, iPad, iPod, Xubuntu HTPC, laptop, cell phone, Xbox 360), not all at once of course ;)

 

I have no idea what caused these errors, and to be honest, right now I don't care.  Everything works and is stable, and that was the point of the server in the first place.

 

My "NEW" rig is in my sig, and if anyone is interested, my wife's new desktop computer is running perfectly under Windows 8.1.

 

Again, thank you to the community. I'm sure I'll call upon you again, I just hope not too soon ;D ??? ::)

Link to comment

Archived

This topic is now archived and is closed to further replies.
