Nie jesteś zalogowany.
Jeśli nie posiadasz konta, zarejestruj je już teraz! Pozwoli Ci ono w pełni korzystać z naszego serwisu. Spamerom dziękujemy!
Prosimy o pomoc dla małej Julki — przekaż 1% podatku na Fundacji Dzieciom zdazyć z Pomocą.
Więcej informacji na dug.net.pl/pomagamy/.
Strony: 1
Witam,
Mam serwerek Dell PowerEdge 1950, w środku 2 dyski SAS 146GB..
Raida robiłem za pomocą PERC 6/i Integrated BIOS Configuration Utiity...
Ostatnio przy serwerku było słychać nierówną pracę dysków, i zaczełęm się zastanawiać, co może być przyczyną, bo to niezbyt fajne..
Jak sprawdzić, czy dyski SAS są ok? SATOWE dyski sprawdzałęm smartem, z pakietu smartmontools i w razie czego informował mnie mailem o nieprawidłowościach.. Jak to jest z dyskami SAS? Poniżej wynik smarta dla obu dysków:
(root@0151 ~)# smartctl -a -d megaraid,1 /dev/sda smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net Device: FUJITSU MAX3147RC Version: D207 Serial number: DQ03P7A0CRVR Device type: disk Transport protocol: SAS Local Time is: Fri Apr 6 08:38:59 2012 CEST Device supports SMART and is Enabled Temperature Warning Disabled or Not Supported SMART Health Status: OK Current Drive Temperature: 22 C Drive Trip Temperature: 65 C Manufactured in week 41 of year 2007 Specified cycle count over device lifetime: 10000 Accumulated start-stop cycles: 19 Elements in grown defect list: 0 Error counter log: Errors Corrected by Total Correction Gigabytes Total ECC rereads/ errors algorithm processed uncorrected fast | delayed rewrites corrected invocations [10^9 bytes] errors read: 0 1249 0 0 0 70291.224 0 write: 0 1 0 0 0 5941.395 0 verify: 0 341 0 0 0 27896.464 0 Non-medium error count: 13 SMART Self-test log Num Test Status segment LifeTime LBA_first_err [SK ASC ASQ] Description number (hours) # 1 Background long Completed - 0 - [- - -] # 2 Background short Completed - 0 - [- - -] Long (extended) Self Test duration: 1793 seconds [29.9 minutes] (root@0151 ~)# smartctl -a -d megaraid,0 /dev/sda smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net Device: FUJITSU MAX3147RC Version: D207 Serial number: DQ03P7A0CRNE Device type: disk Transport protocol: SAS Local Time is: Fri Apr 6 08:39:07 2012 CEST Device supports SMART and is Enabled Temperature Warning Disabled or Not Supported SMART Health Status: OK Current Drive Temperature: 18 C Drive Trip Temperature: 65 C Manufactured in week 41 of year 2007 Specified cycle count over device lifetime: 10000 Accumulated start-stop cycles: 19 Elements in grown defect list: 0 Error counter log: Errors Corrected by Total Correction Gigabytes Total ECC rereads/ errors algorithm processed uncorrected fast | delayed rewrites corrected invocations [10^9 bytes] errors read: 0 218 0 0 0 68421.273 0 write: 0 6 0 0 0 5795.461 0 verify: 0 59 0 0 0 27896.463 0 Non-medium error count: 11 SMART Self-test log Num Test Status segment LifeTime LBA_first_err [SK ASC ASQ] Description number (hours) # 1 Background short Completed - 27208 - [- - -] # 2 Background long Completed - 0 - [- - -] # 3 Background short Completed - 0 - [- - -] Long (extended) Self Test duration: 1793 seconds [29.9 minutes]
Jak sprawdzić, czy RAID mirroruje się dobrze (w moim przypadku to akurat RAID1) najlepiej bez restartu serwera?
Z góry dzięki za odp.
Ostatnio edytowany przez Grzeslaw (2012-04-06 09:03:46)
Offline
Grzeslaw
np tak:
smartctl -a -d megaraid,11 /dev/sdb smartctl --test=long -d megaraid,11 /dev/sdb
gdzie 11 to "Device Id:" brane z megacli
Odnośnie regularnego sprawdzania dysków w macierzy na megacli lepiej chyba korzystać z megacli -ldinfo -Lall -Aall -NoLog i sprawdzać wartość pola State, oprócz uszkodzenia dysków pokaże też rozjechaną macierz, a jeszcze lepiej skorzystać z gotowego pluginu do nagiosa.
Offline
Mhh.. Nagiosa nie używam, ale może zainstaluje.. Anyway.. chodzi mi bardziej o konfiguracje smartctl.confa by powiadamial mi mailowo jak jest nie halo..
Oto wynik tego co zaparoponwales.. Pierwsze to sam ci wkleilem. ale drugie:
# smartctl --test=long -d megaraid,1 /dev/sda smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net Long (extended) offline self test failed [Input/output error]
Offline
Strony: 1
Time (s) | Query |
---|---|
0.00010 | SET CHARSET latin2 |
0.00005 | SET NAMES latin2 |
0.00121 | SELECT u.*, g.*, o.logged FROM punbb_users AS u INNER JOIN punbb_groups AS g ON u.group_id=g.g_id LEFT JOIN punbb_online AS o ON o.ident='3.149.237.231' WHERE u.id=1 |
0.00076 | REPLACE INTO punbb_online (user_id, ident, logged) VALUES(1, '3.149.237.231', 1732216606) |
0.00057 | SELECT * FROM punbb_online WHERE logged<1732216306 |
0.00109 | DELETE FROM punbb_online WHERE ident='3.144.108.200' |
0.00084 | DELETE FROM punbb_online WHERE ident='3.144.8.68' |
0.00073 | DELETE FROM punbb_online WHERE ident='54.36.148.101' |
0.00087 | SELECT t.subject, t.closed, t.num_replies, t.sticky, f.id AS forum_id, f.forum_name, f.moderators, fp.post_replies, 0 FROM punbb_topics AS t INNER JOIN punbb_forums AS f ON f.id=t.forum_id LEFT JOIN punbb_forum_perms AS fp ON (fp.forum_id=f.id AND fp.group_id=3) WHERE (fp.read_forum IS NULL OR fp.read_forum=1) AND t.id=21030 AND t.moved_to IS NULL |
0.00005 | SELECT search_for, replace_with FROM punbb_censoring |
0.00147 | SELECT u.email, u.title, u.url, u.location, u.use_avatar, u.signature, u.email_setting, u.num_posts, u.registered, u.admin_note, p.id, p.poster AS username, p.poster_id, p.poster_ip, p.poster_email, p.message, p.hide_smilies, p.posted, p.edited, p.edited_by, g.g_id, g.g_user_title, o.user_id AS is_online FROM punbb_posts AS p INNER JOIN punbb_users AS u ON u.id=p.poster_id INNER JOIN punbb_groups AS g ON g.g_id=u.group_id LEFT JOIN punbb_online AS o ON (o.user_id=u.id AND o.user_id!=1 AND o.idle=0) WHERE p.topic_id=21030 ORDER BY p.id LIMIT 0,25 |
0.00071 | UPDATE punbb_topics SET num_views=num_views+1 WHERE id=21030 |
Total query time: 0.00845 s |