Are you using iSCSI to access your volumes ? Might worth it activating iSCSI digests on both sides and see how it behaves then, wouldnâ€™t it ? Youâ€™d probably lose some perfs but it would probably too help you identify the root cause of your problems I guessâ€¦
De : firstname.lastname@example.org [mailto:email@example.com] De la part de Stanislav German-Evtushenko
EnvoyÃ© : lundi 27 janvier 2014 13:51
Ã€ : Bram Matthys
Cc : drbd-user
Objet : Re: [DRBD-user] BUG: Uncatchable DRBD out-of-sync issue
On Mon, Jan 27, 2014 at 4:18 PM, Bram Matthys <firstname.lastname@example.org <mailto:email@example.com> > wrote:
-----BEGIN PGP SIGNED MESSAGE-----
Just jumping in, unaware of the history of this thread...
Stanislav German-Evtushenko wrote, on 27-1-2014 7:08: >
> On Thu, Apr 18, 2013 at 4:21 PM, Stanislav German-Evtushenko
> <firstname.lastname@example.org <mailto:email@example.com> <mailto:firstname.lastname@example.org <mailto:email@example.com> >> wrote:
> No choice so far :)
> I don't think this is a kernel bug. Anyway would be nice if sombody
> can investigate and fix or at least find work around. IDE is slow in
> compare to VIRTIO.
> On Thu, Apr 18, 2013 at 2:31 PM, Felix Frank <firstname.lastname@example.org <mailto:email@example.com>
> <mailto:firstname.lastname@example.org <mailto:email@example.com> >> wrote:
> > On 04/18/2013 12:20 PM, Stanislav German-Evtushenko wrote:
> >>> Note that your kernel (and hence kvm/virtio) can be considered
> rather old by now.
> >> This is a stable RHEL 6 kernel at the moment.
> > Exactly ;-)
> > Same for Debian 6, which I no longer consider fit for KVM setups
> > (without backports and such).
> I have replaced all hard-drives on the first server and upgraded DRBD kernel
> modules to 8.3.15. I do verifying every week. It usually founds new
> out-of-sync sectors, then I check if they are false-positive or not (with
> md5sum) and find that 95% of them are real.
> Could anybody suggest a way to debug? Can it be DRBD + RAID problem? Or DRBD
> + one specific RAID problem?
Have you figured out on which one of the servers the data is correct? And is
it always the same server? This assumes a primary/secondary setup.
If you know on which server the data is correct then you know - IF it's a
hardware problem - which server is at fault. If it's a software problem,
then you still can't tell.
Do you run a weekly/monthly RAID verification job? On both servers? Linux sw
raid has this, and presumably hw raid has this option as well.
This would pick up (most) RAID / disk issues.
Silent disk corruption on RAID arrays can occur and disk verification would
be the only way to tell (well, apart from using a filesystem like ZFS).
Software developer/IT consultant firstname.lastname@example.org <mailto:email@example.com>
Website: www.vulnscan.org <http://www.vulnscan.org
PGP key: www.vulnscan.org/pubkey.asc <http://www.vulnscan.org/pubkey.asc
PGP fp: EBCA 8977 FCA6 0AB0 6EDB 04A7 6E67 6D45 7FE1 99A6
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.17 (MingW32)
-----END PGP SIGNATURE-----
drbd-user mailing list
firstname.lastname@example.org <mailto:email@example.com> http://lists.linbit.com/mailman/listinfo/drbd-user > Have you figured out on which one of the servers the data is correct?
> And is it always the same server?
It depends on what server is writing. On the one which write it is always correct.
Servers are identical and firmwares are up to date. > Do you run a weekly/monthly RAID verification job? On both servers?
That is nice point to try. I've been thinking I'd tried everything already. > This would pick up (most) RAID / disk issues.
This is very unlikely, however I'll try to run RAID verification job on both and will come back with results.
Ce courrier Ã©lectronique ne contient aucun virus ou logiciel malveillant parce que la protection avast! Antivirus est active. http://www.avast.com