Interrupted write causes corruption #243

tabokie · 2022-07-14T03:57:25Z

At this line, we use write_all to append some bytes:

raft-engine/src/file_pipe_log/log_file.rs

Line 107 in ee0f6cf

self.writer.write_all(buf)?;

If this write is interrupted, we directly bubble its error. But some portion of the data might already be written. In this case, the self.written is inconsistent with underlying writer's internal offset. A fractured write will remain as a phantom record.

The text was updated successfully, but these errors were encountered:

LykxSassinator · 2022-07-15T02:26:17Z

Maybe we need a new func to support a safe write here? Just like:

index 3f7628c..5f37581 100644
--- a/src/env/mod.rs
+++ b/src/env/mod.rs
@@ -55,4 +55,8 @@ pub trait WriteExt {
     fn truncate(&mut self, offset: usize) -> Result<()>;
     fn sync(&mut self) -> Result<()>;
     fn allocate(&mut self, offset: usize, size: usize) -> Result<()>;
+    fn write_all_safely(
+        &mut self,
+        buf: &mut [u8],
+    ) -> ::std::result::Result<usize, (usize, std::io::Error)>;
 }

tabokie · 2022-07-15T02:46:00Z

No, simply reseek the writer if there's a failure. If that seek fails, panic.

LykxSassinator · 2022-07-15T03:43:26Z

Emm...I didn't get it.
reseek is just a op which resets the offset, and in our self-defined write, reseek and its followed operations in write is just like a redo op by continue triggered by Errno: EINTER.

https://github.com/LykxSassinator/raft-engine/blob/f3c268bb954f63f2809f84a940141db4419b2c44/src/env/default.rs#L116-L138

And I just wanna introduce a safe write by write_all_safely. If it failed, it would return the tuple, both containing the actual written size of bytes and the error details.

tabokie · 2022-07-15T03:54:05Z

The purpose here is to make sure subsequent writes can correctly overwrite the failed partial write, so that LogFileWriter::written is always consistent with LogFile::offset, no phantom data is inserted.

tabokie added bug Something isn't working good first issue Denotes an issue ready for a new contributor, according to the "help wanted" guidelines. help wanted Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines. labels Jul 14, 2022

This was referenced Jul 15, 2022

chore: update CI toolchain and clean up code #244

Merged

pipe: reseek after write failure #245

Merged

tabokie closed this as completed in #245 Jul 15, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Interrupted write causes corruption #243

Interrupted write causes corruption #243

tabokie commented Jul 14, 2022

LykxSassinator commented Jul 15, 2022

tabokie commented Jul 15, 2022

LykxSassinator commented Jul 15, 2022 •

edited

Loading

tabokie commented Jul 15, 2022

Interrupted write causes corruption #243

Interrupted write causes corruption #243

Comments

tabokie commented Jul 14, 2022

LykxSassinator commented Jul 15, 2022

tabokie commented Jul 15, 2022

LykxSassinator commented Jul 15, 2022 • edited Loading

tabokie commented Jul 15, 2022

LykxSassinator commented Jul 15, 2022 •

edited

Loading