Skip to content

Facilities to read and write FAT filesystems with Python

Notifications You must be signed in to change notification settings

cire831/FATtools

 
 

Repository files navigation

FATtools

This is a refinement and extension of my FATtools provided in PyDiskTools on GitHub.

Born to re-sort in an arbitrary order the directory entries in a FAT32 root table to cope with some hardware MP3 players' limits, it now provides full read/write support in Python 2.7 for FAT12/16/32 and exFAT filesystems, for hacking and recovering purposes.

Actually, following features are implemented (mostly in Python, with a few ctypes calls to handle Win32 disks natively; but it works in Linux, also):

  • sector aligned read/writes with both file images and real Win32 disks
  • sector based caching mechanism (for both reading and writing) to speed-up FAT and directory table operations
  • run length encoded map (with tuples and dictionaries) of free clusters, free directory slots, allocated cluster chains
  • transparent reading and writing of FAT12/16/32 and exFAT filesystems with FS boot-sector auto recognizer
  • Long File Name and Unicode support
  • tools to open, create, rename, list and delete files and directories
  • facilities to sort, clean and shrink directory tables
  • file fragmentation calculator
  • mkfat tool to properly apply a FAT12/16/32 or exFAT filesystem to a block device (file or disk) and let CHKDSK be happy with it (included exFAT compressed Up-Case table generator)

Obviously, since a filesystem is an extremely complex and delicate matter, and big bugs may lay around, you'll USE IT TOTALLY AT YOUR OWN RISK! But it seems quite stable and usable, now.

The most fragile area (and, thus, subject to bugs) was the caching mechanism, that operates in different ways:

  • intercepting small I/O (<= 512 bytes), which is cached in a small circular buffer. Bigger I/O bypasses the cache; when the cache is full, all dirty sectors are committed to disk and the cache buffer is zeroed. Sectors and buffers are paired with Python builtin dictionaries: this permits a good (from a Pythonic perspective) I/O speed during FAT and directory tables access;
  • mantaining a dictionary of pre-decoded FAT indexes, to improve the speed of repetitive access to cluster chains;
  • mantaining a dictionary of short and long names (paired with their respective directory slots) for each directory table, to speed up searches and updates in directory tables;
  • mantaining a RLE map of of free clusters, free directory slots and allocated cluster chains, to dramatically improve speed of allocation and file access.

Actually, the I/O speed is closer to system's one.

Sample usage:

# -*- coding: mbcs -*-
from Volume import *

# Assuming we have DirA, DirB, DirC in this disk order into X:
root = opendisk('X:', 'r+b')

new_order = '''DirB
DirC
DirA'''

root._sortby.fix = new_order.split('\n') # uses built-in directory sort algorithm
root.sort(root._sortby)

About

Facilities to read and write FAT filesystems with Python

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 98.8%
  • Other 1.2%