MIT_JOS_Lab5

This Lab is mainly the part of the file system in the JOS. It is mainly related to the disk related to file storage, and the form of file storage on the disk. Then there is the file system. The implementation method of the JOS file system is to realize the file through a special process. The basic operation, and then through IPC (inter-process communication) to achieve the operation of other processes on the file. Then read the file from the disk and create a process, this process is similar to fork.

File system preliminaries

Our file system is very different from the UNIX file system, does not support multi-user and multi-user permissions, Our file system also currently does not support hard links, symbolic links, time stamps, or special device files like most UNIX file systems do.

On-Disk File System Structure

In general operating systems, when a disk stores files, the disk has two regions, namely the data regions of the file, and the inode regions. For the inode, its size of it often fixed. Sometimes, our disk obviously has space, The reason why the file cannot be saved is that the inode area of the disk is full, and new data cannot be added. However, in JOS, we have simplified the file system, there is no inode stored on the disk, and the inode information of the file is directly stored in the directory above this file. We can find it by looking at the structure of the file Struct

struct File {
    char f_name[MAXNAMELEN];    // filename
    off_t f_size;           // file size in bytes
    uint32_t f_type;        // file type

    // Block pointers.
    // A block is allocated iff its value is != 0.
    uint32_t f_direct[NDIRECT]; // direct blocks
    uint32_t f_indirect;        // indirect block

    // Pad out to 256 bytes; must do arithmetic in case we're compiling fsformat on a 64-bit machine.
    uint8_t f_pad[256 - MAXNAMELEN - 8 - 4*NDIRECT - 4];
} __attribute__((packed));  // required only on some 64-bit machines

Sectors and Blocks

The file is stored on the disk, and the unit of reading the disk is a sector, usually the size of a sector is 512 Bytes. The first program we start OS with is to read a specific sector of the disk. But the unit of the file is block. In JOS, the block size is 4096 bytes, which is equal to the size of one page.

Superblocks

The file system is stored on the disk. For the file system, the root directory is special, and other files are found through the root directory, so on the disk, the block that stores the root directory is special. This block is usually called superblock, and in JOS, superblock is the second block in the disk, it's block[1], because block[0] is save the bootloader, some OS maybe have not only one superblocks.

File Meta-data

The layout of the meta-data describing a file in our file system is described by struct File in inc/fs.h. This meta-data includes the file's name, size, type (regular file or directory), and pointers to the blocks comprising the file. As mentioned above, we do not have inodes, so this meta-data is stored in a directory entry on disk. For file Meta-data, it's stored format in memory and disk both are struct File.

For the data block occupied by the file, we use two parts to represent, where f_direct[NDIRECT] is the direct index block, and f_indirect It is a first-level index block. The first-level index block's each entry points to a block. The content of this block stores the index of 1024 blocks.

Directories versus Regular Files

A File structure in our file system can represent either a regular file or a directory; these two types of "files" are distinguished by the type field in the File structure. The file system interprets the contents of a directory-file as a series of File structures describing the files and subdirectories within the directory.

In the JOS file system, superblock contains the file structure of the root directory. The contents of this directory-file is a sequence of File structures describing the files and directories located within the root directory of the file system.

The File System

This experiment does not implement a file system from scratch, the main part of the implementation is as follows,

Read file from disk
Write files from memory back to disk
Allocate disks, and manage disks
Through IPC (inter-process communication) to achieve the process of reading and writing files, and open excuses

The x86 processor uses the IOPL bits in the EFLAGS register to determine whether protected-mode code is allowed to perform special device I/O instructions such as the IN and OUT instructions. IO independent addressing is used in X86, So only the file system can access this special IO address space. In effect, the IOPL bits in the EFLAGS register provides the kernel with a simple "all-or-nothing" method of controlling whether user-mode code can access I/O space.

Exercise 1

Create a special IO process

if (type == ENV_TYPE_FS) {
    env->env_tf.tf_eflags |= FL_IOPL_MASK;
}

The Block Cache

It is incorrect to say that the virtual space and the virtual disk are not connected at all. For the file system, the file system process has its own separate system space, which is completely different from other Env system spaces, The division of this virtual address space is shown below, We can see that the key part is 0x10000000 (DISKMAP), where we start to map the disk file. the only thing the file system environment needs to do is to
implement file access, it is reasonable to reserve most of the file system environment's address space in this way.

Of course, if you can, we read the all contents of this disk into the address space, but this is stupid and impossible. Therefore, we have implemented a page fault mechanism to read the disk.

Exercise 2

Implement the bc_pgfault and flush_block functions in fs/bc.c. bc_pgfault is a page fault handler, just like the one your wrote in the previous lab for
copy-on-write fork, except that its job is to load pages in from the disk in response to a page fault. When writing this, keep in mind that (1) addr may not be aligned to a block boundary and (2) ide_read operates in sectors, not blocks.

  // LAB 5: you code here:
    addr = ROUNDDOWN(addr, PGSIZE);
    // alloc a page for load the block
    if ((r = sys_page_alloc(0, addr, PTE_W | PTE_U | PTE_P)) != 0) {
        panic("bc_pgfault: %e", r);
    } 
    // the unit read from disk is sector rather block 
    if ((r = ide_read(blockno * BLKSECTS, addr, BLKSECTS)) != 0) {
        panic("bc_pgfault: %e", r);
    }
    // Clear the dirty bit for the disk block page since we just read the
    // block from disk
    if ((r = sys_page_map(0, addr, 0, addr, uvpt[PGNUM(addr)] & PTE_SYSCALL)) < 0)
        panic("in bc_pgfault, sys_page_map: %e", r);

    // Check that the block we read was allocated. (exercise for
    // the reader: why do we do this *after* reading the block in?)
    if (bitmap && block_is_free(blockno))
        panic("reading free block %08x\n", blockno);

Write the content at address addr back to disk

void flush_block(void *addr)
{
    uint32_t blockno = ((uint32_t)addr - DISKMAP) / BLKSIZE;
    int r;
    // Determine whether the range of virtual addresses is correct
    if (addr < (void*)DISKMAP || addr >= (void*)(DISKMAP + DISKSIZE))
        panic("flush_block of bad va %08x", addr);

    // LAB 5: Your code here.
    addr = ROUNDDOWN(addr, PGSIZE);
    // If the virtual address is not mapped, it does not exist in physical memory
    if (!va_is_mapped(addr) || !va_is_dirty(addr)) {        
        return;
    }
    if ((r = ide_write(blockno * BLKSECTS, addr, BLKSECTS)) != 0) {
        panic("flush_block: %e", r);
    }
    // Clear the dirty bit for the disk block page since we have writed back the block
    if ((r = sys_page_map(0, addr, 0, addr, uvpt[PGNUM(addr)] & PTE_SYSCALL)) != 0) {
        panic("flush_block: %e", r);
    }
    // panic("flush_block not implemented");
}

The fs_init function in fs/fs.c is a prime example of how to use the block cache. After initializing the block cache, it simply stores pointers into the disk map region in the super global variable. After this point, we can simply read from the super structure as if they were in memory and our page fault handler will read them from disk as necessary.

The Block Bitmap

After fs_init sets the bitmap pointer, we can treat bitmap as a packed array of bits, one for each block on the disk. See, for example, block_is_free, which simply checks whether a given block is marked free in the bitmap.

Exercise 3

Use free_block as a model to implement alloc_block in fs/fs.c, which should find a free disk block in the bitmap, mark it used, and return the number of that block. When you allocate a block, you should immediately flush the changed bitmap block to disk with flush_block, to help file system consistency

int alloc_block(void)
{
    // The bitmap consists of one or more blocks.  A single bitmap block
    // contains the in-use bits for BLKBITSIZE blocks.  There are
    // super->s_nblocks blocks in the disk altogether.
    int i;
    // look for the fisrt free block and alloc it, after that, we should flush 
    // the block that save the bitmap[]
    for (i = 0; i < super->s_nblocks; i++) {
        if (block_is_free(i)) {
            bitmap[i / 32] &= ~(1 << (i % 32));
            flush_block(&bitmap[i / 32]);
            return i;
        }
    }
    // panic("alloc_block not implemented");
    return -E_NO_DISK;
}

File Operations

We have provided a variety of functions in fs/fs.c to implement the basic facilities you will need to interpret and manage File structures, scan and manage the entries of directory-files, and walk the file system from the root to resolve an absolute pathname. Since the files are stored on the disk, the main job of this part is to modify the files on the disk.

Exercise 4

Implement file_block_walk and file_get_block. file_block_walk maps from a block offset within a file to the pointer for that block in the struct File or the indirect block, very much like what pgdir_walk did for page tables.

Get the address of the No filebno data block of the file in the index of file f. It is different from the block's address, but the index's address.

static int file_block_walk(struct File *f, uint32_t filebno, uint32_t **ppdiskbno, bool alloc)
{
    if (filebno >= NDIRECT + NINDIRECT) {
        return -E_INVAL;
    }
    if (filebno < NDIRECT) {
        *ppdiskbno = &f->f_direct[filebno];
    }
    else {
        if (!f->f_indirect && !alloc) {
            return -E_NOT_FOUND;
        }
        if (!f->f_indirect && alloc) {
            uint32_t newbno;

            if ((newbno = alloc_block()) < 0) {
                return -E_NO_DISK;
            }
            f->f_indirect = newbno;
            memset(diskaddr(newbno), 0, BLKSIZE);
        }
        *ppdiskbno = &((uint32_t *)diskaddr(f->f_indirect))[filebno - NDIRECT];
    }
    // LAB 5: Your code here.
    return 0;
}

Here we need the parameter pdiskbno and the parameter blk, the pdiskbno pointer points to the index address of the file's f_direct[] or f_indirect[], and blk points to the virtual address of the block, so *pdiskbno is Block number.

int file_get_block(struct File *f, uint32_t filebno, char **blk)
{
    // LAB 5: Your code here.
    uint32_t *pdiskbno;
    int r;

    if ((r = file_block_walk(f, filebno, &pdiskbno, 1)) != 0) {
        return r;
    }

    if (!*pdiskbno) {
        uint32_t newbno;
        if ((newbno = alloc_block()) < 0) {
            return -E_NO_DISK;
        }
        *pdiskbno = newbno;
        memset(diskaddr(newbno), 0, BLKSIZE);
    }
    // *blk = (char *)pdiskbno;
    // the virtual address of block isn't the same as block's pointer
    *blk = diskaddr(*pdiskbno);
    return 0;
    // panic("file_get_block not implemented");
}

The file system interface

This part was originally a part that puzzled me very much. After I drew a flowchart of the whole process, I found that the process and the code are very clear. I use the following diagram to explain the file reading process, as well as some used interfaces.

MIT_JOS_Lab5

File system preliminaries

On-Disk File System Structure

Sectors and Blocks

Superblocks

File Meta-data

Directories versus Regular Files

The File System

Exercise 1

The Block Cache

Exercise 2

The Block Bitmap

Exercise 3

File Operations

Exercise 4

The file system interface

Exercise 5

Exercise 6

Spawning Processes

Exercise 7

Exercise 8

The keyboard interface

Exercise 9

The Shell

Exercise 10

MIT_JOS_Lab5

File system preliminaries

On-Disk File System Structure

Sectors and Blocks

Superblocks

File Meta-data

Directories versus Regular Files

The File System

Exercise 1

The Block Cache

Exercise 2

The Block Bitmap

Exercise 3

File Operations

Exercise 4

The file system interface

Exercise 5

Exercise 6

Spawning Processes

Exercise 7

Sharing library state across fork and spawn

Exercise 8

The keyboard interface

Exercise 9

The Shell

Exercise 10

相关推荐