1. Introduction
When working with file I/O operations in Java, FileOutputStream and FileChannel are the two common approaches for writing data to files. In this tutorial, we’ll explore their functionalities and understand their differences.
2. FileOutputStream
FileOutputStream is a part of the java.io package and is one of the simplest ways to write binary data to a file. It’s a good choice for straightforward write operations, especially with smaller files. Its simplicity makes it easy to use for basic file writing tasks.
Here’s a code snippet demonstrating how to write a byte array to a file using FileOutputStream:
byte[] data = "This is some data to write".getBytes();
try (FileOutputStream outputStream = new FileOutputStream("output.txt")) {
outputStream.write(data);
} catch (IOException e) {
// ...
}
In this example, we first create a byte array containing the data to write. Next, we initialize a FileOutputStream object, and specify the file name “output.txt“. The try-with-resources statement ensures automatic resource closing. The write() method of FileOutputStream writes the entire byte array “data” to the file.
3. FileChannel
FileChannel is a part of the java.nio.channels package and provides more advanced and flexible file I/O operations compared to FileOutputStream. It’s particularly well-suited for handling larger files, random access, and performance-critical applications. Its use of buffers allows for more efficient data transfer and manipulation.
Here’s a code snippet demonstrating how to write a byte array to a file using FileChannel:
byte[] data = "This is some data to write".getBytes();
ByteBuffer buffer = ByteBuffer.wrap(data);
try (FileChannel fileChannel = FileChannel.open(Path.of("output.txt"),
StandardOpenOption.WRITE, StandardOpenOption.CREATE)) {
fileChannel.write(buffer);
} catch (IOException e) {
// ...
}
In this example, we create a ByteBuffer and wrap the byte array data into it. Then we initialize a FileChannel object using the FileChannel.open() method. Next, we also specify the file name “output.txt” and the necessary open options (StandardOpenOption.WRITE and StandardOpenOption.CREATE).
The write() method of FileChannel then writes the content of the ByteBuffer to the specified file.
4. Data Access
In this section, let’s dive into the differences between FileOutputStream and FileChannel in the context of data access.
4.1. FileOutputStream
FileOutputStream writes data sequentially, meaning it writes bytes to a file in the order they’re given, from the beginning to the end. It doesn’t support jumping to specific positions within the file to read or write data.
Here’s an example of writing data sequentially using FileOutputStream:
byte[] data1 = "This is the first line.\n".getBytes();
byte[] data2 = "This is the second line.\n".getBytes();
try (FileOutputStream outputStream = new FileOutputStream("output.txt")) {
outputStream.write(data1);
outputStream.write(data2);
} catch (IOException e) {
// ...
}
In this code, “This is the first line.” will be written first, followed by “This is the second line.” on a new line in the “output.txt” file. We can’t write data in the middle of the file without rewriting everything from the beginning.
4.2. FileChannel
On the other hand, FileChannel allows us to read or write data at any position in the file. This is because FileChannel uses a file pointer that can be moved to any position in the file. This is achieved using the position() method, which sets the position within the file where the next read or write occurs.
The code snippet below demonstrates how FileChannel can write data to specific positions within the file:
ByteBuffer buffer1 = ByteBuffer.wrap(data1);
ByteBuffer buffer2 = ByteBuffer.wrap(data2);
try (FileChannel fileChannel = FileChannel.open(Path.of("output.txt"),
StandardOpenOption.WRITE, StandardOpenOption.CREATE)) {
fileChannel.write(buffer1);
fileChannel.position(10);
fileChannel.write(buffer2);
} catch (IOException e) {
// ...
}
In this example, data1 is written at the beginning of the file. Now, we want to insert data2 into the file starting from position 10. Therefore, we set the position to 10 using fileChannel.position(10), and then data2 is written starting at the 10th byte.
5. Concurrency and Thread Safety
In this section, we’ll explore how FileOutputStream and FileChannel handle concurrency and thread safety.
5.1. FileOutputStream
FileOutputStream doesn’t handle synchronization internally. If two threads are trying to write to the same FileOutputStream concurrently, the result can be unpredictable data interleaving in the output file. Therefore we need synchronization to ensure thread safety.
Here’s an example using FileOutputStream with external synchronization:
final Object lock = new Object();
void writeToFile(String fileName, byte[] data) {
synchronized (lock) {
try (FileOutputStream outputStream = new FileOutputStream(fileName, true)) {
outputStream.write(data);
log.info("Data written by " + Thread.currentThread().getName());
} catch (IOException e) {
// ...
}
}
}
In this example, we use a common lock object to synchronize the access to the file. When multi threads write data to the file sequentially, it ensures the thread safety:
Thread thread1 = new Thread(() -> writeToFile("output.txt", data1));
Thread thread2 = new Thread(() -> writeToFile("output.txt", data2));
thread1.start();
thread2.start();
5.2. FileChannel
In contrast, FileChannel supports file locking, allowing us to lock specific file sections to prevent other threads or processes from accessing that data simultaneously.
Here’s an example of using FileChannel with FileLock to handle concurrent access:
void writeToFileWithLock(String fileName, ByteBuffer buffer, int position) {
try (FileChannel fileChannel = FileChannel.open(Path.of(fileName), StandardOpenOption.WRITE, StandardOpenOption.CREATE)) {
// Acquire an exclusive lock on the file
try (FileLock lock = fileChannel.lock(position, buffer.remaining(), false)) {
fileChannel.position(position);
fileChannel.write(buffer);
log.info("Data written by " + Thread.currentThread().getName() + " at position " + position);
} catch (IOException e) {
// ...
}
} catch (IOException e) {
// ...
}
}
In this example, the FileLock object is used to ensure that the file section being written to is locked to prevent other threads from accessing it concurrently. When a thread calls writeToFileWithLock(), it first acquires a lock on the specific section of the file:
Thread thread1 = new Thread(() -> writeToFileWithLock("output.txt", buffer1, 0));
Thread thread2 = new Thread(() -> writeToFileWithLock("output.txt", buffer2, 20));
thread1.start();
thread2.start();
6. Performance
In this section, we’ll compare the performance of FileOutputStream and FileChannel using JMH. We’ll create a benchmark class that includes both FileOutputStream and FileChannel benchmarks to evaluate their performance in handling large files:
@Setup
public void setup() {
largeData = new byte[1000 * 1024 * 1024]; // 1 GB of data
Arrays.fill(largeData, (byte) 1);
}
@Benchmark
@BenchmarkMode(Mode.AverageTime)
@OutputTimeUnit(TimeUnit.MILLISECONDS)
public void testFileOutputStream() {
try (FileOutputStream outputStream = new FileOutputStream("largeOutputStream.txt")) {
outputStream.write(largeData);
} catch (IOException e) {
// ...
}
}
@Benchmark
@BenchmarkMode(Mode.AverageTime)
@OutputTimeUnit(TimeUnit.MILLISECONDS)
public void testFileChannel() {
ByteBuffer buffer = ByteBuffer.wrap(largeData);
try (FileChannel fileChannel = FileChannel.open(Path.of("largeFileChannel.txt"), StandardOpenOption.WRITE, StandardOpenOption.CREATE)) {
fileChannel.write(buffer);
} catch (IOException e) {
// ...
}
}
Let’s execute the benchmarks and compare the performance of FileOutputStream and FileChannel. The results show the average time taken for each operation in milliseconds:
Options opt = new OptionsBuilder()
.include(FileIOBenchmark.class.getSimpleName())
.forks(1)
.build();
new Runner(opt).run();
After running the benchmarks, we obtained the following results:
Benchmark Mode Cnt Score Error Units
FileIOBenchmark.testFileChannel avgt 5 431.414 ± 52.229 ms/op
FileIOBenchmark.testFileOutputStream avgt 5 556.102 ± 91.512 ms/op
FileOutputStream is designed for simplicity and ease of use. However, when dealing with large files with high-frequency I/O operations, it can introduce some overhead. This is because the FileOutputStream operations are blocking, which means each write operation must be completed before the next one starts.
On the other hand, FileChannel supports memory-mapped I/O, which can map a section of the file into memory. This enables data manipulation directly in memory space, resulting in faster transfer.
7. Conclusion
In this article, we’ve explored two file I/O methods: FileOutputStream and FileChannel. FileOutputStream offers simplicity and ease for basic file writing tasks, ideal for smaller files and sequential data writing.
On the other hand, FileChannel provides advanced features like direct buffer access for better performance with large files.
As always, the source code for the examples is available over on GitHub.