Protocol Buffers (ProtoBuf) in Go
Buckle up, folks! This time we’re diving into the world of Protocol Buffers (protobuf) and their superpowers in data serialization.
Introduction
Protocol Buffers, also referred to as protobuf, is a language-agnostic binary serialization format that has been developed by Google. Its primary purpose is to efficiently serialize structured data for inter-system communication and data storage.
[Image source: Google]
Major benefits of protobuf :
- Compactness : Protobuf provides efficient serialization, resulting in smaller message sizes for improved bandwidth usage.
- Schema Evolution : Protobuf supports schema evolution without breaking compatibility, allowing seamless updates to data structures.
- Efficient Serialization and Deserialization : Protobuf offers fast and efficient serialization, improving overall system performance.
- Cross-Platform Support : Protobuf allows seamless data exchange across different platforms and languages.
These benefits make protobuf a powerful tool for efficient data communication and storage in Go applications.
How It Is Better Than JSON and XML:
[Image source: Google]
XML, known as Extensible Markup Language, is like a map that helps organize and structure data using tags. It presents information in a way that both humans and machines can understand. However, XML can be wordy and take up more space, which may slow down performance and make data transmission less efficient.
JSON, or JavaScript Object Notation, is like a messenger that uses a simple key-value structure to represent data objects. It has become popular for transmitting data between web services because it is easy to read and work with. But JSON’s text-based format can result in larger file sizes, which can affect the speed of data transfer.
In contrast, Protocol Buffers (protobuf) shine in the world of data serialization. It’s like a magic trick that transforms data into a compact and efficient binary format. Protobuf is known for its fast data processing and the ability to adapt to changing data structures without breaking compatibility. It can be used with different programming languages and ensures the reliability of your data.
In summary, XML and JSON have their uses, but if you need a powerful and efficient data serialization solution, Protocol Buffer (protobuf) is the way to go. It provides compactness, speed, flexibility, and compatibility, making it a top choice for handling data efficiently.
The Serialization Performance: Protocol Buffers vs. JSON in Golang
Enough Talk, Let’s Get Our Hands Dirty
[Image source: Google]
- Visit the official Protocol Buffers GitHub repository ( https://github.com/protocolbuffers/protobuf ) to download the compiler compatible with your operating system.
2. Define a protobuf message schema using the .proto file format. ```protocol buffer syntax = “proto3”; package main; option go_package = “/;msgmodel”;
message MyMessage { int32 id = 1; string name = 2; string email = 3; }
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
3\. Compile the file
> protoc — go\_out=\. \./\*proto
This command generates Go code bindings from the protobuf schema\. The `--go_out` flag specifies that the output should be in Go\. This will generate a msg\.pb\.go file, that contains the necessary code bindings for your protobuf schema\.
4\. Implement a benchmark test in Golang that serializes a large dataset using both protobuf and JSON:
```go
package main
import (
"encoding/json"
"github.com/golang/protobuf/proto"
"go-protobuf/model/message"
"log"
"testing"
)
const (
iteration = 10000000 //Number of iterations for the benchmark test
)
func generateDataset() []*message.MyMessage {
var dataset []*message.MyMessage
for i := 0; i < iteration; i++ {
data := &message.MyMessage{
Email: "johndoe@example.com",
Name: "John Doe",
Id: int32(i),
}
dataset = append(dataset, data)
}
return dataset
}
func BenchmarkProtobufSerialisation(b *testing.B) {
dataset := generateDataset()
b.ResetTimer()
for n := 0; n < b.N; n++ {
for _, data := range dataset {
_, err := proto.Marshal(data)
if err != nil {
log.Fatal(err)
}
}
}
}
func BenchmarkJSONSerialization(b *testing.B) {
dataset := generateDataset()
b.ResetTimer()
for n := 0; n < b.N; n++ {
for _, data := range dataset {
_, err := json.Marshal(data)
if err != nil {
log.Fatal(err)
}
}
}
}
func main() {
// Run the benchmark tests
testing.Benchmark(BenchmarkProtobufSerialisation)
testing.Benchmark(BenchmarkJSONSerialization)
}
5. Based on the benchmark results (shown below), it is evident that Protobuf outperforms JSON serialization in terms of speed. The protobuf serialization benchmark was completed in significantly less time compared to the JSON serialization benchmark.
Memory Performance Comparison: JSON vs. Protocol Buffers
- Implement a benchmark test in Golang that compares memory usage of a large dataset using both protobuf and JSON:
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
package main
import (
"encoding/json"
"github.com/golang/protobuf/proto"
"go-protobuf/model/message"
"log"
"runtime"
"runtime/debug"
"testing"
)
const (
iteration = 100000000 //Number of iterations for the benchmark test
)
func generateDataset() []*message.MyMessage {
var dataset []*message.MyMessage
for i := 0; i < iteration; i++ {
data := &message.MyMessage{
Email: "johndoe@example.com",
Name: "John Doe",
Id: int32(i),
}
dataset = append(dataset, data)
}
return dataset
}
func BenchmarkProtobufSerialisation(b *testing.B) {
dataset := generateDataset()
b.ResetTimer()
for n := 0; n < b.N; n++ {
for _, data := range dataset {
_, err := proto.Marshal(data)
if err != nil {
log.Fatal(err)
}
}
}
measureMemoryUsage(b)
}
func BenchmarkJSONSerialization(b *testing.B) {
dataset := generateDataset()
b.ResetTimer()
for n := 0; n < b.N; n++ {
for _, data := range dataset {
_, err := json.Marshal(data)
if err != nil {
log.Fatal(err)
}
}
}
measureMemoryUsage(b)
}
func measureMemoryUsage(b *testing.B) {
debug.FreeOSMemory()
var mem runtime.MemStats
runtime.GC()
runtime.ReadMemStats(&mem)
b.ReportMetric(float64(mem.Alloc)/1024/1024, "Memory_MB")
}
func main() {
// Run the benchmark tests
testing.Benchmark(BenchmarkProtobufSerialisation)
testing.Benchmark(BenchmarkJSONSerialization)
}
2. The benchmark results show that JSON serialization used more memory compared to Protobuf serialization. On average, JSON serialization consumed around 0.2052 MB of memory, while protobuf serialization used only about 0.2042 MB. Although the difference is small, it’s clear that protobuf is more efficient in terms of memory usage. This means that protobuf’s compact binary format helps save memory, making it a good choice for working with large datasets and improving performance.
Conclusion
It’s conclusion time now ! ! !
Protocol Buffers (protobuf) have demonstrated superior performance and memory efficiency compared to JSON serialization in Golang. With its compact binary format and efficient serialization mechanism, protobuf offers smaller message sizes, improved network efficiency, and reduced bandwidth usage. Additionally, its schema evolution capabilities allow for seamless updates to data models. While JSON has its strengths, protobuf excels in scenarios that demand high-speed and memory-efficient data serialization, enabling optimized data transmission and improved system performance.
[Image Source: Google]
The complete codebase for the examples discussed in this article is available on the Git repository .
Post converted from Medium by ZMediumToMarkdown.