The Visitor Design Pattern

As I have previously mentioned, writing software encompasses more than builing a working solution. It also often requires an architecture that is flexible to future required changes. Making changes to our source code is often difficult because of depencencies that interconnect different parts of our code in ways that is hard to keep track of. So often we discover small changes to one part of our code will break functionality elsewhere in the system. The visitor pattern helps us build new functionality without modifying existing code, satisfying the Open-Closed principle.

Let us consider a classic example of rendering some shapes using dynamic polymorphism:

C++

 1struct Point {
 2  double x;
 3  double y;
 4};
 5enum Shape {
 6  circle, 
 7  square
 8};
 9class Shape {
10  public:
11    Shape() = default;
12    virtual ~Shape() = default;
13    virtual void draw() const = 0;
14};
15
16class Circle: public Shape {
17  public:
18    explicit Circle(double radius) : radius_(radius) { };
19    double radius() const { return radius_; }
20    Point center() const {return center_; }
21    void draw() const override;
22  private:
23    double radius_;
24    Point center_{};
25};
26
27void Circle::draw() const {
28  // implement drawing
29}
30
31class Square: public Shape {
32  public:
33    explicit Square(double side) : Shape(square), side_(side) {}
34    double side() const { return side_; }
35    Point center const { return center_; }
36    void draw() const override;
37  private:
38    double side_;
39    Point center_{};
40};
41
42void Square::draw() const {
43  // implement drawing
44}
45
46void drawAllShapes(std::vector<std::unique_ptr<Shape>> const& shapes) {
47  for (auto const& shape : shapes) {
48    shape->draw();
49  }
50}

Compile, run, and marvel at your code because it works! You are now drawing shapes. The great part of this solution is just how easy it is to add other shapes. If you find out later that your software now needs to support Triangles, simply add a new Triangle class that inherits from Shape and ensure it overrides the virtual draw function. Everything seems great. But we do have a problem. We can no longer easily add new operations to this architecture. Say we find out at a later date that our shapes now needs to implement functionality to serialize themselves into Bytes. We would be forced to modify our base class with a new virtual function called Serialize(). This violates the Open-Closed principle since adding a new virtual function to our base class means the class is not closed. Furthermore, what if not all the derived classes need to support being serialized? Maybe we will never need to serialize the Triangle class? Unfortunately the Triangle class is forced to implement this operation despite its lack of a need which violates the Interface Segregation principle.

The problem was face is a classic problem with this type of object oriented programming. Adding new types is easy, but adding new operations is difficult.

This is where the Visitor design pattern comes in. It allows us to ease our troubles when it comes to adding new operations. As per usual, I find the best way to demonstrate something like the visitor design pattern is with an example. Lets try and tackle our Shape issue above by adding two new classes, an abstract ShapeVisitor class and a derived class meant to represent one operation, the draw operation we had above:

C++

 1class ShapeVisitor {
 2  public:
 3    virtual ~ShapeVisitor() = default;
 4    virtual void visit(Circle const&) const = 0; 
 5    virtual void visit(Square const&) const = 0; 
 6}
 7class Draw: public ShapeVisitor {
 8  public:
 9    void visit(Circle const& c) const override;
10    void visit(Square const& s) const override;
11}

With these in place, we can now re-implement our Shape classes:

C++

 1class Shape {
 2  public:
 3    Shape() = default;
 4    virtual ~Shape() = default;
 5    virtual void accept(ShapeVisitor const& v) = 0;
 6};
 7class Circle: public Shape {
 8  public:
 9    explicit Circle(double radius) : radius_(radius) { };
10    double radius() const { return radius_; }
11    Point center() const {return center_; }
12    void accept(ShapeVisitor const& v) const override;
13  private:
14    double radius_;
15    Point center_{};
16};
17void Circle::accept(ShapeVisitor const& v) const {
18  v.visit(*this);
19}
20class Square: public Shape {
21  public:
22    explicit Square(double side) : Shape(square), side_(side) {}
23    double side() const { return side_; }
24    Point center const { return center_; }
25    void accept(ShapeVisitor const& v) const override;
26  private:
27    double side_;
28    Point center_{};
29};
30void Square::accept(ShapeVisitor const& v) cons {
31  v.visit(*this);
32}

We have now removed the draw operation from the derived classes and instead added a virtual accept operation that takes a ShapeVisitor class. The ShapeVisitor class is itself a pure abstract class which means what exactly the visit operation will do within the accept operation is unknown to the Square and Circle class. This is good as it helps keep our dependencies seperated. We can now finish our draw operation:

C++

1void drawAllShapes(std::vector<std::unique_ptr<Shape>> const& shape) {
2  for (auto const& shape : shapes) {
3    shape->accept( Draw{} );
4  }
5}

The logic you added to your Draw class for any of the supported shape classes will be run. This is great because our Shape base class is now completely closed and operations are now added by extension. Remember the serialization operation we needed to add earlier? We can now support it without modifying the Shape class or any of its derived classes.

C++

 1class Serialize: public ShapeVisitor {
 2  public:
 3    void visit(Circle const& c) const override;
 4    void visit(Square const& s) const override;
 5};
 6void serializeAllShapes(std::vector<std::unique_ptr<Shape>> const& shape) {
 7  for (auto const& shape : shapes) {
 8    shape->accept( Serialize{} );
 9  }
10}

As great as this seems, it comes with some significant downsides. The first is that the visitor design pattern is susceptible to violating the DRY principle. Serializing Squares and Circles might be very similar, yet we are forced to implement a seperate function for each derived class we want to support. Secondly, adding new types has once again become more challenging. The very benefit of our original object oriented approach was the easy it provides for adding new derived classes of Shape. But now, when we go to add a new shape such as Triangle, we are required to update the entire ShapeVisitor heirarchy. We need to add a new virtual class to ShapeVisitor and modify all of its derived classes. Thirdly, and in my opinion most importantly, the visitor design pattern invokes a massive performance penalty on our software. It is slow. This is because we have a double dispatch operation where C++ needs to resolve two virtual operations (accept() and visit()). Dynamic polymorphism tends to produce a performance penalty upon our code, but the visitor pattern doubles down on this creating multiple layers of virtual functions that need to be resolved.

But fear not, for those willing to abandon their old ways of dynamic polymorphism and join us on the side of light and goodness, C++ supports a better way to build out the visitor pattern. It involves the use of std::variant.

The std::variant is a container that will store one object at a time, but is not limited to supporting only one kind of object. When defining a std::variant, you inform its template parameters of all the different objects it could potentially possess:

C++

1  std::variant<Circle, Square, Triangle> shape;

Then you can assign any instance of these objects into the variant

C++

1 Square s{10};
2 shape = s;
3 Square const s2 = std::get<Square>(shape);
4
5 Circle c{5};
6 shape = c;
7 Circle* const c2 = std::get_if<Circle>(shape);

As different objects get assigned to the variant, any previous object the variant may have had will be released, meaning only one object will ever be in the variant. The two ways shown above to retrieve the value from the variant are get and get_if. When calling get(), the variant will attempt to return the object being asked for as a reference, but if the variant is empty or has a different type of object, it will throw an std::bad_variant_exception. If instead you call get_if, it will always return a pointer, but if the variant is empty or has a different object type, it will return a nullptr. The last thing worth noting on variants is that they should be used with caution. Only use these with objects of similar memory footprint. That is because the variant will allocate for itself enough memory to store the largest object it built a template parameter for. If the variant needs to support int and Circle, each instance of the variant will have way more memory than is required to store the int which can have a negative impact on the performance of your code.

With all that out of the way, let me show you how to re-implement the visitor design patter in C++ using std::variant.

C++

 1class Circle {
 2  public:
 3    explicit Circle(double radius) : radius_(radius) { };
 4    double radius() const { return radius_; }
 5    Point center() const {return center_; }
 6  private:
 7    double radius_;
 8    Point center_{};
 9};
10
11class Square {
12  public:
13    explicit Square(double side) : Shape(square), side_(side) {}
14    double side() const { return side_; }
15    Point center const { return center_; }
16  private:
17    double side_;
18    Point center_{};
19};
20
21struct Draw {
22  void operator()(Circle const& c) const {
23    // handle draw logic for circle
24  }
25  void operator()(Square const& s) const {
26    // handle draw logic for square
27  }
28}

So that is largely it. First thing you will notice is that we have removed inheritance and all virtual functions. Our classes are now simpler and ready for better compiler optimization when the time comes. Secondly, We have defined our draw operation as well, and similar to before, we will be able to add other types of operations without modifying the Square or Circle class.

C++

 1using Shape = std::variant<Circle, Square>;
 2using Shapes = std::vector<Shape>;
 3
 4void drawAllShapes(Shapes const& shapes) {
 5  for (auto const& shape : shapes) {
 6    std::visit( Draw{}, shape);
 7  }
 8}
 9
10int main() {
11  Shapes shapes;
12  shapes.emplace_back( Circle{2.3});
13  shapes.emplace_back( Square{1.2});
14  shapes.emplace_back( Circle{4.1});
15
16  drawAllShapes(shapes);
17}

So a couple of things here. First you will notice we have defined Shape and Shapes to simplify our code and resemble some of the nice parts of the dynamic polymorphism we left behind. When we operate on the Shape, we are operating on a std::variant that will contain either a Circle or Square. Secondly, we are using std::visit which I have not yet gone over. Along with get() and get_if() operator supported by std::variant, we can also use this std::visit operator. This operator allows us to perform an operation on the stored value of the variant. Std::visit will check the draw object we pass in and see if it has an operator() that takes the type the variant currently has and execute that one. This approach requires no inheritance, virtual functions, and many of the other things that hinder performance, create dependencies, and increase complexity. We also no longer need to store all our derived classes as pointers but instead can default to value-based programming. Many of our pitfalls are gone. Adding a new operator to serialize is as simple as:

C++

 1struct Serialize {
 2  void operator()(Circle const& c) const {
 3    // handle draw logic for circle
 4  }
 5  void operator()(Square const& s) const {
 6    // handle draw logic for square
 7  }
 8}
 9void serializeAllShapes(Shapes const& shapes) {
10  for (auto const& shape : shapes) {
11    std::visit( Serialize{}, shape);
12  }
13}

Better than getting rid of the all the complexity inheritance and dynamic polymorphism brings with it is the nice improvement in performance. Consider this benchmark¹ that ran 25,000 operations (updating the center point) on 10,000 randomly created shapes.

GCC 11.1

Classic visitor design pattern: 1.6161s
Object-oriented solution: 1.5205s
std::variant solution: 1.1992s

The variant solution performed significantly better. But there is a further optimization we could accomplish at the sake of clean code. Instead of using std::visit, we could manually implement each operation using std::get_if():

C++

1void drawAllShapes(Shapes const& shapes) {
2  for (auto const& shape : shapes) {
3    if (Circle* circle = std::get_if<Circle>(&shape)) {
4      // Draw Circle
5    } else if (Square* square = std::get_if<Square>(&shape)) {
6      // Draw Square
7    }
8  }
9}

Using the same benchmark as above, this approach scored:

std::variant (with std::get_if()): 1.0252s

A nice additional improvement which highlights what is often the tradeoff between clean code and performant code. There is no perfect answer on which way to proceed. But it is always worth reiterating that software is a means to an end, not an end itself. Especially in C++, a difficult language usually choosen because we need superior performance, choosing run time performance over clean code is often the right choice.

So this is great right? Using the std::variant visitor design pattern, we have now made adding new operations to our object types much easier and SOLID. But we have a problem. This improvement was made at the expense of being able to add new types. We have come full circle. Again, if we need to support a new type like a Triangle, we must attempt to delve into every visitor operation() and add support for that type, which means these classes are no longer closed.

This is a tradeoff we will always have to make with the visitor pattern. If we do not anticipate more types will be added in the future, the visitor pattern is a great choice for helping us easily extend the functionality of the types we have.

p.131 Iglberger, Klaus. 2022. “C++ Software Design”. OREILLY. ↩︎