Caffe2 is a Python-based, lightweight, modular, and scalable deep learning framework. Building on the original Caffe, Caffe2 is designed with expression, speed, and modularity in mind.
See the official Caffe2 GitHub page (https://github.com/caffe2/caffe2) for details.
This Caffe2-GPU-Distributed recipe shows how to run a distributed Caffe2 training job across multiple GPU nodes with Batch AI, using a single-node NFS file server.
If you have any problems or questions, you can reach the Batch AI team at [email protected] or you can create an issue on GitHub.
We also welcome your contributions of additional sample notebooks, scripts, or other examples of working with Batch AI.