For reference, RDMA over Thunderbolt is a capability on macOS that enables single digit microsecond latency clustering.
This enables tensor parallelism which makes models run faster as you cluster more devices.
Alex Cheema (@alexocheema)
The new M5 Pro/Max MacBooks have 3 Thunderbolt 5 ports, enabling you to create RDMA clusters with up to 4 MacBooks.
The latency with RDMA over Thunderbolt is single digit microseconds, fast enough for tensor parallelism with close to linear scaling.
— https://nitter.net/alexocheema/status/2035873888903512187#m