This is a series posts on reverse engineering the internal structure of vision nets by looking at the weights and features such networks learn. We’ll particularly focus on the “Branch Specialization” post:
https://distill.pub/2020/circuits/branch-specialization/, which discusses how different network regions can specialize in different sorts of tasks.