First, we need to perform the rotations needed to align the two coordinate frames. Then we need to perform a translation that will move the origin of our world space to the camera's origin. (Why did I rotate first and then translate?)