Multiple Choice

When a model performs well on training data but poorly on the dev set due to distribution differences, what is one recommended strategy?