I need to calculate class-weights to train my deep learning model.
In order to simulate real-world producing scenario as possible as I can, I have excluded the testing/infering dataset from which calculate the class-weights, because when my model is being used for producing purpose, I can NOT re-calculate class-weights.
So I think that the testing/infering dataset should NOT be taked into account when calculating class-weights.
My question is: Should a validating-set be excluded also?
The way I'm splitting fit/valid/test sets, is by date. For example, data between 2021~2023 act as fitting-set, 2024's data as validating-set. While the distributions of the classes are approximately equal, I'm wondering whether or not it make sense. Thanks!