I have a large dataset that covers 5 countries. I plan to build a prediction model using this dataset. I would like to compute a ROC curve for each country, and then one overall ROC curve for all countries. To go about this, I plan to either:
Build 5 individual prediction models for each of the five countries by partitioning the data according to country. Then, compute 5 ROC curves from the 5 models. Next, using a meta analysis approach, form one summary ROC curve to represent all the countries.
Build 1 prediction model using the entire dataset, which includes a model variable corresponding to country. Compute 5 different ROC curves for each of the 5 countries by partitioning the test data i.e. only use test samples from country X to compute the country X ROC curve. Next, use the entire test dataset to compute the overall ROC curve for all the countries.
Please let me know which of the two above approaches would be optimal. Thank you.