I'm working on a binary classification task where the goal is to determine whether a tissue contains malignant cells Each instance in my dataset consists of
- a microscope image of the cell
- a small set of tabular metadata including an identifier of the imaging session and a binary feature indicating whether the cell was treated with fluorescent particles or not
I'm considering a hybrid neural network combining a CNN to extract features from the image and either a TabNet model or a fully connected MLP to process the tabular data My idea is to concatenate the features from both branches and pass them to a shared classification head My questions
- how should I handle the identifier? should I embed it or drop it completely
- are there alternative ways to model the tabular branch beyond MLP or TabNet especially with very few tabular features
- any best practices when combining CNN image embeddings with tabular data? Thanks in advance for any suggestions or shared experiences