YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

U-Net for Image Segmentation (Kvasir-SEG)

This model is a custom U-Net implementation designed for medical image segmentation, specifically targeting gastrointestinal polyp identification. It was developed as part of the Complements of Machine Learning course (Weekly Lab 4).

The model follows a symmetric encoder-decoder architecture with skip connections, inspired by Ronneberger et al. (2015).

Encoder: Four downsampling stages reducing spatial resolution to a 1024-channel bottleneck.
Decoder: Symmetric upsampling stages using learnable transposed convolutions to recover original image dimensions.
Enhancements: Inclusion of Batch Normalization for training stability and Same-Padding to maintain consistent spatial shapes without cropping.

Training Procedure

BCEWithLogitsLoss was chosen to optimize pixel-level binary classification and address the inherent class imbalance in medical imaging.

Hyperparameters:

Epochs: 20
Learning Rate: 1e-4
Batch Size: 8
Resolution: 256x256

The model was developed using PyTorch and the Hugging Face Trainer API, accelerated by a T4 GPU on Google Colab.

Downloads last month: -; Downloads are not tracked for this model. How to track

Safetensors

Model size

31M params

Tensor type

F32

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support