U-Net Image Segmentation with TensorFlow/Keras (Oxford-IIIT Pets)

/ TensorFlow tutorials, Unet

Last Updated on 08/10/2025 by Eran Feit

This tutorial provides a step-by-step guide on how to implement and train a U-Net Image Segmentation TensorFlow .

The tutorial is divided into four parts:

Part 1: Data Preprocessing and Preparation

In this part, you load and preprocess the persons dataset, including resizing images and masks, converting masks to binary format, and splitting the data into training, validation, and testing sets.

Part 2: U-Net Model Architecture

This part defines the U-Net model architecture using Keras. It includes building blocks for convolutional layers, constructing the encoder and decoder parts of the U-Net, and defining the final output layer.

Part 3: Model Training

Here, you load the preprocessed data and train the U-Net model. You compile the model, define training parameters like learning rate and batch size, and use callbacks for model checkpointing, learning rate reduction, and early stopping.

Part 4: Model Evaluation and Inference

The final part demonstrates how to load the trained model, perform inference on test data, and visualize the predicted segmentation masks.

Check out our tutorial here : https://www.youtube.com/watch?v=oHc4yrV64wU

Link for the full code here : https://ko-fi.com/s/a88e66f66b

Link for my blog : https://eranfeit.net/blog/

Here is the code for U-Net Image Segmentation :

(Part 1): Prepare the Oxford-IIIT Pet Dataset (images + trimap masks)

Introduction

In this section we set up the Oxford-IIIT Pet dataset for multi-class semantic segmentation.
Each mask encodes background, border, and pet as class IDs, which lets the model learn accurate edges around fur and whiskers.
We’ll make the structure reproducible, split the data (train/val/test), and save ready-to-load arrays so training is fast and deterministic.

Related setup guides: U-Net image segmentation (persons) · OpenCV contour-based segmentation · K-Means segmentation (OpenCV)

Detailed guide

Download & folders. Use the official Oxford-IIIT Pet images and trimaps, then organize them as:

data/   images/               # original images   masks/                # trimap masks   splits/               # txt/csv with train/val/test

Class mapping (very important). Masks are integers:

0 = background, 1 = border, 2 = pet.
Ensure you do not one-hot encode unless you also change the loss to match. For sparse labels, keep masks as single-channel int arrays.

Image size. For a balanced trade-off between detail and speed, resize to 256×256 or 320×320. Use consistent resizing for images and masks (nearest-neighbor for masks to preserve labels).

Splits. Create 80/10/10 (train/val/test) and save index files so results are reproducible.

Augmentations (optional, recommended). Mild flips, small rotations, and color jitter improve generalization; avoid heavy transforms that misalign masks.

Preview/QA. Visually verify a few samples (image, mask palette, overlay). If borders look broken, check that you used nearest-neighbor interpolation.

Persist to disk. Save X_train.npy, y_train.npy, etc., and note their shapes in the text so readers can compare with your setup.

Note. This tutorial focuses on Oxford-IIIT Pets (not a “persons” dataset). Keeping the dataset references accurate helps readers replicate your results.

Preparing Libraries and Dataset Parameters

We start by importing the necessary Python libraries and defining dataset parameters.
These steps set up the environment for reading and processing images and masks.

### Import pandas for reading the dataset annotation files import pandas as pd  ### Import OpenCV for image loading and resizing import cv2  ### Import NumPy for numerical operations and array handling import numpy as np  ### Define the target image height for resizing Height = 128  ### Define the target image width for resizing Width= 128  ### Define the number of categories in the mask (object, background, border) NumOfCategories = 3  ### The mask images contain three types of values: # Value = 1 indicates the main object (the animal) # Value = 2 indicates the background # Value = 3 indicates the border of the object

Link for the full code here : https://ko-fi.com/s/a88e66f66b

Initializing Data Structures and Loading Training Images

Next, we create arrays for storing training and testing data.
We then load the training images and their corresponding masks, resize them, normalize them, and save them into lists.

### Create lists for training images and masks allImages = [] maskImages = []  ### Create lists for test images and masks allTestImages = [] maskTestImages = []  ### Define dataset path path = "E:/Data-sets/Unet-Multi-class/"  ### Define the path to the training and testing annotation files trainFile = path + "annotations/trainval.txt" testFile = path + "annotations/test.txt"  ### Load the training annotations print("Load train data : ")  ### Read the training file which contains image names df = pd.read_csv(trainFile, sep=" ", header=None)  ### Extract the list of training file names names = df[0].values print ("Train data info :") print(len(names))  ### Loop over each training file name for name in names :     ### Build the path for the image file     imageFileName = path + "images/" + name + ".jpg"     print(imageFileName)      ### Load the image using OpenCV     img = cv2.imread(imageFileName , cv2.IMREAD_COLOR)      ### Resize the image to the defined width and height     img = cv2.resize(img, (Width,Height))      ### Normalize the image by dividing pixel values by 255     img = img / 255.0      ### Convert the image to float32 format     img = img.astype(np.float32)      ### Append the processed image to the list     allImages.append(img)      ### Build the path for the mask file     maskFileName = path + "annotations/trimaps/" + name + ".png"      ### Load the mask in grayscale mode     mask = cv2.imread(maskFileName , cv2.IMREAD_GRAYSCALE)      ### Resize the mask to the same size as the image     mask = cv2.resize(mask , (Width, Height))      ### Append the processed mask to the list     maskImages.append(mask)

Link for the full code here : https://ko-fi.com/s/a88e66f66b

Converting Training Data to NumPy Arrays and Analyzing Masks

After collecting all training images and masks, we convert them into NumPy arrays for easier handling.
We also explore mask values by resizing and replacing categories for better understanding.

### Convert training images list into a NumPy array allImagesNP = np.array(allImages)  ### Convert training masks list into a NumPy array maskImagesNP = np.array(maskImages)  ### Convert masks to integer type maskImagesNP = maskImagesNP.astype(int)  ### Print array details for images and masks print(allImagesNP.shape) print(allImagesNP.dtype)  print(maskImagesNP.shape) print(maskImagesNP.dtype)  ### Resize one mask to a smaller size for visualization x = cv2.resize(maskImagesNP[0], (16,16), interpolation=cv2.INTER_NEAREST) print(x)  ### Loop through each row in the reduced mask for i in range(len(x)):     ### Loop through each column in the row     for j in range(len(x[i])):         ### Get the pixel value         v = x[i][j]          ### Replace the values according to the rules         if v==1 : # the object             x[i][j] = 0         if v==2 : # the background             x[i][j] = 22         if v==3 : # the border             x[i][j] = 333  ### Print the updated mask values print(x)

Link for the full code here : https://ko-fi.com/s/a88e66f66b

Loading Test Data and Saving Processed Arrays

Finally, we repeat the same process for test data and save all preprocessed arrays into .npy files for future use in training deep learning models.

### Print message before loading test data print("load test data :")  ### Read the test annotation file df = pd.read_csv(testFile, sep=" ", header=None)  ### Extract test file names names = df[0].values print ("Test data info :") print(len(names))  ### Loop through each test image for name in names :     imageFileName = path + "images/" + name + ".jpg"     print(imageFileName)      ### Load and preprocess the image     img = cv2.imread(imageFileName , cv2.IMREAD_COLOR)     img = cv2.resize(img, (Width,Height))     img = img / 255.0     img = img.astype(np.float32)     allTestImages.append(img)      ### Load and preprocess the mask     maskFileName = path + "annotations/trimaps/" + name + ".png"     mask = cv2.imread(maskFileName , cv2.IMREAD_GRAYSCALE)     mask = cv2.resize(mask , (Width, Height))     maskTestImages.append(mask)  ### Convert test lists into NumPy arrays allTestImagesNP = np.array(allTestImages) maskTestImagesNP = np.array(maskTestImages) maskTestImagesNP = maskTestImagesNP.astype(int)  ### Print details of test arrays print(allTestImagesNP.shape) print(allTestImagesNP.dtype)  print(maskTestImagesNP.shape) print(maskTestImagesNP.dtype)  ### Save training and test arrays into .npy files print("Save the Data :") np.save("e:/temp/Unet-Animals-train-images.npy", allImagesNP) np.save("e:/temp/Unet-Animals-train-mask.npy", maskImagesNP) np.save("e:/temp/Unet-Animals-test-images.npy", allTestImagesNP) np.save("e:/temp/Unet-Animals-test-mask.npy", maskTestImagesNP) print("Finish save the data !")

Link for the full code here : https://ko-fi.com/s/a88e66f66b

(Part 2): Build a Clean U-Net in Keras (encoder–decoder with skip connections)

Save the following code parts as one file named : “Step02UnetModel.py” in the same folder

Introduction

U-Net is a lightweight, high-accuracy architecture for pixel-wise prediction.
The encoder captures context; the decoder restores resolution; and skip connections fuse fine spatial details back into the output.
Here we’ll keep the model simple, explain each design choice, and prepare it for multi-class training.

See more U-Net architectures on my blog: U-Net for colon polyp segmentation · U-Net for chest X-ray lung masks · U-Net for melanoma (skin)

Detailed guide

Blocks & filters. Start with 64 filters and double per down-sample (64→128→256→512). Use Conv(3×3)→BN→ReLU twice per block; down-sample with MaxPool(2).

Bottleneck. One or two double-conv blocks at the lowest resolution stabilize training without exploding parameters.

Decoder. Use UpSampling2D(2) (or transposed conv), concatenate the matching encoder feature map, then apply the same double-conv pattern.

Output layer. For 3 classes, set Conv2D(3, 1×1) and softmax activation.

Loss & metrics.

Use sparse_categorical_crossentropy if masks are single-channel integers.
Track MeanIoU with num_classes=3. Optionally add Dice as a second metric when borders are thin.

Optimizer & LR. Adam with lr=1e-3 is a good default; reduce on plateau during training.

Sanity check. Print model.summary() and mention parameter count in the text so readers know the model size and memory expectations.

Importing Required Libraries

We start by importing the essential TensorFlow Keras layers and the Model class.
These components allow us to construct the encoder, decoder, and final output layers of the U-Net.

### Import Input for defining the input layer of the model ### Import Conv2D for convolution operations ### Import BatchNormalization to normalize activations ### Import Activation for nonlinear transformations ### Import MaxPool2D for downsampling in the encoder ### Import UpSampling2D for upsampling in the decoder ### Import Concatenate for merging skip connections from tensorflow.keras.layers import Input, Conv2D, BatchNormalization , Activation, MaxPool2D, UpSampling2D, Concatenate  ### Import Model class to define the complete U-Net architecture from tensorflow.keras.models import Model

Link for the full code here : https://ko-fi.com/s/a88e66f66b

Defining the Convolutional Block

The convolutional block is the core element of U-Net.
It applies two convolutional layers with batch normalization and ReLU activation.
An optional max pooling layer reduces spatial dimensions when used in the encoder.

### Define a function for the convolutional block def conv_block(inputs, filters, pool=True):     ### Apply first convolutional layer with 3x3 filter     x = Conv2D(filters , 3 , padding="same")(inputs)      ### Apply batch normalization for stable training     x = BatchNormalization()(x)      ### Apply ReLU activation function     x = Activation("relu")(x)      ### Apply second convolutional layer with 3x3 filter     x= Conv2D(filters, 3, padding="same")(x)      ### Apply batch normalization     x = BatchNormalization()(x)      ### Apply ReLU activation     x= Activation("relu")(x)      ### If pooling is enabled, apply max pooling to reduce dimensions     if pool == True:         p = MaxPool2D((2,2))(x)         return x, p     else :         return x

Link for the full code here : https://ko-fi.com/s/a88e66f66b

Building the U-Net Architecture

Now we define the U-Net model by stacking encoder blocks, a bridge, and decoder blocks with skip connections.

### Define a function to build the U-Net model def build_unet(shape , num_classes):     ### Input layer with defined shape     inputs = Input(shape)      ### Encoder section: progressively downsample the input     x1 , p1 = conv_block(inputs, 16, pool=True)     x2 , p2 = conv_block(p1, 32, pool=True)     x3 , p3 = conv_block(p2 , 48 , pool=True)     x4 , p4 = conv_block(p3, 64, pool=True)      ### Bridge section: bottom of the U-Net without pooling     b1 = conv_block(p4 , 128 , pool=False)      ### Decoder section: upsample and concatenate with encoder features     u1 = UpSampling2D((2,2), interpolation="bilinear")(b1)     c1 = Concatenate()([u1, x4])     x5 = conv_block(c1, 64, pool=False)      u2 = UpSampling2D((2,2),interpolation="bilinear")(x5)     c2 = Concatenate()([u2, x3])     x6 = conv_block(c2,48,pool=False)      u3 = UpSampling2D((2,2),interpolation="bilinear")(x6)     c3 = Concatenate()([u3, x2])     x7 = conv_block(c3, 32 , pool=False)      u4 = UpSampling2D((2,2) ,interpolation="bilinear")(x7)     c4 = Concatenate()([u4, x1])     x8 = conv_block(c4 , 16 , pool=False)      ### Output layer with softmax for multi-class segmentation     output = Conv2D(num_classes,1, padding="same", activation="softmax")(x8)      ### Return the complete U-Net model     return Model(inputs, output)

Link for the full code here : https://ko-fi.com/s/a88e66f66b

Running the Model

Finally, we create an instance of the U-Net with an input size of 128×128 pixels and 3 output classes.
We then print the summary to visualize the architecture.

### Run the script only if executed directly if __name__ =="__main__":     ### Build the U-Net model with input shape (128,128,3) and 3 output classes     model = build_unet((128,128,3), 3)      ### Print the model summary to see the architecture     print(model.summary())

Link for the full code here : https://ko-fi.com/s/a88e66f66b

(Part 3): Train the U-Net (loss, callbacks, mIoU, curves)

Introduction

Now we train with solid defaults, safety callbacks, and metrics that matter for segmentation.
The goal is to reach stable validation mIoU and avoid overfitting by reacting to the validation signal rather than blindly increasing epochs.

Detailed guide

Reproducibility. Note your seed and exact library versions (TensorFlow, NumPy, OpenCV) so others can match your run.

Hyperparameters. Start with batch_size=8 (adjust to GPU RAM), epochs=50–80, input_size=256×256.

Callbacks.

ModelCheckpoint → save best weights by val_mean_io_u (or val_loss if you prefer).

ReduceLROnPlateau → factor 0.5, patience 4–6, min LR 1e-6.

EarlyStopping → patience 10–12 with restore_best_weights=True.

Class imbalance. If pets are small relative to background, try class weights or switch to a Dice-weighted loss. Document which setting you used.

What to report.

Final best epoch and the corresponding val mIoU.

Training time per epoch and total time (helpful for reader expectations).

Two small charts: loss vs. epoch and mIoU vs. epoch for train/val.

Quick error analysis. Mention typical failure modes (pet edges, tails, or dark fur on dark background) and what helped most (slightly larger input size; stronger augmentation on brightness/contrast; small LR decay).

Loading Data and Preparing Masks

The first step is to load the preprocessed NumPy arrays containing training images and masks.
Since segmentation masks contain class values, we convert them into categorical format so that they are ready for training a multi-class model.

### Import NumPy for handling arrays import numpy as np  ### Load preprocessed training images from .npy file allImagesNP = np.load("e:/temp/Unet-Animals-train-images.npy")  ### Load preprocessed training masks from .npy file maskImagesNP = np.load("e:/temp/Unet-Animals-train-mask.npy")  ### Print shapes of images and masks arrays print(allImagesNP.shape) print(maskImagesNP.shape)  ### Define input size and number of categories Weight = 128 Width = 128 numOfCategories = 3  ### Import utility to convert labels to categorical format from keras.utils import np_utils  ### Select first mask for testing the conversion test = maskImagesNP[0]  ### Convert values from range 1–3 to 0–2 test = test -1  ### Convert mask into categorical one-hot encoding test2 = np_utils.to_categorical(test, num_classes=numOfCategories)  ### Print the mask before and after conversion print(test) print(test2)  ### Apply conversion to all masks maskImagesNP = maskImagesNP - 1 maskForTheModel = np_utils.to_categorical(maskImagesNP , num_classes=numOfCategories)  ### Print type after conversion print("print the type after the convert :") print(maskForTheModel.dtype)  ### Convert mask array to integers maskForTheModel = maskForTheModel.astype(int) print(maskForTheModel.dtype)

Link for the full code here : https://ko-fi.com/s/a88e66f66b

Splitting Data into Training and Validation Sets

We now split the dataset into training and validation sets to evaluate model performance.

### Import train_test_split for dataset splitting from sklearn.model_selection import train_test_split  ### Split into training and validation sets (90% train, 10% validation) X_train, X_val , y_train , y_val = train_test_split(allImagesNP, maskForTheModel, test_size=0.1 , random_state=42)  ### Print the shapes of resulting arrays print("X_train , X_val , y_train , y_val --------->>>>  shapes :") print(X_train.shape) print(y_train.shape) print(X_val.shape) print(y_val.shape)

Link for the full code here : https://ko-fi.com/s/a88e66f66b

Building and Training the U-Net Model

Next, we load our U-Net architecture from the previous step, compile the model, and define callbacks to optimize training.

### Import TensorFlow import tensorflow as tf  ### Import the custom U-Net model definition from Step02UnetModel import build_unet  ### Import training callbacks from keras.callbacks import ModelCheckpoint, ReduceLROnPlateau, EarlyStopping  ### Define input shape and number of classes shape = (128,128,3) num_classes = 3  ### Set learning rate, batch size, and number of epochs lr = 1e-4 batch_size = 4 epochs = 10  ### Build U-Net model model = build_unet(shape , num_classes) print(model.summary())  ### Compile the model with categorical crossentropy and Adam optimizer model.compile(loss="categorical_crossentropy", optimizer = tf.keras.optimizers.Adam(lr), metrics=['accuracy'])  ### Define steps per epoch and validation steps stepsPerEpoch = np.ceil(len(X_train)/batch_size) validationSteps = np.ceil(len(X_val)/batch_size)  ### File path for saving the best model best_model_file="e:/temp/Animals-Unet.h5"  ### Define training callbacks callbacks = [     ModelCheckpoint(best_model_file, verbose=1, save_best_only=True),     ReduceLROnPlateau(monitor="val_loss", patience=3, factor=0.1, verbose=1, min_lr=1e-6),     EarlyStopping(monitor='val_loss',patience=5 , verbose=1) ]  ### Train the U-Net model history = model.fit(X_train, y_train,                     batch_size=batch_size,                     epochs=epochs,                     verbose=1,                     validation_data = (X_val, y_val),                     validation_steps = validationSteps,                     steps_per_epoch = stepsPerEpoch,                     shuffle=True,                     callbacks=callbacks)

Link for the full code here : https://ko-fi.com/s/a88e66f66b

Training examples with full loops & metrics: Melanoma U-Net: training + evaluation · Polyp U-Net: callbacks & validation

Visualizing Training Results

Finally, we plot the training and validation accuracy and loss to understand the learning progress of the model.

### Import Matplotlib for plotting graphs import matplotlib.pyplot as plt  ### Extract accuracy and loss from training history acc = history.history['accuracy'] val_acc = history.history['val_accuracy'] loss = history.history['loss'] val_loss = history.history['val_loss']  ### Define epoch range epochs = range(len(acc))  ### Plot training and validation accuracy plt.plot(epochs, acc , 'r', label="Train Accuracy") plt.plot(epochs, val_acc, 'b' , label="Validation Accuracy") plt.xlabel('Epoch') plt.ylabel('Accuracy') plt.title("Train and Validation Accuracy") plt.legend(loc='lower right') plt.show()  ### Plot training and validation loss plt.plot(epochs, loss , 'r', label="Train Loss") plt.plot(epochs, val_loss, 'b' , label="Validation Loss") plt.xlabel('Epoch') plt.ylabel('Loss') plt.title("Train and Validation Loss") plt.legend(loc='upper right') plt.show()

Link for the full code here : https://ko-fi.com/s/a88e66f66b

(Part 4): Inference & Evaluation on the Test Set (side-by-side visuals)

Introduction

Training is only half the story. In this section we quantify quality on the held-out test set and show results that are easy to judge: input, ground truth, prediction, and an overlay.
We also compute per-class IoU to reveal where the model struggles.

Detailed guide

Export. Suggest saving to SavedModel and (optionally) exporting to ONNX for deployment. Link to your own deployment article if you have one.

Load the best weights from Part 3 and run predictions on the test split.

Side-by-side figure. For each sample, display:

original image, 2) ground-truth mask, 3) predicted mask (argmax), 4) overlay.
Use the same color palette across the post and add alt text to every image (e.g., “U-Net predicted pet mask overlay”).

Metrics. Compute per-class IoU (background, border, pet) and mIoU across N test images. Present a small table right in the post and discuss which class is hardest and why.

Generalization check. Add one external pet photo (not from Oxford-IIIT), resize to your input size, and show the prediction. Note differences vs. test images.

Post-processing (optional). A light morphological close can fill small holes in the pet region. Explain when to use it and when it hides real errors.

Compare & go further: OpenCV contour baseline (fast) · OpenCV K-Means baseline · Inception-V3 birds: full classification pipeline

Loading the Trained U-Net Model and Test Data

We start by loading the saved U-Net model (.h5 file) and the preprocessed test images and masks.

### Import required libraries import numpy as np import tensorflow as tf import cv2  ### Define the path to the best trained model best_model_file="e:/temp/Animals-Unet.h5"  ### Load the trained U-Net model from file model = tf.keras.models.load_model(best_model_file)  ### Print the model summary to confirm successful loading print(model.summary())  ### Define image size and number of segmentation categories Height = 128 Width= 128 NumOfCategories = 3  ### Load preprocessed test images and masks from .npy files allTestImagesNP = np.load("e:/temp/Unet-Animals-test-images.npy") maskTestImagesNP = np.load("e:/temp/Unet-Animals-test-mask.npy")  ### Adjust mask values from range 1–3 to 0–2 maskTestImagesNP = maskTestImagesNP -1  ### Import utility for categorical encoding from keras.utils import np_utils  ### Convert test images to categorical format (for consistency) maskImagesForModel = np_utils.to_categorical(allTestImagesNP,num_classes=NumOfCategories)  ### Convert data type from float to integer maskImagesForModel = maskImagesForModel.astype(int)  ### Print shapes of image and mask arrays print("Shapes : ") print(allTestImagesNP.shape) print(maskTestImagesNP.shape)

Link for the full code here : https://ko-fi.com/s/a88e66f66b

Running Prediction on a Test Image

Next, we select one test image, prepare it for the model, and run a prediction.
The model outputs a probability mask for each pixel, which we then process.

### Select the 5th test image for prediction img = allTestImagesNP[4]  ### Add batch dimension before feeding image to the model imgForModel = np.expand_dims(img, axis=0)  ### Run prediction using the U-Net model p = model.predict(imgForModel) print(p)  ### Extract the predicted mask for the image resultMask = p[0]  ### Print mask shape to confirm 3 channels (3 categories) print(resultMask.shape)  ### Reduce the mask to a single-channel image by taking the argmax resultMask = np.argmax(resultMask, axis= -1)  ### Print shape after reduction print ("Result after aregmax axis -1 :") print(resultMask.shape)  ### Add an extra dimension back to the mask resultMask = np.expand_dims(resultMask , axis=-1) print("result after expand dims -1") print(resultMask.shape)  ### Scale values to 0–255 for visualization resultMask = resultMask * (255 / NumOfCategories)  ### Convert mask to unsigned integer type resultMask = resultMask.astype(np.uint8)  ### Resize mask to 16x16 for quick visualization x = cv2.resize(resultMask, (16,16), interpolation=cv2.INTER_NEAREST) print(x)

Link for the full code here : https://ko-fi.com/s/a88e66f66b

Visualizing the Predicted Mask

We now visualize the results alongside the original image.
We convert the mask into a displayable format and prepare it for further processing.

### Convert single-channel mask into 3-channel for display predictedMakImg = np.concatenate([resultMask, resultMask, resultMask], axis=2)  ### Display the original image and predicted mask cv2.imshow("original image ", img) cv2.imshow("Predicted mask ", predictedMakImg)  ### Convert predicted mask into grayscale gray = predictedMakImg.copy() gray = cv2.cvtColor(gray , cv2.COLOR_BGR2GRAY) print("Gray Shape", gray.shape)  ### Find unique values inside the grayscale mask unique_vals = np.unique(gray) print("Unique : ", unique_vals.shape)

Link for the full code here : https://ko-fi.com/s/a88e66f66b

Extracting the Object from the Image

Finally, we refine the predicted mask, convert categories into black and white, and apply it to the original image to extract only the object.

### Convert object and border values to white color gray[gray == 170] = 255 gray[gray == 0] = 255  ### Convert all other values to black gray[gray == 85] = 0  ### Display the refined grayscale mask cv2.imshow("Gray", gray)  ### Apply the mask to extract the object from the original image masked_img = cv2.bitwise_and(img, img, mask=gray)  ### Resize the masked image for better visualization masked_img = cv2.resize(masked_img, (256,256))  ### Show the masked image cv2.imshow("masked_img", masked_img)  ### Wait for key press to close windows cv2.waitKey(0)

Link for the full code here : https://ko-fi.com/s/a88e66f66b

Further reading on my site: All U-Net tutorials · Image segmentation articles · TensorFlow tutorials

Connect :

☕ Buy me a coffee — https://ko-fi.com/eranfeit

🖥️ Email : feitgemel@gmail.com

🌐 https://eranfeit.net

🤝 Fiverr : https://www.fiverr.com/s/mB3Pbb

Planning a trip and want ideas you can copy fast?
Here are three detailed guides from our travels:

• 5-Day Ireland Itinerary: Cliffs, Castles, Pubs & Wild Atlantic Views
https://eranfeit.net/unforgettable-trip-to-ireland-full-itinerary/

• My Kraków Travel Guide: Best Places to Eat, Stay & Explore
https://eranfeit.net/my-krakow-travel-guide-best-places-to-eat-stay-explore/

• Northern Greece: Athens, Meteora, Tzoumerka, Ioannina & Nafpaktos (7 Days)
https://eranfeit.net/my-amazing-trip-to-greece/

Each guide includes maps, practical tips, and family-friendly stops—so you can plan in minutes, not hours.

Enjoy,

Eran