Problem-solving guide

This page summarizes the recurring operations from the documentation as tasks you should be able to perform from memory.

Core Python and files

Use variables to store values, then combine them with expressions.
Use a loop with an accumulator when a result must be built step by step.
Use with open(...) to read and write text files safely; the file is closed automatically when the block ends.
Know which file commands are Colab-specific: drive.mount, files.download, and files.upload.
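A minimal sketch of the accumulator and file-writing patterns above, using only the standard library. The list values and the file name total_example.txt are made up for illustration.

```python
import os
import tempfile

# Accumulator pattern: start from an initial value and update it on each pass.
values = [3, 5, 8]
total = 0
for value in values:
    total += value

# Write the result to a text file. The with block closes the file
# automatically, even if an error occurs inside it.
path = os.path.join(tempfile.gettempdir(), "total_example.txt")  # hypothetical file name
with open(path, "w", encoding="utf-8") as f:
    f.write(f"total = {total}\n")

# Read the file back to confirm what was written.
with open(path, encoding="utf-8") as f:
    content = f.read()

print(content)
```

This code runs the same way in Colab and on a local machine; only drive.mount, files.download, and files.upload depend on the Colab environment.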

Load with Pillow using Image.open(path), or with OpenCV using cv2.imread(path).
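Inspect .size and .mode on a Pillow image, and .shape on a NumPy array such as the one returned by cv2.imread.
Crop NumPy image arrays with slicing: image[y1:y2, x1:x2] (rows first, then columns). For a Pillow image, use image.crop((left, upper, right, lower)).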
Convert OpenCV BGR images to RGB before displaying with Matplotlib.
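The cropping and channel-order steps can be sketched on a synthetic array, so no image file or OpenCV install is needed. The array stands in for what cv2.imread would return; with OpenCV available, cv2.cvtColor(image, cv2.COLOR_BGR2RGB) is the usual conversion, and reversing the channel axis gives the same result for a 3-channel image.

```python
import numpy as np

# Synthetic 4x6 image in OpenCV layout: (height, width, channels), BGR order.
image_bgr = np.zeros((4, 6, 3), dtype=np.uint8)
image_bgr[..., 0] = 255  # channel 0 is blue in BGR, so this image is pure blue

print(image_bgr.shape)  # (height, width, channels)

# Crop rows y1:y2 and columns x1:x2. Rows (y) come first in array indexing.
y1, y2, x1, x2 = 1, 3, 2, 5  # illustrative coordinates
crop = image_bgr[y1:y2, x1:x2]

# Reverse the channel axis to go from BGR to RGB before plotting with Matplotlib.
image_rgb = image_bgr[..., ::-1]

print(crop.shape)
print(image_rgb[0, 0])
```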

For scikit-learn datasets, use dataset.data for features and dataset.target for labels.
Use dataset.target_names and dataset.feature_names to interpret the numbers.
Use NumPy slicing to select rows and columns: array[start:stop:step, columns].
Use np.unique(y, return_counts=True) to count classes.
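As an illustration of the dataset operations above, here is a short sketch on the Iris dataset bundled with scikit-learn. The choice of Iris and the particular slice are assumptions made for the example.

```python
import numpy as np
from sklearn.datasets import load_iris

iris = load_iris()
X, y = iris.data, iris.target  # features and integer class labels

print(X.shape, y.shape)
print(iris.feature_names)  # what each column means
print(iris.target_names)   # what each label number means

# Every second row among the first 10, keeping only the first two columns.
subset = X[0:10:2, :2]
print(subset.shape)

# Count how many samples belong to each class.
classes, counts = np.unique(y, return_counts=True)
for label, count in zip(classes, counts):
    print(iris.target_names[label], count)
```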

Classification predicts discrete class labels.
Train KNN with KNeighborsClassifier(n_neighbors=...).
Measure performance with accuracy, confusion matrix, precision, recall, F1-score, and classification_report.
Compare models by using the same train/test split.
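One way to put these steps together is sketched below: two KNN models are trained on the same split and scored with the same metrics. The dataset (Iris), the values of k, test_size, and random_state are all illustrative choices, not values from the documentation.

```python
from sklearn.datasets import load_iris
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)

# One split, reused for every model, so the comparison is fair.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y
)

accuracies = {}
for k in (1, 5):  # illustrative neighbor counts
    model = KNeighborsClassifier(n_neighbors=k)
    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)
    accuracies[k] = accuracy_score(y_test, y_pred)

# Detailed metrics for the last model trained (k=5).
cm = confusion_matrix(y_test, y_pred)
print(accuracies)
print(cm)
print(classification_report(y_test, y_pred))
```

Rows of the confusion matrix are true classes and columns are predicted classes, so off-diagonal entries are the misclassifications.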

Use Conv2D layers to learn local image patterns and MaxPooling2D to reduce spatial size.
Use one-hot labels with categorical_crossentropy (integer labels need sparse_categorical_crossentropy instead).
Use dropout and early stopping to reduce overfitting.
Resize external images to the model input shape before prediction.
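A minimal Keras sketch of the points above, assuming TensorFlow is installed. The layer sizes, dropout rate, and patience are arbitrary, and random arrays stand in for a real dataset, so the training result is meaningless; only the structure matters here.

```python
import numpy as np
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers

num_classes = 10  # assumed number of classes

model = keras.Sequential([
    keras.Input(shape=(32, 32, 3)),
    layers.Conv2D(32, (3, 3), activation="relu", padding="same"),  # local patterns
    layers.MaxPooling2D((2, 2)),                                   # 32x32 -> 16x16
    layers.Conv2D(64, (3, 3), activation="relu", padding="same"),
    layers.MaxPooling2D((2, 2)),                                   # 16x16 -> 8x8
    layers.Flatten(),
    layers.Dropout(0.5),                                           # reduces overfitting
    layers.Dense(num_classes, activation="softmax"),
])
model.compile(optimizer="adam", loss="categorical_crossentropy", metrics=["accuracy"])

# Stop when validation loss stops improving and keep the best weights.
early_stop = keras.callbacks.EarlyStopping(
    monitor="val_loss", patience=3, restore_best_weights=True
)

# Random stand-in data: images in [0, 1] and one-hot labels.
rng = np.random.default_rng(0)
x = rng.random((64, 32, 32, 3)).astype("float32")
y = keras.utils.to_categorical(rng.integers(0, num_classes, size=64), num_classes)

model.fit(x, y, validation_split=0.25, epochs=2, batch_size=16,
          callbacks=[early_stop], verbose=0)

# An "external" image with a different size must be resized before prediction.
external = rng.random((1, 50, 40, 3)).astype("float32")
resized = tf.image.resize(external, (32, 32)).numpy()
probs = model.predict(resized, verbose=0)
print(probs.shape)
```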

Train autoencoders with the input image as both input and target.
Use the bottleneck layer as the compressed representation.
For denoising, train on noisy inputs and clean targets.
Use transposed convolutions when decoding convolutional feature maps back into images.
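The difference between plain and denoising training is in the data passed to fit, which can be sketched with NumPy alone. Random arrays stand in for normalized images, and noise_factor is a hypothetical value; the model itself is left out, so the fit calls appear only as comments.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for a batch of normalized grayscale images with values in [0, 1].
x_clean = rng.random((8, 28, 28, 1)).astype("float32")

# Add Gaussian noise, then clip so the values stay in the valid [0, 1] range.
noise_factor = 0.3  # hypothetical noise strength
noise = rng.normal(size=x_clean.shape).astype("float32")
x_noisy = np.clip(x_clean + noise_factor * noise, 0.0, 1.0)

# Plain autoencoder:     autoencoder.fit(x_clean, x_clean)
# Denoising autoencoder: autoencoder.fit(x_noisy, x_clean)
print(x_noisy.min(), x_noisy.max())
```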

Resize and preprocess input images to match the pretrained model.
Use include_top=False when replacing the original classifier head.
Freeze pretrained layers with trainable = False when using the base model as a feature extractor.
Model zoos such as TensorFlow Hub, PyTorch Hub, and Ultralytics provide pretrained models that can be adapted instead of trained from scratch.
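The feature-extractor setup can be sketched as follows, assuming TensorFlow is installed. MobileNetV2, the 96x96 input size, and the 5-class head are illustrative choices. Note that weights=None is used here only so the sketch runs without downloading anything; in practice weights="imagenet" would load the pretrained weights.

```python
from tensorflow import keras
from tensorflow.keras import layers

input_shape = (96, 96, 3)  # assumed input size
num_classes = 5            # assumed number of target classes

# include_top=False drops the original ImageNet classifier head.
# weights=None keeps this sketch offline; use weights="imagenet" for real transfer learning.
base = keras.applications.MobileNetV2(
    input_shape=input_shape, include_top=False, weights=None
)
base.trainable = False  # freeze the base so it acts as a fixed feature extractor

inputs = keras.Input(shape=input_shape)
# MobileNetV2 expects pixels scaled to [-1, 1]; this assumes inputs in [0, 255].
x = layers.Rescaling(1.0 / 127.5, offset=-1.0)(inputs)
x = base(x, training=False)
x = layers.GlobalAveragePooling2D()(x)
outputs = layers.Dense(num_classes, activation="softmax")(x)  # new classifier head
model = keras.Model(inputs=inputs, outputs=outputs)

print(model.output_shape)
print(len(model.trainable_weights))  # only the new Dense layer's kernel and bias
```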

If CIFAR-100 appears instead of CIFAR-10, change the final softmax layer and one-hot encoding from 10 classes to 100 classes.
For CIFAR images, flattened input size is 32 * 32 * 3 = 3072; for MNIST it is 28 * 28 = 784.
For denoising autoencoders, train with noisy inputs and clean targets: autoencoder.fit(x_noisy, x_clean).
After adding noise to normalized images, use np.clip(x_noisy, 0., 1.).
If the task uses Input(...) and Model(inputs=..., outputs=...), it uses the Keras Functional API. The architecture idea is the same as with Sequential.
When splitting class labels, pass stratify=y to train_test_split while y still holds integer class labels, that is, before one-hot encoding.
If the task says padding, edge filling, or preserving spatial size in a convolution, use padding="same".
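Several of the numbers above can be checked with a few lines of NumPy. The helper conv_output_size is a hypothetical function written for this sketch; it follows the usual rules that "valid" padding gives floor((n - kernel) / stride) + 1 and "same" padding gives ceil(n / stride).

```python
import math

import numpy as np

# Flattened input sizes.
cifar_flat = 32 * 32 * 3  # 3072
mnist_flat = 28 * 28      # 784

# One-hot encoding for CIFAR-100: 100 columns instead of 10.
num_classes = 100
labels = np.array([0, 42, 99])  # illustrative integer labels
one_hot = np.eye(num_classes, dtype="float32")[labels]
print(one_hot.shape)


def conv_output_size(n, kernel, stride=1, padding="valid"):
    """Spatial output size of a convolution along one dimension (hypothetical helper)."""
    if padding == "same":
        return math.ceil(n / stride)
    return (n - kernel) // stride + 1


print(conv_output_size(32, 3, padding="valid"))  # shrinks the image
print(conv_output_size(32, 3, padding="same"))   # preserves the spatial size
```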