Template Credit: Adapted from a template made available by Dr. Jason Brownlee of Machine Learning Mastery.

SUMMARY: This project aims to construct a predictive model using various machine learning algorithms and document the end-to-end steps using a template. The CycleGAN Apple vs. Orange dataset presents a binary classification problem in which we attempt to predict one of two possible outcomes: whether an image shows an apple or an orange.

INTRODUCTION: The CycleGAN dataset collection contains datasets that consist of images from two classes, A and B (for example, apple vs. orange, horses vs. zebras, and so on). The researchers used the images to train machine learning models for research work in the area of Generative Adversarial Networks.

In iteration Take1, we constructed and tuned machine learning models for this dataset using TensorFlow with a simple VGG-1 network. We also observed the best result that we could obtain using the test dataset.

In iteration Take2, we constructed and tuned machine learning models for this dataset using TensorFlow with a VGG-2 network. We also observed the best result that we could obtain using the test dataset.

This Take3 iteration will construct and tune machine learning models for this dataset using TensorFlow with a VGG-3 network. We will also observe the best result that we can obtain using the test dataset.
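
As a rough illustration of what a "VGG-N" network means in these iterations, the sketch below stacks N blocks, each a 3x3 convolution followed by 2x2 max pooling, and ends with a single sigmoid unit for the binary apple vs. orange output. The function names, the 256x256x3 input shape, the 128-unit dense layer, and the Adam optimizer are all assumptions made for illustration; they are not taken from the original notebooks.

```python
def vgg_block_specs(filters):
    """Return one (filters, kernel_size, pool_size) tuple per VGG-style block."""
    return [(f, (3, 3), (2, 2)) for f in filters]


def feature_map_sizes(input_size, num_blocks):
    """Spatial size after each block, assuming 'same' conv padding and 2x2 pooling."""
    sizes = []
    size = input_size
    for _ in range(num_blocks):
        size //= 2  # each 2x2 max pool halves the height and width
        sizes.append(size)
    return sizes


def build_vgg(filters, input_shape=(256, 256, 3)):
    """Build a small VGG-style binary classifier (requires TensorFlow).

    The input shape and the dense head are assumed values, not the
    settings used in the original project.
    """
    import tensorflow as tf  # imported lazily so the helpers above work without it

    model = tf.keras.Sequential()
    model.add(tf.keras.Input(shape=input_shape))
    for n_filters, kernel, pool in vgg_block_specs(filters):
        model.add(tf.keras.layers.Conv2D(n_filters, kernel,
                                         activation="relu", padding="same"))
        model.add(tf.keras.layers.MaxPooling2D(pool))
    model.add(tf.keras.layers.Flatten())
    model.add(tf.keras.layers.Dense(128, activation="relu"))
    model.add(tf.keras.layers.Dense(1, activation="sigmoid"))  # binary output
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model
```

Under these assumptions, `build_vgg([8, 16, 32])` would correspond to a three-block network with 8/16/32 filters, and `build_vgg([32, 64, 128])` to one with 32/64/128 filters.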

ANALYSIS: In iteration Take1, the baseline model (one layer with 16 convolutional filters) achieved an accuracy of 48.25% on the unseen test dataset after 15 epochs. After experimenting with different layer configurations, the best model (one layer with 32 convolutional filters) processed the test dataset with 91.44% accuracy.

In iteration Take2, the baseline model (two layers with 8/16 convolutional filters) achieved an accuracy of 92.22% on the unseen test dataset after 15 epochs. After experimenting with different layer configurations, the best model (two layers with 64/128 convolutional filters) processed the test dataset with 92.61% accuracy.

In this Take3 iteration, the baseline model (three layers with 8/16/32 convolutional filters) achieved an accuracy of 92.02% on the unseen test dataset after 15 epochs. After experimenting with different layer configurations, the best model (three layers with 32/64/128 convolutional filters) processed the test dataset with 93.97% accuracy.
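
To make the comparison across iterations easier to see, the short script below tabulates the test accuracies reported above and computes the gain from each baseline to its best configuration. The accuracy figures come from this write-up; the variable and function names are only illustrative.

```python
# Test-set accuracy (%) reported for each iteration: (baseline, best)
results = {
    "Take1": (48.25, 91.44),
    "Take2": (92.22, 92.61),
    "Take3": (92.02, 93.97),
}


def improvement(baseline, best):
    """Percentage-point gain from the baseline to the best configuration."""
    return round(best - baseline, 2)


def best_iteration(scores):
    """Name of the iteration whose best model has the highest accuracy."""
    return max(scores, key=lambda name: scores[name][1])


for name, (baseline, best) in results.items():
    gain = improvement(baseline, best)
    print(f"{name}: baseline {baseline:.2f}%, best {best:.2f}%, gain {gain:.2f} points")
print("Highest test accuracy:", best_iteration(results))
```

The gains work out to 43.19, 0.39, and 1.95 percentage points for Take1, Take2, and Take3 respectively, with Take3 giving the highest test accuracy overall.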

CONCLUSION: In this iteration, the best TensorFlow model reached 93.97% accuracy on the test dataset, the highest of the three iterations, which suggests that it is suitable for modeling this dataset. We should consider experimenting with TensorFlow for further modeling.

Dataset Used: CycleGAN Apple vs. Orange Dataset

Dataset ML Model: Binary classification with numerical attributes

Dataset Reference: https://people.eecs.berkeley.edu/%7Etaesung_park/CycleGAN/datasets/

One potential source of performance benchmarks: https://arxiv.org/abs/1703.10593 or https://junyanz.github.io/CycleGAN/

The HTML formatted report can be found here on GitHub.